A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University...

30
A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University

Transcript of A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University...

Page 1: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

A demonstration of clustering in protein contact maps for α-helix pairs

Robert Fraser, University of Waterloo

Janice Glasgow, Queen’s University

Page 2: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

2

Hypothesis

Properties of the three-dimensional configuration of a pair of α-helices may be predicted from the contact map corresponding to the pair.

Page 3: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

3

What to expect

• The α-helix

• Protein structure prediction

• Packing of helices

• Prediction of packing

• Future work

Page 4: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

4

Where do proteins come from?

What we’re interested in

Image from http://www.accessexcellence.org/RC/VL/GG/images/protein_synthesis.gif

Page 5: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

5

Amino acid residues

• Two representations of the same α-helix

• Left helix shows residues only

• Right helix shows all backbone atoms

Page 6: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

6

Why α-helices?

• Protein Structure Prediction– The 3D structure of a

protein is primarily determined by groups of secondary structures

– The α-helix is the most common secondary structure

Page 7: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

7

Protein Structure Prediction

• is hard.

• There are many different approaches• Crystallization techniques are time-consuming• Tryptych

– Case based-reasoning approach to prediction– Uses different experts as advisors– Works from a contact map

Page 8: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

8

The Goal

• 2D representation 3D• Toy example:

A C

BC 4 5 0

B 3 0 5

A 0 3 4

A B C

Page 9: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

9

The Goal

If we apply a threshold of 4.5 units, can we still determine the shape?

C 4 5 0

B 3 0 5

A 0 3 4

A B C

A C

B

C T F T

B T T F

A T T T

A B C

?

Page 10: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

10

Protein Structure Prediction• Build (or predict) contact map• Predict the three-dimensional structure from the

contact map• N×N matrix, N= # of amino acids in the protein• C(i,j) = true if the distance from amino acid i to j is

less than a threshold (ie. 7Å)

?

Page 11: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

11

Refining a contact map

Contact map for entire protein

Contact map for pair of α-helices

Interface region of contact map

Page 12: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

12

Hypothesis (again)

Properties of the three-dimensional configuration of a pair of α-helices may be predicted from the contact map corresponding to the pair.

Page 13: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

13

Packing of helices

• A nice simple boolean property

• Found by rotating both helices to be axis-aligned, then comparing coordinates

packing non-packing

90°not 90°

Page 14: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

14

Prediction of packing

Need a map comparison method

Page 15: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

15

Comparing maps

• Comparing two maps is not straightforward

• Reflections are insignificant

Page 16: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

16

Comparing maps

• Translations are (generally) insignificant

Page 17: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

17

Prediction of packing

• Classify the maps based on contact locations

central corneredge

Page 18: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

18

Packing classes

Page 19: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

19

Packing data by class

• No clustering to be done on the central and edge classes

• Reflections of each corner map were used to build the data set

Page 20: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

20

Clustering Corner Maps

• Each corner map is transformed to a 15×15 map• A map corresponds to a 225 character binary string• Distance can measure using cosine

Page 21: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

21

Clustering Results

Page 22: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

22

Nearest Neighbour

• Closest neighbour by cosine distance

• Of the 862 contact maps in the data set, only 9 had a packing value different from the nearest neighbour

• Almost 99% accuracy!

Page 23: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

23

Contributions

• Prediction of helix pair properties– 1st to establish that contact maps cluster– Developed a novel contact map classification

scheme for helix pairs– 1st to demonstrate that helix pair properties

may be predicted from contact maps

Page 24: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

24

Future Work

• Implement the packing advisor (done)

• Explore other properties of α-helices that could be predicted from contact maps– Interhelical angle– Interhelical distance

Page 25: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

25

Thanks

• Questions?

Supported by

NSERC, PRECARN IRIS, Queen’s

Page 26: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

26

Clustering Results

Non-packing Mixed

Packing

Page 27: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

27

Levels of Structure

Primary

Secondary

Page 28: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

28

α-helix properties

Page 29: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

29

Tryptych

Page 30: A demonstration of clustering in protein contact maps for α-helix pairs Robert Fraser, University of Waterloo Janice Glasgow, Queen’s University.

30

Chothia packing model