Protein Structure

Protein: Structure and Function

Books:

1. Lehninger Principle of Biochemistry ( by Nelson and Cox)

Amino acid

Properties:1. -carbon is bonded to four different groups except glycine2. -carbon is a chiral center3. Two possible stereoisomers (called enantiomers)4. Optically active (except glycine): rotate plan polarized light5. Can act as acids and bases

Amino acids in the human body

Essential amino acid

Non-essential amino acid

1. Leucine2. Isoleucine3. Valine4. Histidine5. Lysine

6. Methionine7. Phenylalanine8. Threonine9. Tryptophan

1. Alanine2. Arginine3. Asparagine4. Aspartic acid5. Cysteine6. Glutamic acid7. Glutamine

8. Glycine9. Proline10. Serine11. Tyrosine

Obtained from nutrition

Synthesized by the body

Non-polar, aliphatic R groups

Gly/G Ala/A Pro/P Val/V

Leu/L Ile/I Met/M

Polar, uncharged R groups

1. Cysteine forms dimercalled cystine

2. Disulfide residues are strongly hydrophobic

Ser/S Thr/T Cys/C

Asn/N Gln/Q

Positively charged R groups

Lys/K Arg/R His/H

Negatively charged R groups

Asp/D Glu/E

Aromatic R groups

1. Non-polar2. Hydrophobic3. Form H-bond4. Absorb UV light (280 nm)

Phe/F Tyr/Y Trp/W

pKa values for carboxyl and amino groups

Amino acid have characteristic titration curve1. Without ionisable R group

pI = (pK1 + pK2 )/2

pK1 = for acidpK2 = for base

2. With ionisable R group

pI = ?

1. All protein polymers are constructed from the same set of 20 amino acids.

2. Polymers of proteins are called polypeptides.

3. A protein consists of one or more polypeptides folded and coiled into a specific conformation

4. The physical and chemical characteristics of the R group determine the unique characteristics of a particular amino acid.

Protein

Peptide Bond

Properties of Peptide bond

1. Planar2. Double bond character (which prevent rotation

about this bond)3. Uncharged (helps to form tightly packed globular

structure4. Two conformations possible (cis and trans)5. All peptide bonds are in proteins are trans6. Other than peptide bonds in protein helps to take

many conformational structure7. Average half-life of peptide bond is 7 years

(intracellular condition)

Fully extended polypeptide chain

Both bond can rotate

and are zero

trans-Peptide group

cis-Peptide group

Peptide and Protein

Peptides: 1. Short polymers formed from the linking of (usually less

than or equal to 100) amino acids and comprise 2. Some of the basic components of human biological

processes, including enzymes, hormones, and antibodies.

Protein:1. A functional, polypeptide chain composed of at

least around fifty amino acids put together. 2. They play a critical role in biochemical reactions

within cells.

1. Simple protein: contain only amino acid residues2. Conjugated proteins: contain permanently associated chemical

components in addition to amino acids(a) Lipoproteins: contain lipid(b) glycoproteins: contain sugar(c) metalloprotein: contain specific metal

Types of Protein

Structure of Protein

I. The primary structure determines the folding of the polypeptide to give a functional protein

II. Polar amino acids (acidic, basic and neutral) are hydrophilic and tend to be placed on the outside of the protein.

III. Non-polar (hydrophobic) amino acids tend to be placed on the inside of the protein

IV. The possible conformations are very large

V. Most are useless, natural selection picks out the best

Primary Structure

1. Elongated2. Dynamic, heterogeneous3. Very rich in H-bond4. Every protein has unique amino acid sequence5. Sequence decide the mechanism of action6. Sequence determine the 3-D structure7. Sequence reveals about its evolutionary history

Primary structure of protein

1. Starting of the polypeptide chain : N-terminal (amino group)

2. End terminal of the polypeptide chain: C-terminal (carboxylic group)

Human insulin

1. Strong interactions(a) Covalent bonding(b) Ionic bonding(c) Resonance bonding

2. Weak Interaction(a) Van der Waals interaction

(i) Polar-polar (ii) Polar-non polar(iii) Non polar- non polar

(b) Hydrogen bonding

3. Effect of medium(a) Screening of field(b) Surface interaction(c) Hydrophobic environment(d) Hydrophilic environment

Molecular interactions

The polypeptide chain can fold into regular structures like:

1. Alpha helix2. Beta-sheet3. Turns and loops

These are called secondary structure which helps to form final 3-D structure

Secondary structure of proteinThe folding of the N-C terminals of the chain using many different interactions

Alpha-helix1. It is coiled structure stabilized by intra-chain H-bonds

between NH and CO groups (situated 4 residue ahead in the sequence) except end terminal groups

2. Pitch of alpha-helix 5.4 A

3. Both right handed and left handed helix are allowed , however right handed alpha-helices are energetically more favourable because there is less steric clash between the side chain and backbone

4. Content in the protein: alpha-helices may be 100%

5. Glycine, serine and threonine make amino terminal residue (N-cap) in alpha-helix

6. Glycine and asparagine makes carboxyl terminal (C-cap) of alpha-helix

Helical Wheel: Each residue can be plotted every 360/3.6=100 around a circle or spiral

Representation of -helices in protein

-helicesH bond between residues i, i+4

Rise per residue, d = 1.5

# of residues per turn, n = 3.6

Pitch of helix= n x d = 5.4

The -helix has a dipole moment

The dipole of a peptide unit. Numbers in boxes give the approximate fractional charges of the atoms of the peptide unit

The Dipoles of peptide units are aligned along the helical axis

Beta- sheet/strand1. It is stabilized by H-bonding between chains

2. This secondary structure may associated through side chain interactions and form super-secondary structure called motif

3. Beta-sheet is almost fully extended

4. The distance between adjacent amino acids along a beta-strand is roughly 3.5 A (in contrast to 1.5 A in alpha-helix)

5. A beta- sheet is formed by linking two pr more beta strands by H-bonds

6. Beta strand represented by broad arrows pointing in the direction of C-terminal

7. Beta sheet formation is important in fatty acid-binding proteins and lipid metabolism

-Strand

The side chain (green) are alternatively above and below the plane of the strand

NN

C

C

Anti-parallel -sheet

Adjacent -strand run in opposite direction. H-bond between NH and CO groups connect each amino acid on an adjacent strand and stabilize the structure

Parallel -sheet

Adjacent -strand run in same direction. H-bond between NH and CO groups connect each amino acid on one strand with two different amino acids on the adjacent strand

N

N

C

C

Mixed -sheet

Turns and Loops

1. Polypeptide chain can change direction with the help of turn and loops

2. CO group of residue i is H-bonded with NH group of residue i+3

3. This particular H-bond interaction stabilizes abrupt changes in the direction of polypeptide chain

4. Turn and loops connect alpha-helices and beta strand and allow a peptide chain to fold back on itself to make a compact structure

5. Loops often contain hydrophilic residues and are found on the protein surface

6. Turn or loops contain 5 residues or less

7. Beta turn connects different anti-parallel beta strands

Ramachandran plot

Phi ()

Psi ()

Phi () and Psi () rotate, allowing the polypeptide to assume its various conformations

some conformations of the polypeptide backbone result in steric hindrance and are disallowed

glycine has no side chain and is therefore conformationally highly flexible (it is often found in turns)

no stericclashes

permittedif atoms aremore closelyspaced

Tertiary structure of protein

Spatial arrangement of amino acid residues that are far apart in the sequence

1. This folding is sometimes held together by strong covalent bonds (e.g. cysteine-cysteine disulphide bridge)

2. Bending of the chain takes place at certain amino acids (e.g. proline)

3. Hydrophobic amino acids tend to arrange themselves insidethe molecule

4. Hydrophilic amino acids arrange themselves on the outside

Quaternary structure of protein

1. Polypeptide chains can assemble into multi sub-units

2. Sub units are spatially arranged

3. Helix-loop-helix: two helices connected by a turn

4. Coiled-coil: two alpha helices interact in parallel through their hydrophobic edge

5. Helix-bundle: several alpha-helices that associate in an anti-parallel manner

6. Beta-alpha-beta unit: two parallel beta strand linked to an intervening alpha helix by two loops

Levels of structure in proteins

Protein structure: overview

Structural element Description

primary structure amino acid sequence of protein

secondary structure helices, sheets, turns/loops

super-secondary structure (motif) association of secondary structures

domain self-contained structural unit

tertiary structure folded structure of whole protein includes disulfide bonds

quaternary structure assembled complex (oligomer) homo-oligomeric (1 protein type) hetero-oligomeric (>1 type)

Physical parameters of protein

Size of the protein roughly between 1nm to 10nm

Persistence length of protein ranges between 0.3 nm to 0.8 nm

Elastic modulus of protein ranges between 1200 to 2000 pN/nm2

The protein structure must obey

1. The bond lengths and bond angles should be distorted as little as possible

2. No two atoms should approach one another more closely than is allowed by there van der Waals radii

3. The amide group must remain planar and in the trans configuration. This allows only rotation about the two bonds adjacent to the alpha-carbon

4. Some kind of non-covalent binding is necessary to stabilized a regular folding

Domain structures core is exclusively built from helices

Domain structures core comprises of antiparallel sheets, usually two sheets packed against each other

/ Domain structures made from combinations of -- motifs that form a predominantly parallel sheets surrounded by helices

Structure of proteins

Human plasma retinol binding protein. Retinol molecule (vitamin A) bound inside the barrel

Triosephosphateisomerase

Structure of proteins

Structure Function

Structure Mechanism

Structure Origins/Evolution

Structure-based Drug Design

Solving the Protein Folding Problem

Solving Protein Structures

Atomic resolution pictures of macromolecules X-ray Crystallography (first applied in 1961 - Kendrew & Perutz) NMR Spectroscopy (first applied in 1983 - Ernst & Wuthrich)

QHTAWCLTSEQHTAAVIWDCETPGKQNGAYQEDCAHHHHHHCCEEEEEEEEEEECCHHHHHHHCCCCCCC

Crystallographic structure of Myoglobin

10

(1958, Sir John Kendrew)

Protein Structure solved by X-ray crystallography

PDB contains 75000 structures mostly determined by X-ray crystallography and NMR. About 3-5 new structures per day

Total Yearly

Hemoglobin A: Val-His-Leu-Thr-Pro-Glu-Glu-Lys-Hemoglobin S: Val-His-Leu-Thr-Pro-Val-Glu-Lys-

sticky patch causes hemoglobin S to agglutinate (stick together) and form fibers which deform the red blood cell

Importance of Protein StructureUsing electrophoresis, Pauling showed that individuals with sickle cell disease had a modified form of Hb

101 residues. each residue can assume three different conformations

the total number of structures would be 3100, which is equal to 5 1047

If it takes 10-13 s to convert one structure into another

the total search time would be 5 1047 10-13 s

which is equal to 5 1034 s, or 1.6 1027 years.

Protein folding: Levinthals paradox

The enormous difference between calculated and actual folding times is called Levinthal's paradox.

There should be some pathways for folding

Factors affecting protein folding

1. Space packingproteins are like liquid and gases instead of crystalline solidit helps in forming structure but space packing is not enough

2. Internal residue: Folding is directed mainly by internal residues not by surface residues. (Hydrophobic force-driven folding)

3. Protein structures are hierarchically organized

4. Protein structures are highly adaptable

5. Secondary structure can be context dependent and can be predicted by algorithms

6. Changing the fold of a protein

Speed limit of protein folding

For a single domain protein The approximate folding time (folding) is given by N/100 s

-protein fold faster than the protein or protein

folding = k exp(G/kBT)

Hydrophobic effect

Conformational entropy

Electrostatics

Hydrogen bonding

van der Waals interaction

Protein folding

The main driving force for folding water soluble globularprotein molecules is to pack hydrophobic side chains into the interior of the molecule , thus creating a HYDROPHOBIC CORE &HYDROPHILLIC SURFACE.

Problem- How to create such a hydrophobic core from a protein chain ???

?Protein folding

Compact (in general)Defined structure

Insulin

Molten Globule

1. Secondary structure that is present in a native protein forms within a few microsecond

2. This is because of hydrophobic collapse3. It is larger by (5-15%) in size of native conformations4. Side chains are not ordered/packed5. Structure fluctuation is much larger 6. Not thermodynamically stable

Folding protein moves over energy surface from unfolded to folded state:

unfolded

folded

Energy landscape governs folding

100

k BT

Deg

ree

of

nativ

enes

s

H= bond stretching + bending of angles + Bond rotations + van der waals interaction + electrostatic interaction

Folding is a complex process

Misfolding

Macromolecules must fold into correct shape to function properly: Structure Function

Disease(non-native structures)

Many diseases with large impacts involve protein misfolding:

Alzheimers Disease A peptide

Parkinsons Disease -synuclein

Creutzfeldt-Jakob disease Prion

Huntingtons Disease Huntingtin protein

Amyotrophic Lateral Sclerosis Superoxide dismutase

Type II Diabetes Amylin

Misfolding and aggregation are complex

nativelyfolded

amyloid fibrils

Many different species and steps involved in misfolding and aggregation:

protein synthesis unfolded

ribosome

partiallyunfolded

partially-unfoldedand aggregated

disorderedaggregates

degradedfragments

non-native structuredaggregates

misfolded

Chiti & Dobson, Annu. Rev. Biochem., (2006)

Force-extension curvesMove traps apart at constant rate to stretch handles and apply force to a single-molecule:

50 mM MOPS, 200 mM KCl, pH 7.0

unfolding

stretching handles

molecule unfoldsrefolding

Apparently two-state folding

molecule unfolds

Reconstructing landscapes from FECsBased on Jarzysnki equality for relating equilibrium free energy to non-equilibrium work:

Hummer & Szabo, PNAS (2001)

Many applications, never validated experimentally

Forc

e

Distance

Work

But energy is dissipated!

WGeqm

Recover equilibrium energy fromfluctuations in work done:

Prob

abili

ty

Work

Geqm

W

Jarzynski, Phys. Rev. Lett. (1997)

Jarzynski equality:

=

TkW

TkG

B

eqmnon

B

eqm expexp

Method to calculate free energy profile G(x)

Reconstructing energy landscape profilesUsing equilibrium probability distributions:

Exte

nsio

n (n

m)

Time (s)

Prob

abili

ty d

ensit

y

( ) ( )expB

G xP x

k T

( ) ( )logBG x k T P x =

Thousands oftransitions!

Extension (nm)

Free energy (kJ/mol)

Invert

Deconvolution to remove effects of compliant handles:

Prob

abili

ty

Extension (nm)

P(x)

deconvolution

Residual

Free

ene

rgy

(kJ/

mol

)

Extension (nm)

G(x)

raw data

Woodside et al., Science (2006)

parameter-free modelfor folding

Woodside et al., PNAS (2006)

Slow and time consuming experiments

Computational approach for protein folding

1. Energy minimization(a) Steepest(b) Conjugated gradient

2. Monte Carlo Simulation(a) Random Sampling(b) Stimulated annealing

3. Molecular dynamics(a) Compute conformational change(b) Calculate trajectories at thermal condition and fond

the ensemble averaged physical quantity

Denaturants high temperatures

- cause protein unfolding, aggregation

low temperatures- some proteins are sensitive to cold denaturation

heavy metals (e.g., lead, cadmium, etc.)- highly toxic; efficiently induce the stress response

proteotoxic agents (e.g., alcohols, cross-linking agents, etc.)

oxygen radicals, ionizing radiation- cause permanent protein damage

chaotropes (urea, guanidine hydrochloride, etc.)- highly potent at denaturing proteins;often used in protein folding studies

Protein folding/unfolding

Force spectroscopy

AFM Optical Tweezers

Protein-Protein Interaction NetworksYeast ~6000 proteins, ~3 interactions per protein, i.e. ~>20,000 interactions. Humans ~100,000 interactions

Nat. Biotechnol. 18, 12571261 (2000)

Which two proteins will interact?

AND, which will not?

The ANSWER lies in the nature of theinteracting surfaces

A-B, A-C forms poorly matched surfaces, few weak bonds are formed, broken apart by thermal motion

A-D offers well matched surfaces, enough noncovalent bonds are formed to create a stable interface

Forces driving protein-protein interaction

Long-range attractive interactions

electrostatic steering

Short-range non-covalent forces:

Hydrophobic interactions

van der Waals attraction

Hydrogen bonds

Ion pairs

Other factors:

Shape and charge complementarity

Secondary structure

Amino acid composition

Slide Number 1Slide Number 2Slide Number 3Slide Number 4Slide Number 5Slide Number 6Slide Number 7Slide Number 8Slide Number 9Slide Number 10Slide Number 11Slide Number 12Slide Number 13Slide Number 14Slide Number 15Slide Number 16Slide Number 17Slide Number 18Slide Number 19Slide Number 20Slide Number 21Slide Number 22Slide Number 23Slide Number 24Slide Number 25Slide Number 26Slide Number 27Helical Wheel: Each residue can be plotted every 360/3.6=100 around a circle or spiralSlide Number 29Slide Number 30Slide Number 31Slide Number 32Slide Number 33Slide Number 34Slide Number 35Slide Number 36Ramachandran plotSlide Number 38Slide Number 39Slide Number 40Protein structure: overviewSlide Number 42Slide Number 43Slide Number 44Slide Number 45Slide Number 46Slide Number 47Slide Number 48Slide Number 49Slide Number 50Slide Number 51Slide Number 52Slide Number 53Slide Number 54Slide Number 55Slide Number 56Slide Number 57Slide Number 58Slide Number 59Slide Number 60Slide Number 61Slide Number 62DenaturantsSlide Number 64Slide Number 65Slide Number 66

Protein Structure

Documents

Transcript of Protein Structure