Steven Plotkin - University of British Columbia · 2007. 7. 26. · Size ~ 30 Å We can...

Multiple minima problems in biopolymers

Steven Plotkin

-P.A.M. Dirac

Mechanical behavior of glassy materials, 7/2007

Consider a protein such as this one:

∆E ~ 102 kBT (~102 H-bonds, H-φ) , ∆S ~ 102 kB ,∆F = ∆E –T ∆S ~ 10 kBT ~ 0.1 kT/aaProteins are only marginally stable(Facilitates their degradation)

2ci2Proteinase inhibitor~30 nm end-to-end,~70 aa (small)M=10,000 Au

Size ~ 30 Å We can coarse-grain it so we can study itsfolding process on a computer –(must be careful!)

CI2 is 1 member of 1 class of proteins

How many proteins are there in humans?

There are betw 30,000-40,000 genes in humans

Proteins are abundantly synthesized

From translation rate, # ribosomes/cell,avg protein length: ~10,000 proteins ·s-1 ·cell-1

>> metabolic requirements ~ 100 proteins ·s-1 ·cell-1

~99% of synthesized proteins have been recycled(prokaryotic)

For proteins the energy landscape must be largely downhill

N

~ 70N (CI2)

Paradigm: Protein dynamics occurs on a funneled energy landscape

E

“Progress Coordinate”

“Reaction coordinate”

“Foldedness”

“Q”Wolynes, Onuchic, Dill…

Funneled, high barriers

Disconnectivity trees

Funneled, low barriers

Clusters together minimaw barriers no higher than a given threshold

Frustrated/GlassyDJ Wales “Energy Landscapes” (Cambridge 2003)

“Random Heteropolymer”

DJ Wales “Energy Landscapes” (Cambridge 2003)

Protein-like Heteropolymer

Replica symmetry breaking and ultrametricity

The glass transitionrepresented throughthe sequence avgdtree that describeshow replica states are clusteredas T

Replica calculations for Virial expansion (H Orland ‘88, Shakhnovich ‘89)

Including correlations, calcof residual polymer entropy

RSB order parameter

Plotkin, Wang, Wolynes (1996)

0.7η =

1η =

12 G

T T≈Probability distribution of similarity q, for

( )dx q dq=

At T< Tg, system freezes into a basin of states rather than a single oneIf the system is less dense, basin be large with much entropy.

The glass temperatures for the REM and GREM are very close

So in practice the REM glass temperature may be usedfor the onset of non-self averaging behavior

The glass temperature is below the folding temperature over most of therange of the landscape.

glass T

folding T

“foldedness” (fract of native contacts)

“compactness”(fract of total possible contacts)

• Free energylandscape is self-averaging.

• Diffusion coefficientdecreases as the proteinfolds.

• Folding dynamics arenon-glassy and fast.

A funneled landscape avoids the glassy physics that characterizes random sequences.

E

“Q”Is this the only way?

One can imagine other landscapes that satisfy the thermodynamic and kinetic requirements of foldingDensity of states could still be RHP-likeBut barriers that make folding slow could be unusually smallalong a particular “buffed” coordinate.These landscapes must be rugged enough to be “glassy” at TbioOnly weakly gapped/funneled in the conventional sense

0.5GNDP >

Plotkin, Wolynes PNAS (2003)

The buffing mechanism for foldingWhat are the fraction of paths below F‡?

( )ENF

( )

Fi

Fi

EEE

SQEE

E

e

e

δδ

δδ

∫

∫

=

⎥⎥⎥

⎦

⎤

⎢⎢⎢

⎣

⎡

⎟⎟⎠

⎞⎜⎜⎝

⎛∂∂

−∂∂

−

⎥⎥⎥

⎦

⎤

⎢⎢⎢

⎣

⎡

⎟⎟⎠

⎞⎜⎜⎝

⎛∂∂

−∂∂

−

≠

∗

∗

∫

∫−

N

N

NF

NFmdN

NF

NFmdN

F

FEFp

0

0

2

2

D

DV

|( )( )0,|,

0,|,

KK

iNU

iNSQ

ENFENF

=

Diff seqs have different F(Ne)to escape fromlocal traps

Gaussianly distributedincrement

These are distributed aboutthe mean F(Ne)

( ) ( )2

SQ i E2EE

K 1 K K V K

4d F

dNF E N

N m F Fδ δ∗

⎛ ⎞∂ ∂ ∂− + + = −⎜ ⎟∂ ∂ ∂⎝ ⎠

Slowly varying

( ) 11

H2

212

The adiabatic approximation with ground state dominance givesan exponentially rare fraction of buffed sequences

( )2 2 1 2"surface tension"2

22 2ln

28 2e

b bGT N N NF

bf F

π ν

σ≠≠

⎛ ⎞−⎜ ⎟⎝ ⎠− − − ×

≈

Mean-fieldbehavior

Ruggedness/stability

Sequence fluctuations

Funneled sequences are also exponentially rare in N:

( ) G Ge eT N Tf ε−∆ −∆ ≈ =

How rare are “buffed seqs” compared to funneled seqs?Quantitatively address this by finding the probability pathways are buffed tosay 4 kT, vs. the probability a sequence is funneled enough s.t.Plot of pfun and pbuff vs. chain length:Funneling becomes dominant for long chains But there is a crossover

4 UG kT≠∆ =

A compound mechanism?Funneling removes entropyIn reduced ensemble, dynamics between traps mediated by buffing in later stages of folding.

Plotkin, Wolynes PNAS (2003)

E

“Progress Coordinate”

“Reaction coordinate”

“Foldedness”

“distance”

What quantity can tell us how far a structure is from folded?

(N)

AND

BND

Should be a geometrical quantity depending only on the 2 structures considered.

Some heuristic distance measures widely used:

Q = fraction of native ‘contacts’ present

1Fp

1 3Q =

N 1Fp ∼

0Q =

Another heuristic distance measure widely used is

N

( )21

1 N Ni i

iRMSD

Nα

=

= −∑ r rα

works better for these structures:

However it is not a true “distance”

Real (minimal) distance beads could move would be:

( )21

1 N Ni i

iMRSD

Nα

=

= −∑ r r RMSD≤

Both RMSD, MRSD fail to give the proper distance a partly foldedprotein needs to move if chain non-crossing is important.

e.g.

0MRSD ≈

Can we generalize the notion of distance between 2 points to calculatethe distance between 2 curves? (Plotkin PNAS (2007) in press).

The distance between 2 points is obtained from the variationalminimization of the functional:

( )B

A

D t = ∆⎡ ⎤⎣ ⎦ ∫r rA

B

0t =

t T=

2 2 2B

A

x y z= ∆ + ∆ + ∆∫

2 2 2

0

T

dt x y z= + +∫

( )gµν µνδ=

( )dxxdt

=

Usually this action is written non-parametrically:

2 21B

A

x

x

D dx y z′ ′= + +∫dyydx

′ =

But for the general problem we have no guarantee y,z are simple functions of x.

Minimization

( )* tr( )* *D D t⎡ ⎤= ⎣ ⎦r

2

0

0T

D dtδ δ= =∫ r gives the distance

and , the minimal transformation

Ba 1D object.

A

2

0

0T

D dtδ δ= =∫ rSpecifically gives

0d Ldt

∂⎛ ⎞ =⎜ ⎟∂⎝ ⎠r2L = r

ˆ0d d

dt dt⎛ ⎞

= =⎜ ⎟⎜ ⎟⎝ ⎠

r vr

ˆ veˆ ∴ =v const in direction

Straight-line motion results

s.t satisfies B.C.’s A, B is a solution.( ) ( )0 ˆt v t=v vHowever any ( )tr

( ) ( )00

ˆt

t v t= + ∫r A v

We could have fixed a gauge from the outset, or we can fix it at the end(easier).

We want to generalize this treatment to find the distancebetween any two space curves ( ) ( ), A Bs sr r

First, write the distance functional in the same way as before:

( ) 20 0

[ , ]L T

D s t ds dt= ∫ ∫r r

is the accumulated ‘area’ of the minimal transformation,a 2-D object, with B.C.s

( ) ( ), A Bs sr r

( )* ,s tr (

The distance between 2 space curves

) ( ) ( ) ( )* *,0 , ,A Bs s s T s= =r r r r

It is not a 3-D “soap-film” area:

L

d

0soapA =

ABD Ld=

Is it a 4-D space-time area, as in classical relativistic string theory?

( ) ( )22 20 0

[ , ]L T

Nambu GotoS s t ds dt− ′ ′= − ⋅∫ ∫r r r r r

s∂′ ≡∂rr

( ) 20 0

[ , ]L T

D s t ds dt= ∫ ∫r rour

The “simplest” example:

( )A sr( )B sr

The minimal transformation is not a simple rotation!

xy

x y Ly x L+ == − + 2 2D L=

x y

Our action does not map to the world-sheet area of the classical relativistic string.

static string still accumulatesarea

( ) 20 0

[ , ]L T

D s t ds dt= ∫ ∫r rMinimizing the action:

results trivially in straight line motion- the MRSD.

Inextensible strings must include the constraint:

s∂′ =∂rr( )2 , 1s t′ =r “rubber band”

Minimize:

( ) ( )( )2 20 0

[ , ] ,L T

D s t ds dt s tλ ′= −∫ ∫r r r

( )( )

0L tδδ τ⎡ ⎤⎣ ⎦ =rr

The extremum gives the Euler-Lagrange equation:

( )* ,s trSolve for , the minimal transformation.

ˆˆ λ λ′= +v κ trecast EL eqn:

unit velocity curvaturetangent

whether an extremal transformation is a minimum can be determinedfrom the second variation:

( )20 0

1 02

L T

D ds dtδ δ δ δ δ′ ′= ⋅ ⋅ + ⋅ ⋅ >∫ ∫ r I r r Λ r

( ) ( ) ( ) ( )* *,0 , ,A Bs s s T s= =r r r rB.C.s at t=0,T are given by

String ends are free at s=0,L, so conjugate momenta must vanish:

0,

0ss L

L

=

∂∴ = =

′∂p

r

( )2 2,L s tλ ′= −r rbut for our

ˆ 0s λ= =p t

( ) ( ) 0, , 0t L tλ λ∴ = = (string tension vanishes)

Boundary conditions

ˆˆ λ′=v tBut then EL eqnend pts

ˆˆ λ λ′= +v κ t simplifies toAnd since v̂ is ˆ ⊥ v

ˆˆ 0λ′∴ ⋅ =v tˆ 0⋅ =v tso either -> pure rotation

ˆ 0=v0λ′ = and thereforeor -> straight line motion

Also trivial soln 0=v

For numerical solutions, discretize the chain (method of lines)EL PDE becomes a set of coupled ODEs

Similar to the analysis of B.C.s, the ODEs for particle i have solutions:

-> pure rotation

-> straight line motion

-> trivial soln 0=v

Simplest case:

Weierstrass-Erdmanncorner conditions

Mohazzab, Plotkin

To apply the theory, additional terms may be included in the Lagrangian:

•Curvature constraints – persistence length•Non-crossing - Edwards constraint:

( ) ( ) ( )( )1 2 1 20 0

[ , ] , ,L L

NCV s t ds ds s t s tδ= −∫ ∫r r r

0RMSDD ≈2D =

Applications to proteins.Structures with , MIN MAXD D

Does ?FDk( )Does ?iiN FD p

Research opportunities!in the Plotkin Group

1 PDF1 Grad student

Steven Plotkin - University of British Columbia · 2007. 7. 26. · Size ~ 30 Å We can...

Documents

Transcript of Steven Plotkin - University of British Columbia · 2007. 7. 26. · Size ~ 30 Å We can...

Questioning : where can we go?theory.physics.uci.edu/slides/2014/Zumalacarregui.pdf · Questioning : where can we go? Miguel Zumalac arregui Instut fur Theoretische Physik - University

arXiv:1907.06133v1 [stat.ME] 13 Jul 2019 · 2015]). we only give the earliest reference we can track for each category to highlight the chronicle of methodology development. We will

How low can we go? Getting below β *=3.5m R. Bruce , R.W. Assmann

Lambda-Calculuslcs.ios.ac.cn/~yzhang/fopl/01_lambda.pdf · contained in the pure λ-calculus, but can be represented byλ-terms. This is about encoding, just as we can represent data

Supplementary Material of the Paper: Reciprocal ...€¦ · We report the effects of on training the overall RCF-GAN in Fig. 1. From this ﬁgure, we can ﬁnd that the proposed RCF-GAN

Efficient Spectral Methods for Learning Mixture Modelsqingqinghuang.github.io/files/qq_2016_specMix_slides.pdf8 Contribution / Outline of the talk Can we learn “non-worst-cases”

Multiplexing Switching and Routingliacs.leidenuniv.nl/~wijshoffhag/NETWERKEN2020/... · 2020-03-02 · Routing for Packet Switched Networks Like circuit switching can we differentiate

The Fourier Transform in One and More Dimensions · Now suppose that we know the Fourier transform of the density distribution, which we will call F(u,v). It can be written as (17)

Modelling of the Transformation of Coarse Grained ...378-1 Modelling of the Transformation of Coarse Grained Microstructure of α+β Titanium Alloys Along with FEM Simulation of Hot

To continue…. We can speak of “Organic Chemistry” And “Inorganic Chemistry”

Can we infer compositional similarities of soil and plant samples? Wilfred, Michelle, Geoff C/N ratios δ 13 C FTIR.

HMI Filter Calibration - Stanford Solar Observatories …sun.stanford.edu/~couvidat/HMI/meeting_july06.pdfHMI Filter Calibration Purpose: to make sure we can characterize the transmission

Together We Can!€¦ · Content Strategy για TripAdvisor, Booking.com και εκπαίδεʑση προσʙπικού που διαχειρίζεται τα συγκεκριμένα

Mini-Course 1: SGD Escapes Saddle PointsWhy do we use SGD? Initially because: I Much cheaper to compute using mini-batch I Can still converge to global minimum in convex case I Can

Comparison of coarse-grained (MARTINI) and atomistic ...

What can we learn from η production in proton-proton collisions? Joe Seele MIT and University of Colorado.

Non-Comparison Based Sorting. How fast can we sort? Insertion Sort: O(n 2 ) Merge Sort, Quick Sort (expected), Heap Sort: O(nlgn) Can we sort faster than.

Hysteresis of liquid adsorption in porous media by coarse … · · 2016-07-25Hysteresis of liquid adsorption in porous media by coarse-grained Monte Carlo with direct experimental

228Th breakthrough in Ra samples: what can we know and ...

Aerodynamics & Aeroelasticity - NTUA...Aerodynamics & Aeroelasticity Certification of WT 10 How can we describe real (turbulent) wind? The turbulent wind model We need to describe