milestoning - web.ma.utexas.edu · on a pc it takes months to compute 100 ns (10-7s). 6 the problem...

36
1 Milestoning Ron Elber and Anthony West Department of Chemistry and Biochemistry and Institute of Computational Sciences and Engineering (ICES) University of Texas at Austin $ NIH

Upload: others

Post on 18-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

1

Milestoning

Ron Elber and Anthony WestDepartment of Chemistry and Biochemistry and

Institute of Computational Sciences and Engineering (ICES)University of Texas at Austin

$ NIH

Page 2: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

2

Motivation

• Understanding Myosin II:– A protein that converts biochemical energy

(ATP) to mechanical energy in the muscles

Page 3: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

3

Myosin Stroke Cycle (Lymn-Taylor)

S. Mesentean et al., JMB, 367, 591–602 (2007)

Page 4: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

4

Movie of the reaction path for the recovery stroke in myosin

Page 5: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

5

Simulation background• Computer simulations use classical

mechanics to simulate protein dynamics

Md2X

dt2= !"U X( )

X t + #t( ) = X t( ) +V t( ) $ #t !#t 2

2M

!1 $"U X t( )( )

V (t + #t) =V t( )!#t

2M

!1 $ "U X t( )( ) +"U X t + #t( )( )%& '(

Δt about 10-15s (a femtosecond) to maintain stability of thealgorithm.We are interested in time scale of 10-3s (a millisecond). On a PC ittakes months to compute 100 ns (10-7s).

Page 6: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

6

The problem

• Twelve orders of magnitude differencebetween the basic time steps (10-15s) andthe biological time scale (10-3s).

• How to bridge the time scale gap?– Coarsening space and time while keeping

molecular details

Page 7: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

7

The concept of Milestones

R P

q•Microscopic molecular motions tend to be diffusive at long time scales

•Following in detail individual trajectories is computationally expensive

•Can we compute the trajectories in pieces? The pieces will give us acoarse grained model.••Is it correct?Is it correct? Is it computationally efficient?Is it computationally efficient?

The interfaces between which we compute the pieces of thetrajectories are called MilestonesMilestones

Page 8: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

8

Is computing trajectory piecesmore efficient than computing

it as a whole?

Page 9: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

9

Chopping trajectories iscomputationally efficient (I)

RP

L

Parallelization Pieces of trajectories are trivial to parallelize.Speed up proportional to the number of processorsSpeed up proportional to the number of processors

Page 10: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

10

Chopping trajectories iscomputationally efficient (II)

RP

L

Diffusive processes The time t to diffuse a path of length L isproportional to L2 . If we divide the path to N segments the timebecomes (L/M)2. There are M Milestones and therefore the timerequired is M*(L/M)2=L2/M. Speed up factor of M number ofMilestones.

Page 11: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

11

Chopping trajectories is computationallyefficient (III)

RP

Activated process If one of N1 trajectories that starts in Rreaches q1, and one of N2 trajectories that starts at q1 reaches q2the total number of trajectories that we need to sample q2starting from R is N1*N2. Using Milestoning we only need N1+N2Speed up exponential in the number of Milestones - MSpeed up exponential in the number of Milestones - M

L

U(q)

q1 q2

Page 12: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

12

Building a coarse grained model (which is equation-free):What do we keep from the short chopped trajectories?

R P

ii !1 i +1

1. Initialize trajectories at the Milestones from a stationary time-independent distribution assumed known (e.g. canonical).

2. Compute trajectories between any pair of Milestones (i,j) witha shared volume, estimate the first passage time distribution

Kij!( )

Page 13: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

13

The local first passage time distribution

Kij !( )" The probability density that a trajectory that starts

at Milestone i will terminate exactly after time ! at

Milestone j

Kij !( ) #d!0

$

% = pij " The probability that a trajectory that was

initiated at Milestone i will terminate at Milestone j

pijj

& = 1 -- A normalization condition on pij (all traj terminate)

This is all the microscopic information needed. No mechanical modelis required.

Page 14: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

14

Example for a typical Kij!( )

Page 15: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

15

What do we want to compute from the coarsemodel?

R P

Pi!1

t( ) Pit( ) P

i+1t( )

Pit( ) microscopically it is the probability of being somewhere

between Milestones i-1 and i+1 at time t such that the lastMilestone passed by the trajectory is Milestone i.

Provides a blurred (coarse) spatial description withoutreducing the number of degrees of freedom

PNt( )

Page 16: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

16

• Define : prob of being at i at time t

• Define : prob of transition to i at t

• Define : conditional probability of a transition fromi to j after incubation time τ.

•Then hopping dynamics are defined by:

The QK picture

Qi(t) = P 0( )

i! t " 0+( ) + Q

j(t ')K

j ,i(t " t ')#

$%&

0

t

' dt 'j

(

Pi(t) = Q

i(t ') 1" K

i, j() )

j

(#

$*

%

&+d)

0

t" t '

'#

$**

%

&++0

t

' dt '

P

i(t)

Q

i(t)

Ki, j

!( )

Page 17: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

17

With the matrixdetermined, compute long time kinetics

Kij!( )

Qi(t) = P

i0( )! t " 0+( ) + Q

j(t ')K

ji(t " t ')#

$%&

0

t

' dt '

Pi(t) = Q

i(t ') 1" K

ij(( )

j

)#

$*

%

&+d(

0

t" t '

'#

$**

%

&++0

t

' dt '

• by direct integration• by Laplace transform (Shalloway)• by trajectory statistics (Vanden Eijnden)

Page 18: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

18

Computing the average first passage time fromMilestone 1 to N

! = !12

1( )+ ...+ ! ij

n( )+ ...+ ! kN

L( )"# $%l=1,...,L

& ' p12 ' ' ' pij ' ' ' pkN( )

! ijn( ) is a random variable sampled from Kij !( )

pij = Kij !( )d!0

(

) is the transition probability from i to j

pij = 1j

& A few points:The limit is consideredThe N state is absorbing τNk=0 pNN=1

To average over τ…

L!"

Page 19: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

19

Compute <τ>

Define a random matrix T( )ij! pij" ij

the matrix P( )ij! pij of size N # N

the vector "̂L( )! "1

L( )," 2

L( ),...," NL( )( )

T

with elements " iL( )

that are the overall first passage times using L steps

to go from i to N

Then

"̂L( )= P

L$lTP

l$11l=1

L

% = PL$lT1

l=1

L

%

where 1= 1,1,...,1( )T

and P1= 1

1 2 3 4 5 6 Nij

Page 20: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

20

The last tricks to compute <τ>For L!" the trajectory is absorbed at Milestone N .

Since the time at N does not count (#NN = 0) $ #̂ L+1= #̂

L

#̂L( )= P

L% lT1

l=1

L

&

#̂L+1( )

= PL+1% l

T1l=1

L+1

& = P PL% lT1

l=1

L

& + T1 = P#̂L( )+ T1

I % P( )#̂ = T1 (remove eigenvector 1 from the set),

T1 = 0 (I - P)1 = 0, define, I ,P,T of size N %1( ) ' N %1( )).

T is a random matrix and the average first passage time

is obtained by avergaing over elements of T . T( )ij

= pij # ij

Only the first moments of # ij are required to compute #̂ .

#̂ = I % P( )%1

T 1

Page 21: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

21

Is Milestoning correct (or whatare the assumptions)?

R P

ii !1 i +1

Page 22: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

22

Assumption

Let Sj be the hypersurface of Milestone j.

Let Xj be a coordinate vector Xj !R3Nand Xj !Sj .

" Xj( ) is the distribution at Sj initiating the short trajectories.

#ij Xi( ) is the distribution obtained from first passage traj on Sj

if initiated at Si according to " Xi( )

Assumption : #ij Xi( ) = " Xj( )

Sj

Si

Xj

Xi

Implies:• Loss of memory• committers

Page 23: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

23

Committers are special surfaces with equal probability ofreaching for the first time the product and the not the

reactants

p1 p2 p3

R

P

Committer surfaces can be calculated exactly (solvingpartial differential equations) in 2-3 dimensions forBrownian dynamics and approximated at higherdimensions and other types of equations of motion.

Page 24: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

24

Or a tunnel picture

Relaxation in planes faster than transition between planes.General but heuristic.

Page 25: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

25

Toy model: 1D box simulation

• Microscopic dynamics are Brownian• Simulations run at various temperatures

and for 4, 8, and 16 milestones

!dX

dt= "#U + R

R = 0 R t( )R t '( ) = C$ t " t '( )

Page 26: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

26

1D reaction curves (5000 trajs/MLST)

Page 27: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

27

2D simulation

q

Page 28: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

28

Memory loss demonstration

0.15!"!

2

0

6.34, ( )K d! ! ! !"# $

% &' ()!

"

Page 29: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

29

Alanine Dipeptide

JCP, 126, 145104 (2007)

Page 30: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

30

Reflecting boundary conditions

Absorbing boundary condition

Preparing initial conditions by samplingq !"

Page 31: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

31

Torsion velocity auto-correlation indicates whenmilestoning assumption is being violated

~ 400fsr

!

Page 32: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

32

Average Incubation Times vs. Velocity Relaxation Time

r! !" >

01

1( )i

M

s

i

K dM

! ! ! !"

=

# $%

1090253581713051137319129375873587431144

(fs)M !

Page 33: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

33

Rate Results

Page 34: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

34

Reaction curves

Page 35: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

35

• Milestoning divides RC into fragments whose kineticscan be computed independently then “glued” together

• Provides factor of improvement in computationalefficiency on serial machines, plus exp bootstrapping

• Uses LFPTDs from microscopic dynamics:• System distribution given by simple integral

equations that can be easily solved numerically• Correct kinetics for solvated alanine dipeptide (x 9

speedup)• Predicts microsecond Scapharca rate with ~10 ns total

serial time• Sub-millisecond rate for myosin recovery stroke with

total run time (serial) - speedup : more than 10,000

Summary

M

( )sP t

( )sK !

±

200sample ! 241mlst ! 0.01ns + 0.1ns ! 241 = 501.6ns

Page 36: Milestoning - web.ma.utexas.edu · On a PC it takes months to compute 100 ns (10-7s). 6 The problem •Twelve orders of magnitude difference between the basic time steps (10-15s)

36

Milestoning papers• Ron Elber, "A milestoning study of the kinetics of an allosteric

transition: Atomically detailed simulations of deoxy Scapharcahemoglobin", Biophysical J. ,2007 92: L85-L87

• Anthony M.A. West, Ron Elber, and David Shalloway, "Extendingmolecular dynamics timescales with milestoning: Example ofcomplex kinetics in a solvated peptide", J. Chem. Phys.126,145104(2007)

• Anton K. Faradjian and Ron Elber, "Computing time scales fromreaction coordinates by milestoning", J. Chem. Phys. 120:10880-10889(2004)

• Code (moil + zmoil) available fromhttps://wiki.ices.utexas.edu/clsb/wiki