milestoning - web.ma.utexas.edu · on a pc it takes months to compute 100 ns (10-7s). 6 the problem...
TRANSCRIPT
1
Milestoning
Ron Elber and Anthony WestDepartment of Chemistry and Biochemistry and
Institute of Computational Sciences and Engineering (ICES)University of Texas at Austin
$ NIH
2
Motivation
• Understanding Myosin II:– A protein that converts biochemical energy
(ATP) to mechanical energy in the muscles
3
Myosin Stroke Cycle (Lymn-Taylor)
S. Mesentean et al., JMB, 367, 591–602 (2007)
4
Movie of the reaction path for the recovery stroke in myosin
5
Simulation background• Computer simulations use classical
mechanics to simulate protein dynamics
Md2X
dt2= !"U X( )
X t + #t( ) = X t( ) +V t( ) $ #t !#t 2
2M
!1 $"U X t( )( )
V (t + #t) =V t( )!#t
2M
!1 $ "U X t( )( ) +"U X t + #t( )( )%& '(
Δt about 10-15s (a femtosecond) to maintain stability of thealgorithm.We are interested in time scale of 10-3s (a millisecond). On a PC ittakes months to compute 100 ns (10-7s).
6
The problem
• Twelve orders of magnitude differencebetween the basic time steps (10-15s) andthe biological time scale (10-3s).
• How to bridge the time scale gap?– Coarsening space and time while keeping
molecular details
7
The concept of Milestones
R P
q•Microscopic molecular motions tend to be diffusive at long time scales
•Following in detail individual trajectories is computationally expensive
•Can we compute the trajectories in pieces? The pieces will give us acoarse grained model.••Is it correct?Is it correct? Is it computationally efficient?Is it computationally efficient?
The interfaces between which we compute the pieces of thetrajectories are called MilestonesMilestones
8
Is computing trajectory piecesmore efficient than computing
it as a whole?
9
Chopping trajectories iscomputationally efficient (I)
RP
L
Parallelization Pieces of trajectories are trivial to parallelize.Speed up proportional to the number of processorsSpeed up proportional to the number of processors
10
Chopping trajectories iscomputationally efficient (II)
RP
L
Diffusive processes The time t to diffuse a path of length L isproportional to L2 . If we divide the path to N segments the timebecomes (L/M)2. There are M Milestones and therefore the timerequired is M*(L/M)2=L2/M. Speed up factor of M number ofMilestones.
11
Chopping trajectories is computationallyefficient (III)
RP
Activated process If one of N1 trajectories that starts in Rreaches q1, and one of N2 trajectories that starts at q1 reaches q2the total number of trajectories that we need to sample q2starting from R is N1*N2. Using Milestoning we only need N1+N2Speed up exponential in the number of Milestones - MSpeed up exponential in the number of Milestones - M
L
U(q)
q1 q2
12
Building a coarse grained model (which is equation-free):What do we keep from the short chopped trajectories?
R P
ii !1 i +1
1. Initialize trajectories at the Milestones from a stationary time-independent distribution assumed known (e.g. canonical).
2. Compute trajectories between any pair of Milestones (i,j) witha shared volume, estimate the first passage time distribution
Kij!( )
13
The local first passage time distribution
Kij !( )" The probability density that a trajectory that starts
at Milestone i will terminate exactly after time ! at
Milestone j
Kij !( ) #d!0
$
% = pij " The probability that a trajectory that was
initiated at Milestone i will terminate at Milestone j
pijj
& = 1 -- A normalization condition on pij (all traj terminate)
This is all the microscopic information needed. No mechanical modelis required.
14
Example for a typical Kij!( )
15
What do we want to compute from the coarsemodel?
R P
Pi!1
t( ) Pit( ) P
i+1t( )
Pit( ) microscopically it is the probability of being somewhere
between Milestones i-1 and i+1 at time t such that the lastMilestone passed by the trajectory is Milestone i.
Provides a blurred (coarse) spatial description withoutreducing the number of degrees of freedom
PNt( )
16
• Define : prob of being at i at time t
• Define : prob of transition to i at t
• Define : conditional probability of a transition fromi to j after incubation time τ.
•Then hopping dynamics are defined by:
The QK picture
Qi(t) = P 0( )
i! t " 0+( ) + Q
j(t ')K
j ,i(t " t ')#
$%&
0
t
' dt 'j
(
Pi(t) = Q
i(t ') 1" K
i, j() )
j
(#
$*
%
&+d)
0
t" t '
'#
$**
%
&++0
t
' dt '
P
i(t)
Q
i(t)
Ki, j
!( )
17
With the matrixdetermined, compute long time kinetics
Kij!( )
Qi(t) = P
i0( )! t " 0+( ) + Q
j(t ')K
ji(t " t ')#
$%&
0
t
' dt '
Pi(t) = Q
i(t ') 1" K
ij(( )
j
)#
$*
%
&+d(
0
t" t '
'#
$**
%
&++0
t
' dt '
• by direct integration• by Laplace transform (Shalloway)• by trajectory statistics (Vanden Eijnden)
18
Computing the average first passage time fromMilestone 1 to N
! = !12
1( )+ ...+ ! ij
n( )+ ...+ ! kN
L( )"# $%l=1,...,L
& ' p12 ' ' ' pij ' ' ' pkN( )
! ijn( ) is a random variable sampled from Kij !( )
pij = Kij !( )d!0
(
) is the transition probability from i to j
pij = 1j
& A few points:The limit is consideredThe N state is absorbing τNk=0 pNN=1
To average over τ…
L!"
19
Compute <τ>
Define a random matrix T( )ij! pij" ij
the matrix P( )ij! pij of size N # N
the vector "̂L( )! "1
L( )," 2
L( ),...," NL( )( )
T
with elements " iL( )
that are the overall first passage times using L steps
to go from i to N
Then
"̂L( )= P
L$lTP
l$11l=1
L
% = PL$lT1
l=1
L
%
where 1= 1,1,...,1( )T
and P1= 1
1 2 3 4 5 6 Nij
20
The last tricks to compute <τ>For L!" the trajectory is absorbed at Milestone N .
Since the time at N does not count (#NN = 0) $ #̂ L+1= #̂
L
#̂L( )= P
L% lT1
l=1
L
&
#̂L+1( )
= PL+1% l
T1l=1
L+1
& = P PL% lT1
l=1
L
& + T1 = P#̂L( )+ T1
I % P( )#̂ = T1 (remove eigenvector 1 from the set),
T1 = 0 (I - P)1 = 0, define, I ,P,T of size N %1( ) ' N %1( )).
T is a random matrix and the average first passage time
is obtained by avergaing over elements of T . T( )ij
= pij # ij
Only the first moments of # ij are required to compute #̂ .
#̂ = I % P( )%1
T 1
21
Is Milestoning correct (or whatare the assumptions)?
R P
ii !1 i +1
22
Assumption
Let Sj be the hypersurface of Milestone j.
Let Xj be a coordinate vector Xj !R3Nand Xj !Sj .
" Xj( ) is the distribution at Sj initiating the short trajectories.
#ij Xi( ) is the distribution obtained from first passage traj on Sj
if initiated at Si according to " Xi( )
Assumption : #ij Xi( ) = " Xj( )
Sj
Si
Xj
Xi
Implies:• Loss of memory• committers
23
Committers are special surfaces with equal probability ofreaching for the first time the product and the not the
reactants
p1 p2 p3
R
P
Committer surfaces can be calculated exactly (solvingpartial differential equations) in 2-3 dimensions forBrownian dynamics and approximated at higherdimensions and other types of equations of motion.
24
Or a tunnel picture
Relaxation in planes faster than transition between planes.General but heuristic.
25
Toy model: 1D box simulation
• Microscopic dynamics are Brownian• Simulations run at various temperatures
and for 4, 8, and 16 milestones
!dX
dt= "#U + R
R = 0 R t( )R t '( ) = C$ t " t '( )
26
1D reaction curves (5000 trajs/MLST)
27
2D simulation
q
28
Memory loss demonstration
0.15!"!
2
0
6.34, ( )K d! ! ! !"# $
% &' ()!
"
29
Alanine Dipeptide
JCP, 126, 145104 (2007)
30
Reflecting boundary conditions
Absorbing boundary condition
Preparing initial conditions by samplingq !"
31
Torsion velocity auto-correlation indicates whenmilestoning assumption is being violated
~ 400fsr
!
32
Average Incubation Times vs. Velocity Relaxation Time
r! !" >
01
1( )i
M
s
i
K dM
! ! ! !"
=
# $%
1090253581713051137319129375873587431144
(fs)M !
33
Rate Results
34
Reaction curves
35
• Milestoning divides RC into fragments whose kineticscan be computed independently then “glued” together
• Provides factor of improvement in computationalefficiency on serial machines, plus exp bootstrapping
• Uses LFPTDs from microscopic dynamics:• System distribution given by simple integral
equations that can be easily solved numerically• Correct kinetics for solvated alanine dipeptide (x 9
speedup)• Predicts microsecond Scapharca rate with ~10 ns total
serial time• Sub-millisecond rate for myosin recovery stroke with
total run time (serial) - speedup : more than 10,000
Summary
M
( )sP t
( )sK !
±
200sample ! 241mlst ! 0.01ns + 0.1ns ! 241 = 501.6ns
36
Milestoning papers• Ron Elber, "A milestoning study of the kinetics of an allosteric
transition: Atomically detailed simulations of deoxy Scapharcahemoglobin", Biophysical J. ,2007 92: L85-L87
• Anthony M.A. West, Ron Elber, and David Shalloway, "Extendingmolecular dynamics timescales with milestoning: Example ofcomplex kinetics in a solvated peptide", J. Chem. Phys.126,145104(2007)
• Anton K. Faradjian and Ron Elber, "Computing time scales fromreaction coordinates by milestoning", J. Chem. Phys. 120:10880-10889(2004)
• Code (moil + zmoil) available fromhttps://wiki.ices.utexas.edu/clsb/wiki