the effective fragment molecular orbital method

The Effective Fragment Molecular OrbitalMethod

Casper Steinmann1 Dmitri G. Fedorov2 Jan H. Jensen1

1Department of Chemistry, University of Copenhagen, Denmark

2AIST, Umezono, Tsukuba, Ibaraki, Japan

September 14th, 2011

Outline

1 Motivation

2 The Fragment Molecular Orbital Method

3 The Effective Fragment Potential Method

4 The Effective Fragment Molecular Orbital Method

5 Results

2 / 45

Motivation

Treatment of Very Large Systems• We want quantum mechanics (QM) to do chemistry.

• We want the speed of force-fields (MM) to treat large systems.

Usually done via hybrid QM/MM methods

We propose a fragment based, on-the-fly parameterless polarizableforce-field.

• A merger between FMO and EFP.

• FMO: Faster FMO by the use of classical approximations

• EFP: Flexible EFP’s.

3 / 45

FMO

4 / 45

FMO2 MethodUsing a single Slater determinant to represent |ΨX〉, we obtain

fXφXk = ǫXk φX

k .

Here,fX = hX + V X + gX = hX + gX ,

where

V X =

all∑

C∈X

−ZC

|r1 −RC |+

all∑

K 6∈X

∫

ρK(r′)

|r1 − r′|dr′

6 / 45

FMO2 Methodexpanding our molecular orbitals φ in a basis set

φXk =

∑

µ

CXµkχµ

we obtain, the Fock matrix elements of V X

V Xµν = 〈µ|V X |ν〉 =

all∑

K 6∈X

uKµν +

all∑

K 6∈X

υKµν

which are given as

uKµν = 〈µ|

∑

C∈K

−ZC

|r1 −RC ||ν〉,

andυKµν =

∑

λσ∈K

DKλσ(µν|λσ).

7 / 45

FMO approximationsFMO2 formally scales as O(N2), wants to be O(N). We needdistance-based approximations:

RI,J = mini∈I,j∈J

|ri − rj |

rvdwi + rvdwj

• 1) Approximate ESP (Resppc):

uKµν =

∑

λσ∈K

DKλσ(µν|λσ) →

∑

C∈K

〈µ|QC

|r1 −RC ||ν〉

• 2) Approximate dimer interaction (Resdim):

EIJ ≈ EI+EJ+Tr(

DIuJ)

+Tr(

DJuI)

+∑∑

DIµνD

Jλσ(µν|λσ)

Usually, Resdim and Resppc are equal (2.0)

8 / 45

A couple of pictures to help

9 / 45


10 / 45

Covalent Bonds in FMOIn FMO, bonds are detatched instead of capped.

BDA|-BAA

11 / 45

HOP vs. AFOTwo methods in FMO: hybrid orbital projection (HOP) and adaptivefrozen orbitals (AFO)

Both modifies the Fock-operator

fX = hX + gX +∑

k

Bk|φ〉〈φ|

• HOP: External model system generated and used

• AFO: Generated on the fly automatically:

12 / 45

AFO

13 / 45

AFO

14 / 45

AFO

We shall return to AFO later ...

15 / 45

EFP

16 / 45

EFPEFP is an approximation to the RHF interaction energy, Eint

Eint = ERHF −∑

I

E0I ≈ EEFP

The EFP energy

EEFP =∑

I>J

∆EEFPIJ + Eind

total

∆EEFPIJ = Ees

IJ + (ExrIJ + Ect

IJ )

EesIJ using distributed multipoles.

Eindtotal using induced dipoles based on distributed polarizabilities.

17 / 45


Eint = ERHF −∑

I

E0I ≈ EEFP

The EFP energy

EEFP =∑

I>J

∆EEFPIJ + Eind

total

∆EEFPIJ = Ees

IJ + (ExrIJ + Ect

IJ )



• The internal geometry is fixed.

17 / 45


Eint = ERHF −∑

I

E0I ≈ EEFP

The EFP energy

EEFP =∑

I>J

∆EEFPIJ + Eind

total

∆EEFPIJ = Ees

IJ + (ExrIJ + Ect

IJ )



• The internal geometry is fixed.

• You need to construct the EFP’s before you can use them

17 / 45

EFMO

18 / 45

What is the EFMO method?

You start with FMO ...• Remove the ESP

Now you have N gas phase calculations.

Then you mix in some EFP• Use EFP to describe many-body interactions

19 / 45

EFMO RHF Energy

The two-body FMO2 method on a system of N fragments

EEFMO =∑

I

E0I −→ do MAKEFP

+

Resdim≥RI,J∑

IJ

(

E0IJ − E0

I − E0J − Eind

IJ

)

+

Resdim<RI,J∑

IJ

EesIJ

+ Eindtot ,

• QM: Gas phase RHF (and MP2) calculations

• MM: Interaction energies by Effective Fragment Potentials

* obtain E0I , q, µ and Ω and α from RHF via a fake MAKEFP run.

20 / 45


21 / 45


22 / 45

Correlation in EFMOCorrelation as in FMO

E = EEFMO + ECOR.

Here ECOR is

ECOR =

N∑

I

ECORI +

RI,J<Rcor∑

IJ

(

ECORIJ − ECOR

I − ECORJ

)

.

23 / 45

EFMO vs. EFP vs. FMO

EFMO vs. EFP• EFMO energy includes internal energy, i.e. total energy can be

obtained.

• Short range interactions are computed using QM.

we assume Eex and Ect are negligible when RI,J > Resdim.

EFMO vs. FMO• No ESP, i.e. one SCC iteration.

• Many-body interactions are entirely classical.

General EFMO considerations• Calculation of classical parameters on-the-fly.

• Every EFMO calculation requires re-evaluation of EFPparameters.

24 / 45

Rigorous Analysis of Small Water Clusters• Water trimer: Estimate lower bound to energy error

• Water pentamer: Estimate upper bound to energy error

25 / 45

Rigorous Analysis of Small Water TrimerKitaura-Morokuma energy analysis

∆3Eint 6-31G(d) 6-31++G(d)-1.9 kcal/mol -1.45 kcal/mol

Many-body terms ∆3Eind +∆3Exr +∆3Ect +∆3EMIX

26 / 45




∆3Eind 6-31G(d) 6-31++G(d)-0.9 kcal/mol -1.67 kcal/mol

EFMO error 1.0 kcal/mol -0.22 kcal/mol

26 / 45




∆3Eind 6-31G(d) 6-31++G(d)-0.9 kcal/mol -1.67 kcal/mol

EFMO error 1.0 kcal/mol -0.22 kcal/mol

6-31G(d):• Lower bound to the error in energy is ≈ 0.33 kcal/mol/HB

6-31++G(d):• Lower bound to the error in energy is ≈ -0.07 kcal/mol/HB

26 / 45

Rigorous Analysis of Small Water PentamerKitaura-Morokuma energy analysis

∆nEint 6-31G(d) 6-31++G(d)-6.10 kcal/mol -5.14 kcal/mol

Many-body terms ∆nEind +∆nEEX +∆nECT +∆nEMIX

27 / 45




∆nEind 6-31G(d) 6-31++G(d)-1.97 kcal/mol -3.68 kcal/mol

EFMO error -4.13 kcal/mol -1.46 kcal/mol

27 / 45




∆nEind 6-31G(d) 6-31++G(d)-1.97 kcal/mol -3.68 kcal/mol

EFMO error -4.13 kcal/mol -1.46 kcal/mol

6-31G(d):• Upper bound to the error in energy is ≈ 1.03 kcal/mol/HB

6-31++G(d):• Upper bound to the error in energy is ≈ 0.37 kcal/mol/HB

27 / 45

Rigorous analysis of Small Water Clusters6-31G(d):

• Lower bound to the error in energy is ≈ 0.33 kcal/mol/HB• Upper bound to the error in energy is ≈ 1.03 kcal/mol/HB

6-31++G(d):• Lower bound to the error in energy is ≈ -0.07 kcal/mol/HB• Upper bound to the error in energy is ≈ 0.37 kcal/mol/HB

28 / 45

20 water molecule clusters

6-31G(d) ∆E [kcal/mol/HB] [kcal/mol] Resdim = 2.0EFMO 0.53 15.8FMO2 -0.39 -11.6

6-31++G(d)EFMO -0.08 -2.5FMO2 -0.76 -21.8

29 / 45

EFMO Gradient

∂EEFMO

∂xI

=∑

I

∂

∂xI

E0I

+

RI,J≤Rcut∑

I>J

(

∂

∂xI

∆E0IJ −

∂

∂xI

EindIJ

)

+

RI,J>Rcut∑

I>J

∂

∂xI

EesIJ +

∂

∂xI

Eindtotal + TM

xI

TMI is the contribution to the gradient on atom I due to torques

arising from nearby atoms.

30 / 45

EFMO Gradient• For water clusters, ∇XI

EEFMOa − EEFMO

n ≈ 10−4 Hartree / Bohr

31 / 45

EFMO Gradient• For water clusters, ∇XI

EEFMOa − EEFMO

n ≈ 10−4 Hartree / Bohr

"Those are not good gradients."

31 / 45

Wait until you see the covalently bonded systems then.

32 / 45

EFMO for Covalent Systems

Covalent systems pose a problem in EFMO because ...• ... Inherent close (and even overlapping) electrostatics.

• ... Inherent close position of polarizable points and nearbyelectrostatics.

33 / 45

Back to the drawing board

C1 C

H

H

H

H

C5

H

H

C C

H

HHH

HH

+

a) b) c)

34 / 45

Back to the drawing board• 1) The frozen orbital (during the SCF) is allowed to mix during

Foster-Boys localization.

35 / 45


Foster-Boys localization.• 1) Crap. Errors ≈ 50 kcal/mol. FMO2: ≈ 5 kcal/mol

35 / 45



• 2) The localized orbital is kept frozen, i.e. as it is during the SCF.

35 / 45



• 2) The localized orbital is kept frozen, i.e. as it is during the SCF.• 2) Works, Errors grows with system size and are on-par with

FMO2. Requires much more screening.

35 / 45

Energies for Conformers of Polypeptides

k(R, α, β) = 1− exp(

−√

αβ|R|2)(

1 +√

αβ|R|2)

AM,X =1

N

N∑

I

∣

∣EMI − EX

I

∣

∣ .

0.1 0.2 0.3 0.4 0.5 0.6

2

4

6

8

10

12

AEFMO,X

[kca

l/mol

]

Peptide MAD for EFMO/2 vs. Screening Parameter

P1(RHF)P1(MP2)P2(RHF)P2(MP2)P3(RHF)P3(MP2)

36 / 45

Energies for Conformers of Polypeptides

k(R, α, β) = 1− exp(

−√

αβ|R|2)(

1 +√

αβ|R|2)

AM,X =1

N

N∑

I

∣

∣EMI − EX

I

∣

∣ .

P1 P2 P30

1

2

3

4

5

6

7

8

AM,X

[kca

l/mol

]

Peptide MAD for 2 Residues per Fragment

FMO2-RHF/HOPFMO2-MP2/HOPFMO2-RHF/AFOFMO2-MP2/AFOEFMO-RHFEFMO-MP2

37 / 45

Energies for Proteins

Table: Energy Error of EFMO and FMO2/AFO compared to ab initiocalculations on proteins using two residues per fragment.

Nres EFMO FMO2/AFO

Rcut = 2.0 Rcut = 2.0

RHF MP2 RHF MP2

1L2Y 20 3.2 -4.3 1.7 6.41UAO 10 1.8 1.5 0.4 1.4

• Timings: 5 times faster than FMO2.

• Requires lots of screening.

38 / 45

Back to the drawing board• The backbone is the main problem.

• Errors around 10−3(10−4) Hartree / Bohr

• Timings: 1.5 times faster than FMO2-MP2

39 / 45

Timings

5 10 15 20 25 30 35 40Total CPU count

5

10

15

20

25

30

35

40

Gain

-Fac

tor i

n CP

U w

allti

me

Gain-Factor in CPU walltime for increasing CPU count

EFMO, FMO2 40 / 45

Summary• Successful merger of the FMO and EFP method

• For molecular clusters, it performer pretty good.

• For systems with covalent bonds, work is needed.

• Faster than FMO2, roughly same accuracy.

41 / 45

Outlook• EFMO-PCM (meeting with Hui Li tomorrow)

• EFMO QM/MM (Based on FMO/FD)

• More EFP, less QM (Spencer Pruitt)

42 / 45

AcknowledgementsJan H. JensenDmitri G. Fedorov

Bad Boys of Quantum Chemistry:Anders ChristensenMikael W. Ibsen (FragIt)Luca De Vico (FragIt)Kasper Thofte

$$ - Insilico Rational Engineering of Novel Enzymes (IRENE)

43 / 45

Thank you for your attention

proteinsandwavefunctions.blogspot.com

44 / 45

Gradient Contribution

JK

I

dvdu

dw

IuvII

II

r

r1

2

dwmA = −dum

A − dvmA

dumA = wA (τmA · vA)+uA×wA (τmA ·wA)

dvmA = −wA (τmA · uA)+vA×wA (τmA ·wA)

45 / 45

the effective fragment molecular orbital method

Education

e mix3 e ind

e mix26

n e ct

n e ex

e inte int

efp ind eij

kcalmolmanybody terms

system of n fragments