loschmidt.chemi.muni.cz · web viewexploring the challenges of computational enzyme design by...

26
Exploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin Kolev, 1 Jiri Damborsky, 2,3 Zbynek Prokop, 2,3 Arieh Warshel 1 1 Department of Chemistry, University of Southern California, 3620 McClintock Avenue, Los Angeles, California 90089, United States. 2 Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Kotlarska 2, 625 00, Brno, Czech Republic 3 International Clinical Research Center St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic Abstract Rational enzyme design presents a major challenge that has not been overcome by computational approaches. One of the key challenges is the difficulty in assessing the magnitude of the maximum possible catalytic activity. In an attempt to overcome this challenge, we introduce a strategy that takes an active enzyme (assuming that its activity is close to the maximum possible activity), design mutations that reduce the catalytic activity and then try to restore that catalysis by mutating other residues. Here we take as a test case the enzyme haloalkane dehalogenase (DhlA), with a 1,2-dichloroethane substrate. We start by demonstrating our ability to reproduce the results of single mutations. Next, we design mutations that reduce the enzyme activity and finally design double mutations that are aimed at restoring the activity. Using the computational predictions as a guide, we conduct an experimental study that confirms our 1

Upload: others

Post on 20-Feb-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

Exploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a

Dehalogenase

Garima Jindal,1 Katerina Slanska,2 Veselin Kolev,1 Jiri Damborsky,2,3 Zbynek Prokop,2,3 Arieh

Warshel1

1Department of Chemistry, University of Southern California, 3620 McClintock Avenue, Los Angeles,

California 90089, United States.2Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic

Compounds in the Environment RECETOX, Masaryk University, Kotlarska 2, 625 00, Brno, Czech

Republic3International Clinical Research Center St. Anne’s University Hospital, Pekarska 53, 656 91 Brno,

Czech Republic

AbstractRational enzyme design presents a major challenge that has not been overcome by computational

approaches. One of the key challenges is the difficulty in assessing the magnitude of the maximum

possible catalytic activity. In an attempt to overcome this challenge, we introduce a strategy that takes

an active enzyme (assuming that its activity is close to the maximum possible activity), design

mutations that reduce the catalytic activity and then try to restore that catalysis by mutating other

residues. Here we take as a test case the enzyme haloalkane dehalogenase (DhlA), with a 1,2-

dichloroethane substrate. We start by demonstrating our ability to reproduce the results of single

mutations. Next, we design mutations that reduce the enzyme activity and finally design double

mutations that are aimed at restoring the activity. Using the computational predictions as a guide, we

conduct an experimental study that confirms our prediction in one case and leads to inconclusive

results in another case with 1,2-dichloroethane as substrate. Interestingly, one of our predicted double

mutants catalyzes dehalogenation of 1,2-dibromoethane more efficiently than the wild type enzyme.

1

Page 2: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

Significance

The goal of rational computer-aided enzyme design is hampered by the lack of knowledge of the

maximum possible rate enhancement. We address this problem by considering the enzyme DhlA,

which is naturally adapted for the degradation of dihalogenated ethanes. Using empirical valence

bond calculations, we determine the effect of destroying and then restoring the catalytic

efficiency of different mutants. One of our predicted cycle is confirmed experimentally, while the

other attempt remains inconclusive. We believe that the proposed strategy provides a very

powerful way of validating and refining approaches for computational enzyme design.

1. IntroductionThe need to progress with enzyme design is a major practical and fundamental challenge.(1, 2)

Unfortunately, despite an interesting progress(3-6) the main advances have been made by directed

evolution and not by computational design.(7, 8) Furthermore, at present there are major difficulties in

using directed evolution to obtain the catalytic effects that approach those of naturally evolved

enzymes.

Apparently, reasonable attempts to use charge-charge interactions with a high dielectric, to predict the

effect of distanced ionized groups on the reactivity of the substrate (e.g.(1)), are not likely to lead to a

large catalytic effect. Thus, it seems that one must focus on groups at a closed and medium distance to

the substrate (although directed evolution appears to also use distanced groups). In the case of groups

in the proximity of the substrate, it seems that approximations such as the linear response

approximation (LRA) are not likely to give sufficiently reliable results, and one should probably move

to the more expensive empirical valence bond (EVB) calculations. However, even with the EVB

method, it is not clear what does it take to make a reliable prediction. For example, we recently

encountered (9) a major stumbling block in the study of the directed evolution of Kemp eliminases. We

found that the ability to evaluate the catalytic power of each step in the evolution process is not

sufficient for constructing a good catalyst. We have found out that apart from the knowledge of the key

catalytic residues in the active site, we also need to figure out what are the other residues that will lead

to the correct preorganization of the catalytic residues.

In view of the difficulties of obtaining reliable prediction of the sequence needed for optimal catalysis,

we explore here a part of the problem. We try here to computationally mutate residues that are known

to contribute to catalysis and then attempt to find mutations that restore the activity. Such a strategy is

useful since we know the maximum possible catalysis; probably close to that of the wild type (wt)

2

Page 3: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

enzyme, in case of a single free energy mechanism. Thus, we should be able to assess the effectiveness

of some crucial steps in a refined computational design approach. As a model system, we take the

enzyme haloalkane dehalogenase DhlA, which catalyzes the conversion of toxic haloalkanes to

alcohols.(10) DhlA from bacterium Xanthobacter autotrophicus GJ10 converts 1,2-dichloroethane

(DCE) to chloroethanol via a series of steps that involve an SN2 reaction, hydrolysis, isomerization and

a Cl- ion departure.(11) The SN2 step involves the attack of the nucleophilic D124 on DCE, the

formation of an ester intermediate, which subsequently undergoes hydrolysis and leads to the formation

of an alcohol (Fig. 1). The final step involves the departure of Cl- ion from the active site which is

believed to be preceded by an isomerization.

The catalytic triad, which is commonly found in α/β-hydrolases, is composed of D124, H289 and

D260. Other residues, that have been proven important using mutation studies, include W125, W175,

F172, and V226 as shown in Fig 2(a). It has been found using both experimental and computational

studies that W125 and W175 help in stabilizing the departing Cl- ion by forming H-bonds, while F172

has been shown to interact with the DCE substrate. Val226 does not interact with the substrate directly,

though it forms van der Waals interactions with Trp125 and Phe172. The mutants V226A(12) and

F172W(13) are interesting as they lead to a decrease in the rate of the SN2 step, but to an increase in the

rate of the Cl- ion release. The hydrolysis step becomes the rate limiting step in the V226A, F172W and

W175Y mutants. Apart from these, other residues that line the active site cavity include; E56, F128,

F164, F222, L262 and L263. The backbone NH group of W125 and E56 form H-bonds with the

oxygen of the carboxylate of D124. These two amide groups form an oxyanion hole, which is similar to

that in other hydrolases.

The current design study used the x-ray structure of DhlA (pdb code: 2DHC(11)) as a starting point for

the EVB simulations. We started by reproducing the observed effects of single mutations and then

explored the effort needed for computer aided restoration of the wild type activity after computational

mutations of key catalytic residues. This was followed by experimental determination of the effects of

the predicted mutations on the individual catalytic steps, with focus on the rate of SN2 reaction step.

Our computational catalytic restoration is confirmed in one test case, while another case remains

inconclusive.

2. Results and discussion2.1 Computational studies

We started by using the EVB approach to calculate the activation free energy of the SN2 step in the wt

enzyme and several known mutants. The excellent agreement between the calculated and experimental

3

Page 4: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

values (Table 1 and Fig. 3) encouraged us to try to predict mutants that would reduce the catalytic

activity and then to look for other mutants that would enhance the catalytic activity of the enzyme.

Before addressing the above challenge, we explored the role of the E56 residue and its ionization state.

This residue forms a part of the oxyanion hole and its backbone NH forms an H-bond with the D124

nucleophile as shown in Fig. 2(b). Thus, one might assume that the presence of E56 is crucial for the

dehalogenase activity. The pKa of E56 was found to be 9.58, by using the Protein Dipole Langevin

Dipole method within its semimacroscopic-linear response approximation (PDLD/S-LRA)(14). This

suggests that it should be protonated during the reaction. In fact, if E56 is assumed to be ionized, the

activation free energy becomes ~3 kcal/mol higher than when it is unionized. The E56Q mutation led

to only 0.5 kcal/mol increase in the activation free energy relative to that of the wt (where E56 is in the

unionized form). Thus, it appears that E56 is not ionized and it might not be crucial for the catalytic

activity. This offers an attractive target for our mutational study, which is explored below.

Another interesting residue is F128 which is replaced by Ala in the LinB and DhaA enzymes(15, 16),

and it has been postulated that the presence of this residue enables the catalysis of DCE, which is

otherwise not a good substrate for the other two enzymes.(17) It presumably does so by rendering

compactness to the active site. Thus, it would be interesting to see the effect of mutating phenylalanine

to alanine in the current system as well. In the mutant F128A, the active site cavity might enlarge and

can help in the catalysis of larger and bulkier haloalkanes. Additionally, residues F164 and F222 were

considered important in previous studies.(18) We also found it interesting to explore the mutations of

A149 and A227 to charged residues (based on calculating the electrostatic contribution to catalysis of

mutants of these residues). The results of the computationally tested mutants that were inspired by the

above consideration are given in Table 2. Some of the studied mutants led to a significant reduction in

catalysis and thus could serve as candidates for the second step of our study, i.e., experimental

verification.

In looking for a way to restore the activity of the W125F and W175F/Y mutants (Table 1), we noted

that the stabilization offered by the tryptophan residues to the departing Cl- ion is lost in these mutants

(resulting in an increase in the activation free energy). We therefore decided to mutate an additional

residue that can stabilize the Cl- ion. Reasonable candidates were V226 and E56 that were found to be

at a suitable position to interact with the Cl- ion (Fig. 4). Thus, these residues were replaced with N/Q,

where the NH2 side chain can form H-bonds with the Cl- ion.

In deciding which mutations to explore, we propagate MD trajectories with different mutational

options. Our analysis of the trajectories of the W175Y/V226Q mutant reveals that though the NH2

group of the Q226 side chain interacts with the Cl- ion, the existing H-bond between W125 and the Cl-

4

Katka Slánská, 10/23/18,
grammatical error
Page 5: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

ion is lost, thereby resulting in an increase in the activation free energy (Fig. 4). In the W175Y/E56N

mutant, the NH2 group of N56 forms H-bonds with the OH group of Y175, and thus should be useful

for lowering the activation free energy. It should also be noted that in the enzymes LinB and DhaA, the

role of W175 is played by asparagine residue. Thus, we explored the effect of the E56N mutation in

DhlA, to see whether it can play the stabilizing role offered by W175. The EVB calculations of the

double mutants, W125F/V226N and W125F/V226Q, were carried out using different starting

structures. For the W125F/V226N mutant, a total of 7 profile starting from 7 different trajectories were

studied. Out of these 7 simulations, 4 showed a decrease in the activation free energy with respect to

the W175Y mutant, whereas the others showed an increase in the activation free energy. Further

analysis revealed that for the cases where the H-bond between the mutated N226 and Cl - ion is

maintained, a decrease in activation free energy is obtained (15.9 kcal/mol). The W125F/V226Q

mutant also showed a decrease in the activation free energy (13.9 kcal/mol) when compared to the

W125F mutant (16.7 kcal/mol).

1.2 Experimental studies

Based on the above computational design, we have constructed and characterized experimentally four

mutants; E56Q, F164A, W175Y/E56N and W125F/V226Q. The first step involved the construction of

these variants followed by CD spectroscopy, specific activity measurements and transient kinetic

analysis. The specific activity was evaluated for both DCE and 1,2-dibromoethane (DBE).

All mutations were successfully introduced into the dhlA gene. The correct sequence was confirmed by

sequencing of both strands. Variants E56Q, F164A, W175Y/E56N and W125F/V226Q were expressed

in Escherichia coli BL21(DE3) and purified to homogeneity using the metallo-affinity

chromatography. Expression, purity and size of purified proteins were checked by SDS-PAGE.

Purification yields of E56Q, F164A, W175Y/E56N mutants were approximately 35-60 mg.l-1 of

culture. The W125/V226 double mutant was purified with a significantly lower yield, 5 mg.l-1 of

culture, and SDS-PAGE showed that most of the protein remained in the insoluble phase after the cell

lysis (Fig. S2). Proper folding and secondary structure of the newly constructed variants were verified

by far-UV CD spectroscopy (Supplementary Information). Similar to DhlA wt, the mutant enzymes

exhibited CD spectra with one positive peak at about 195 nm and two negative maxima at about 208-

210 nm and 221 nm, characteristic for alpha-helical content. W175Y/E56N retains a very similar

structure to the wt enzyme, while for the remaining variants, small changes in intensity and shape of

the spectra in comparison to the wt are visible. Observed variations reflect a partial loss of alpha-helical

structure caused by introduced mutations, implying slight conformational changes of the enzymes.

5

Page 6: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

Information obtained using SDS-PAGE and far-UV CD spectroscopy (+) for the W125/V226 variant

suggest that the protein is not properly folded and has limited solubility. Therefore, this variant was not

studied by the pre-steady-state-kinetics.

The rapid mixing of DhlA wt and its variants with DCE were associated with kinetic quenching of

fluorescence intensity of endogenous tryptophan (Fig. 6). The initial kinetic phase of fluorescence

traces was fitted to a single exponential to provide kobs and amplitudes for each substrate concentration

(Fig. 7). Fitting the dependence of observed rate on DCE concentration by Equation 1 (of the method

section) provided unique estimates for all rate constants (Table 3) of corresponding minimal reaction

pathway for DhlA wt and W175Y/E56N (Scheme 1). The dependence of amplitude on concentration of

DCE was used to calculate an estimate of apparent binding constant Ks,app for E56Q (Equation 2) and

W175Y/E56N. By using Ks,app as a fixed parameter, the initial estimates for k2 and (k3 +k4) constants

parameters could be obtained also for E56Q and W175Y/E56N by fitting the dependence of observed

rate on DCE concentration using Equation 1. The obtained rate constants were then used for calculation

of Ks using Equation 2. The equilibrium binding constant derived in this step was then used to fit again

the dependence of observed rate to provide final estimates of k2 and k3 constants. This fitting cycle was

repeated to verify the limits of the derived values and converge to a common solution.

Kinetic fluorescence quenching has not been detected for F164A. This result indicates that the SN2

reaction became the rate-limiting step for the conversion of DCE by this variant and the reaction does

not provide any observable accumulation of alkyl-enzyme intermediate. Since the first chemical step

determines the steady-state turnover, the steady-state equilibrium constant Km closely indicates

substrate binding equilibrium Ks. The 5.2 ± 0.9 mM value for the equilibrium binding of the DCE to

F164A is similar to the values estimated for wt enzyme. This result suggests that the binding affinity of

DCE has not been changed for the F164A variant.

The comparison between the computationally predicted and experimental activation free energies are

shown in Table 4. While for E56Q and W175Y/E56N the results agree with the computations, F164A

shows a variation. E56Q mutation shows that the E56 residue is not very important and its role can be

played by other residues as well. For W175Y/E56N which includes two mutations, we were able to

restore the catalytic activity. That is, although, the activity is lower than that of the wt (15.6 kcal/mol

for W175Y/E56N and 14.4 kcal/mol for the wt), it shows an improvement over the W175Y mutant

(18.3 kcal/mol). These results are encouraging and may provide a guide for the design of

dehalogenases. The probable reason behind the low activity of F164A might be a change in other steps

of the reaction. In this respect we note that our EVB calculations only reflect the energetics of the SN2

step and that there are no available values for the separate rate constants. F164A mutation might be

6

Page 7: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

affecting the hydrolysis or Cl- release step resulting in a low kcat. For the W125F/V226Q mutant, the

mutations might cause destabilization of the enzyme. In fact, Rossetta (19) estimated a 8 kcal/mol

destabilization in this double mutant in comparison to the wt enzyme (Supplementary Information).

Although this result might be a significant overestimate, the fact that no activity is detected, suggests

that the active site region changes significantly. Interestingly, mutants E56Q and W175Y/E56Q show a

stabilization when compared to the wt, whereas F164A and W125F/V226Q show destabilization

(Supplementary Information). Of course, we can try to use more reliable and time consuming ways to

assess the change in stability upon mutations (e.g., free energy perturbation methods) but such studies

are out of the scope of the present work.

The specific activities of DhlA wt and its variants toward DCE and DBE were assayed at 37°C (Table

5). For both substrates W125F/V226Q does not show any activity, while E56Q shows an improved

activity with DCE. Interestingly, all mutants show enhanced activity with DBE as the substrate. The

difference in the effect of mutations on two closely related substrates is remarkable. For DBE, the kcat

of the SN2 step, calculated using EVB simulations, remains largely unaffected in comparison to the

wild type. Thus, the enhanced activity of the mutants might be due to change in the hydrolysis or Cl -

ion release. The activities of E56Q and W175Y/E56N toward DBE are significantly improved over the

wt. Further studies aimed at understanding the origin of this difference in behavior of DCE and DBE

will be carried out.

3. ConclusionsThis work explored the issue of enzyme design focusing on one of the key difficulties in validating the

successes of the design process. That is, in trying to design effective enzyme it is not clear what is the

limit of maximum possible improvement. Thus, we “modified” the challenge and address the problem

of taking a known enzyme, destroying the catalytic residues and trying to restore catalysis. In this way

we know what is the likely maximum possible catalytic effect, and have a relatively well-defined

problem. In fact, even if there is a possible better enzyme than the wt enzyme, reaching the wt

activation free energy is a well-defined challenge that can provide a powerful guide for the successes of

enzyme design strategies.

In the present case we chose DhlA, noting several specific challenges. The reaction catalyzed by DhlA

is multistep and therefore a single mutation can change the identity of the rate-limiting step. Thus,

while a new mutant might lower the free energy for the SN2 step, it might have a negative effect on

other steps. For instance, the V226A mutation (12) decreases the activation free energy for the Cl-

7

Page 8: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

release while simultaneously increasing the free energy for the SN2 step, thereby resulting in a very

small increase in the overall kcat (3.3 for wt versus 3.8 for V226A mutant).

The success reported in Fig. 3 indicates that our approach allows one to determine the catalytic power

of a given mutant, whose structure is approximately known. However, as we found recently the

challenge is to design an optimal enzyme, where the active site groups are preorganized in the correct

way by other residues. The agreement between the computationally predicted activation energies and

experiments for mutants; E56Q and W175Y/E56N is remarkable. Additionally, the higher specific

activities with DBE as the substrate are noteworthy. Thus, it is encouraging to see that we obtained

some success in active site restoration in the current study. Of course, this idea should be extended in

the future to more than two mutations.

4. Methods 4.1 Computational Methods

The starting structure for the EVB calculations(20-22) was taken from the x-ray crystal structure of

DhlA (pdb code: 2DHC(11)). In this structure, the substrate is bound to the protein which circumvents

the docking approach. Additionally, previous computational studies involving dehalogenase have also

considered this PDB structure for the simulations. The EVB calculations were carried out using the

MOLARIS package with the ENZYMIX force field. The ESP charges for the two diabatic states that

represent the reactant and product were calculated using the B3LYP level of theory at the 6-311+G**

basis set using Gaussian software (23). The EVB region consists of the acetate group of Asp124 and

DCE. The FEP/US approach was used to calculate the activation free energies. The center of the

reactive species was taken as the center of the system which was immersed in a water sphere of 18 Å

and solvated using the surface constrained all atom solvent (SCAAS) model.(24) The long range

effects were treated using the local reaction field (LRF) method.(25) The system was first relaxed for at

least 100 ps using a step size of 1 fs and then 3 different starting structures were generated. For the FEP

mapping, the simulation was divided into 51 frames and each frame was simulated for 30 ps with a step

size of 1 fs resulting in a total simulation time of over 1.5 ns. The specific EVB parameters used in the

current calculations are given in the Supplementary Information.

4.2 Experimental Methods

Pre-steady state kinetics

Multiple-turnover stopped-flow fluorescence technique was used to study the DCE conversion

catalyzed by DhlA wt, E56Q, F164A, and W175Y/E56N. The experiments were performed by using

8

Page 9: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

SFM-300 stopped-flow instrument combined with MOS-200 spectrometer (BioLogic, France).

Fluorescence emission from Trp residues was observed through a 320 nm cut-off filter upon excitation

at 295 nm. All reactions were performed at 37°C in 100 mM glycine buffer and pH 8.6. The relaxations

of active site Trp fluorescence upon starting the reaction were recorded stepwise at different substrate

concentrations. The fluorescence traces were fitted to a single or double exponential using KinTek

Explorer (KinTek Corporation, USA). The dependence of observed rate and amplitude on DCE

concentration was fitted by Equation 1 and 2, respectively, by. nonlinear regression using Origin 8.0

(OriginLab corporation, USA).

❑❑❑❑

❑❑❑❑❑❑ (1)

A=[ S ] .❑lim ¿

K S , app+[S]¿

(2)

The corresponding kinetic Equation 1 and 2 were was derived from minimal kinetic pathway (Scheme

1) (27).

Scheme 1

where S and P is a substrate and a product, respectively, ES is enzyme substrate complex, EI is

covalent alkyl-enzyme intermediate, EP is enzyme product complex, KS is equilibrium dissociation

constant for enzyme substrate complex and ki is individual rate constants of ith step. Further details of

the methods are provided in the Supplementary Information. Analytical procedure has been used for

data analysis: (i) the fluorescence traces was fitted to a single exponential to provide observed rates and

amplitudes, (ii) Equation 1 describing the dependence of observed rate on substrate concentration was

9

Katka Slánská, 10/23/18,
k4 step added into the scheme
Page 10: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

derived from minimal reaction pathway (Scheme 1), and (iii) fitting the kinetic data (the dependence of

observed rate on DCE concentration derived from exponential fitting of fluorescence traces in the step

i) by Equation 1 provided estimates for the rate constants. The fitting of dependence of amplitude (A)

on concentration of DCE by hyperbolic model Equation 2, where Alim is the upper limit for the

observed amplitude, was used to calculate an estimate of the apparent equilibrium binding constant

Ks,app.

AcknowledgmentsThe authors would like to express thanks to Dr. Tana Koudelakova (Masaryk University, Brno, Czech

Republic) for collaboration on construction and purification of the mutant proteins. This work was

supported by NIH grants GM-24492 and R35GM-122472, and the Grant Agency of the Czech

Republic (GA16-07965S) and the Ministry of Education, Youth, and Sports of the Czech Republic

(LQ1605, LO1214, LM2015051, LM2015047, LM2015055). Dr. Jindal and Dr. Warshel would like to

express their gratitude to Mr. Dibyendu Mondal for helpful discussions. We also would like to thank

the University of Southern California’s High Performance Computing for computer time. Furthermore,

we are grateful for a generous computing time from Extreme Science and Engineering Discovery

Environment’s (XSEDE) Comet facility at the San Diego Supercomputing Center.

References

1. Frushicheva MP, Cao J, Chu ZT, & Warshel A (2010) Exploring challenges in rational enzyme design by simulating the catalysis in artificial kemp eliminase. Proc. Natl. Acad. Sci. USA 107(39):16869-16874.

2. Frushicheva MP, Cao J, & Warshel A (2011) Challenges and Advances in Validating Enzyme Design Proposals: The Case of Kemp Eliminase Catalysis. Biochemistry 50(18):3849-3858.

3. Kries H, Blomberg R, & Hilvert D (2013) De novo enzymes by computational design. Curr. Opin. Chem. Biol. 17(2):221-228.

4. Siegel JB, et al. (2010) Computational design of an enzyme catalyst for a stereoselective bimolecular Diels-Alder reaction. Science 329(5989):309-313.

5. Röthlisberger D, et al. (2008) Kemp elimination catalysts by computational enzyme design. Nature 453(7192):190.

6. Kiss G, Çelebi‐Ölçüm N, Moretti R, Baker D, & Houk K (2013) Computational enzyme design. Angew. Chem. Int. Ed. 52(22):5700-5725.

10

Page 11: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

7. Frushicheva MP, et al. (2014) Computer aided enzyme design and catalytic concepts. Curr. Opin. Chem. Biol. 21:56-62.

8. Roca M, Vardi-Kilshtain A, & Warshel A (2009) Toward accurate screening in computer-aided enzyme design. Biochemistry 48(14):3046-3056.

9. Jindal G, Ramachandran B, Bora RP, & Warshel A (2017) Exploring the Development of Ground-State Destabilization and Transition-State Stabilization in Two Directed Evolution Paths of Kemp Eliminases. ACS Catal. 7(5):3301-3305.

10. Janssen DB (2004) Evolving haloalkane dehalogenases. Curr. Opin. Chem. Biol. 8(2):150-159.

11. Verschueren KH, Seljée F, Rozeboom HJ, Kalk KH, & Dijkstra BW (1993) Crystallographic analysis of the catalytic mechanism of haloalkane dehalogenase. Nature 363(6431):693.

12. Schanstra JP, Ridder A, Kingma J, & Janssen DB (1997) Influence of mutations of Val226 on the catalytic rate of haloalkane dehalogenase. Protein Eng. 10(1):53-61.

13. Schanstra JP, et al. (1996) Kinetic characterization and X-ray structure of a mutant of haloalkane dehalogenase with higher catalytic activity and modified substrate range. Biochemistry 35(40):13186-13195.

14. Warshel A, Sharma PK, Kato M, & Parson WW (2006) Modeling electrostatic effects in proteins. Bba-Proteins Proteom 1764(11):1647-1676.

15. Nagata Y, et al. (1997) Purification and characterization of a haloalkane dehalogenase of a new substrate class from a gamma-hexachlorocyclohexane-degrading bacterium, Sphingomonas paucimobilis UT26. Appl. Environ. Microbiol. 63(9):3707-3710.

16. Kulakova AN, Larkin MJ, & Kulakov LA (1997) The plasmid-located haloalkane dehalogenase gene from Rhodococcus rhodochrous NCIMB 13064. Microbiology 143(1):109-115.

17. Damborský J & Koča J (1999) Analysis of the reaction mechanism and substrate specificity of haloalkane dehalogenases by sequential and structural comparisons. Protein Eng. 12(11):989-998.

18. Kmuníček J, et al. (2001) Comparative binding energy analysis of the substrate specificity of haloalkane dehalogenase from Xanthobacter autotrophicus GJ10. Biochemistry 40(30):8905-8917.

19. Kellogg EH, Leaver‐Fay A, & Baker D (2011) Role of conformational sampling in computing mutation‐induced changes in protein structure and stability. Proteins 79(3):830-838.

20. Warshel A & Weiss RM (1980) An empirical valence bond approach for comparing reactions in solutions and in enzymes. J. Am. Chem. Soc. 102(20):6218-6226.

21. Kamerlin SC & Warshel A (2011) The empirical valence bond model: theory and applications. Wiley Interdisciplinary Reviews: Computational Molecular Science 1(1):30-45.

22. Kamerlin SC & Warshel A (2010) The EVB as a quantitative tool for formulating simulations and analyzing biological and chemical reactions. Faraday Discuss. 145:71-106.

23. Frisch MJ, et al. (2009) Gaussian 09 Rev. D.01Wallingford, CT).

11

Page 12: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

24. King G & Warshel A (1990) Investigation of the free energy functions for electron transfer reactions. J. Chem. Phys. 93(12):8682-8692.

25. Lee FS & Warshel A (1992) A local reaction field method for fast evaluation of long‐range electrostatic interactions in molecular simulations. J. Chem. Phys. 97(5):3100-3107.

26. Shurki A, Štrajbl M, Villà J, & Warshel A (2002) How Much Do Enzymes Really Gain by Restraining Their Reacting Fragments? J. Am. Chem. Soc. 124(15):4097-4107.

27. Schanstra JP, Kingma J, & Janssen DB (1996) Specificity and kinetics of haloalkane dehalogenase. J. Biol. Chem. 271(25):14747-14753.

28. Kennes C, et al. (1995) Replacement of tryptophan residues in haloalkane dehalogenase reduces halide binding and catalytic activity. The FEBS Journal 228(2):403-407.

29. Krooshof GH, et al. (1998) Kinetic analysis and X-ray structure of haloalkane dehalogenase with a modified halide-binding site. Biochemistry 37(43):15013-15023.

Figures

Fig. 1. The SN2 step and the hydrolysis steps in the haloalkane dehalogenase DhlA catalyzed

conversion of DCE to chloroethanol and a Cl- ion.

(a) (b)Fig. 2. (a) The active site of DhlA with the substrate DCE and the residues that have been proven to be

important, using both experiments and computations. (b) Residues lining the active site that have been

studied in the current study. The dashed lines show the important H-bonding interactions.

12

Page 13: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

Fig. 3. Comparison of observed and calculated activation free energies (kcal/mol). The error bar on

water is not shown as this value is chosen arbitrarily during EVB parametrization.

Fig. 4. Attempt to restore the catalytic activity of the W175Y and W125F mutants. The mutated

residues are highlighted in blue. Distances are given in Å.

13

Page 14: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

Fig. 5. Calculated activation free energies (kcal/mol) for different mutants in the restoration cycle. The

observed values are provided in parentheses. Distances are given in Å.

14

Page 15: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

A B

C D

Fig. 6. Fluorescent traces recorded by multiple-turnover stopped-flow analysis upon rapid mixing of

6.8 μM DhlA wt (A), 6.8 μM E56Q (B), 6.2 μM F164A (C) and 6.7 μM W175Y/E56N (D) with

different DCE concentrations. Each trace shown is the average of five to ten individual experiments.

A B

Fig. 7. Dependence of the observed rate (A) and amplitude (B) on the concentration of DCE for

reactions with DhlA wt (●), E56Q (□) and W175Y/E56N (○).

15

Page 16: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

Tables

Table 1. Calculated and observed activation free energies (in kcal/mol) for different mutantsa

System ΔG‡2,obs ΔG‡

2,cal

Water (cage) 23.6 25.4b`

wt 15.3c 14.5W125F 17.6d 16.7F172Y 17.3d 15.7F172W 16.8c 17.3W175F 18.3e 17.9W175Y 18.3d 18.2V226A 16.0c 14.0

aEnergies in kcal/mol. bIn contrast to the regular EVB procedure we did not insist on calibrating the water reaction on the observed value since we are more interested in the effects of mutations. The value for water (cage) is taken from ref. (26). cBased on k2 - values for wt, F172W and V226A are taken from (13), (27) and (12), respectively. dBased on kcat - values for W125F, F172Y and W175Y are taken from (28), (13) and (29), respectively. e Based on specific activity of the W175Y mutant (29).

Table 2. Activation free energies (kcal/mol) calculated for different mutants

System ΔG‡2,cal

wt (E56 ionized) 18.2

wt (E56 unionized) 14.5E56Q 15.0A149D 13.0A227D 15.4F128A 14.4F164A 12.9F222A 15.0W125F/V226N 15.9/21.9a

W125F/V226Q 13.9W175Y/V226N 21.8W175Y/V226T 17.9W175Y/E56N 15.8W175Y/A149D 18.81BEE.pdb (W175Y mutant) 18.2 (18.3)b

1BEE.pdb+Y175Wc (conf1) 18.31BEE.pdb+Y175W (conf2) 14.1

a EVB calculations starting from different initial geometries gave different results. The lower activation energy is obtained for cases when the NH2 group of 226Q mutant forms H-bond with the departing Cl-

16

Page 17: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

ion. b The experimental activation free energy is given in parenthesis. cA back mutation of Y to W, where 1BEE.pdb serves as the starting point for the simulation of the W175Y mutant.

Table 3. Estimates of kinetic constants for the conversion of DCE by DhlA wt and its mutants.

DhlA wt

E56Q F164A W175Y/E56N

Ks (mM) 3.7 ± 0.5 9 5.4 ± 1.2a n.d. 3029 ± 102a

k2 (s-1) 201 ± 9 16 ± 9 n.d. 26 25 ± 10

k3 + k4 (s-1) 23 ± 3 27 26 ± 4 n.d. 9 10 ± 2

Km (mM) 0.5 ± 0.1 1.2 ± 0.1 5.2 ± 0.9 1.3 ± 0.1

kcat (s-1) 2.2 ± 0.1 1.1 ± 0.1 0.05 ± 0.01 0.31 ± 0.01

n.d. - not detected; acalculated from Ks,app (lower limit for Ks) calculated based on Equation 2.

Table 4. Calculated and observed activation free energies (kcal/mol)

a Energies in kcal/mol. bThe observed value is taken from k2. Experiments are done at 37°C in 100 mM glycine buffer at pH 8.6. Please note that the EVB calculations are done at the standard temperature of 300 K. cThe observed value is calculated from kcat of the whole reaction. For W175Y mutant, the observed value is taken from ref. (29).

Table 5. Specific activities of studied mutants toward DCE and DBE. Experiments were done at 37°C in 100 mM glycine buffer at pH 8.6. Concentrations of substrates were 12.4 mM and 11.3 mM for DCE and DBE, respectively. Enzyme concentrations varied from 52 to 282 nM depending on the catalytic activity.

Enzyme Specific activity/μmol.min-1.mg-1

DCE DBE

DhlA wt 2.0 ± 0.2a 2.9 ± 0.1

E56Q 2.5 ± 0.1 5.1 ± 0.3

F164A 0.022 ± 0.002 3.1 ± 0.1

W175Y/E56N 0.6 ± 0.2 4.2 ± 0.2

17

System ΔG‡2,cal ΔG‡

2,obs, ΔG‡obs

DhlA wt 14.5 14.4a E56Q 15.0 15.9a

F164A 12.9 19.4b

W175Y/E56N 15.8 15.6a

W175Y 18.2 18.3b W125F/V226Q 13.9 n.a.

Page 18: loschmidt.chemi.muni.cz · Web viewExploring the Challenges of Computational Enzyme Design by Rebuilding the Active Site of a Dehalogenase Garima Jindal, 1 Katerina Slanska, 2 Veselin

W125F/V226Q n.a.b n.a.b

a Values are given with standard deviations calculated from three independent runs. b n.a. no activity detected.

18