nmr gui ded approach to evolution of myoglobin protein
TRANSCRIPT
i
NMR guided approach to evolution of myoglobin protein into a catalyst &
AzulenylAlanine as a probe for mechanistic studies of antimicrobial peptides
A Thesis Submitted in Partial Fulfillment of the Requirements of the Renée Crown University Honors Program at
Syracuse University
Christos P Costeas
Candidate for Bachelor of Science in Biochemistry and Renée Crown University Honors
Spring 2020
Honors Thesis in Biochemistry
Thesis Advisor: _______________________ Dr. Olga Makhlynets
Thesis Reader: _______________________ Dr. Calros A. Castañeda
Honors Director: _______________________ Dr. Danielle Smith, Director
ii
Abstract 1) New catalytic properties have been generally introduced through a “bottom-up” directed evolution-based approach or through a rational design-based approach using structural information of the protein or computational simulation analysis. Recently, Reetz et al. demonstrated the only reported example of a redox mediated Kemp eliminase through coordination of the Kemp substrate to the Fe (II)-heme center using the heme containing cytochrome P450-BM3 which was then improved using rational design. This paper describes NMR-guidance approach as a novel, straightforward method for the design of new enzymes with non-natural functions. We speculated that differences in the HSQC spectra between the substrate analog bound and unbound states of a protein can be used to identify potentially important residue-substrate interactions. We chose, myoglobin as the protein scaffold and Kemp elimination as the novel function to be introduced. Through screening for accelerated formation of a yellow 2-cyano-4-nitrophenol product and based on NMR analysis, mutations were introduced near the heme. NMR successfully predicted T95 as an important residue for directed evolution. Both the facts that T95, is located outside the binding pocket, rendering it not possible to have otherwise been predicted and that the T95V variant showed more than ten-fold improvement in catalysis, underscore NMR-guidance as a simple, logical and powerful strategy for directed evolution. 2) A series of antimicrobial peptides (AMPs) with broad spectrum bactericidal activities have been identified in buffalo species. Buffalo CATHL4s (buCATHL4s) are newly identified AMPs that showed the ability to disrupt the membrane integrity of selected Gram positive (G+) and Gram negative (G-) bacteria and induced specific changes such as budding and pore-like structure formation on the bacterial membrane. Using a phylogenetic analysis, a conserved CATHL4 AMPs sequence was selected, modified and studied in collaboration with Gregory A. Caputo and his group, for its use as a potentially novel antimicrobial therapeutic agent. A therapeutic agent needs to avoid or adequately delay enzymatic degradation in order to act in the human body. This paper demonstrates how the introduction of an unnatural amino acid, β-(1-Azulenyl)-L-Alanine (AzAla) can resolve this issue. It has been previously shown by the Korendovch group that AzAla, a fluorescent pseudoisosteric analog of tryptophan, contrasting other tryprophan analogs, is very minimally affected by its local microenvironment and can be selectively excited at a more distant wavelength (342 nm) than most intrinsic fluorophores. AzAla displays simple single exponential fluorescence decay that allows for easy deconvolution of fluorescence lifetime data. This paper shows that AzAla can successfully replace tryptophan to form a novel peptide with the conserved structure and thus function of the corresponding AMP, but with the added advantages that AzAla has to offer, including its fluorescent properties and proteolytic enzyme resistance.
iii
Executive Summary
One of nature’s benevolent creations are elegantly constructed protein scaffolds made
from strategically placed amino acid residues. The structure and amino acid sequence of many
natural protein catalysts enable the catalysis of several reactions that would generally require
very harsh conditions or that would even be impossible to perform in the laboratory1.
Humans are constantly fascinated by nature’s infinite wisdom and magnificent creations.
Dr. Frances Arnold has dedicated a big part of her career in studying and performing protein
directed evolution. In fact, she was the recipient of the Nobel Prize in Chemistry 2018 for her
work2–5 conducting the directed evolution of enzymes, proteins that catalyze chemical reactions.
The ultimate purpose of protein engineering and evolution is to pioneer or discover proteins that
could, through their properties or function, be important tools for technological, scientific, or
medical applications6.
A protein’s function, its expression level, catalytic activity, and its various properties can
be directly linked to its amino acid sequence. This is therefore precisely what scientists target
when attempting to alter or engineer a protein’s properties and function7.
Different strategies are currently being used in the field for improving a protein’s
enzymatic activity. For instance, Dr. Arnold, inspired by natural evolution, used directed
evolution, entailing sequence diversification to generate a library of modified sequences,
followed by screening to identify variants with improved properties8. Another commonly used
method is computational-aided design, in which residue changes in the protein scaffold of the
wildtype are tested in simulated conditions and then the corresponding mutations are introduced,
and the mutant protein is tested in real life conditions9. The aforementioned methods have been
commonly used in research; however, they have various drawbacks. Concerning directed
iv
evolution, it is worth noting that the cycles of random mutagenesis and screening of all the
variants in each library is time-consuming6. In regard to computational design, very expensive
and high-performance technology is required, as well as highly trained personnel to use the
software. Additionally, simulations may not take into account all real-life conditions, thus not
yielding the same results as the experiments in real-life.
Taking into account the potentials of this upcoming field of protein evolution/design and
the drawbacks of pre-existing methods, we were inspired to find and test a method that will
provide a simple, logical and efficacious strategy for directed evolution, namely using HSQC
(Heteronuclear Single Quantum Correlation Spectroscopy) NMR Spectroscopy10. This NMR
spectroscopic technique allows the detection of interactions at the amino acid residue-specific
level or “atomic level.” When our protein is bound to a substrate or a substrate analog, we
predict that there will be a more significant difference in chemical shifts of free and bound states
in the residues involved with the enzyme substrate interaction, thus indicating the appropriate
protein part(s) to be altered, so as to improve protein activity.
A natural defense mechanism conserved in the innate immune system of all organisms,
including plants, animals and humans, involves antimicrobial peptides (AMPs). Cathelicidins are
AMPs that belong to the large group of positively charged peptides with amphipathic properties
and embody the main part of the immune system in many vertebrates, including humans and
farm animals. Numerous AMPs are stored in secretory granules of certain white blood cells and
can be released to fight against bacteria, enveloped viruses and fungi11.
Until now, several cathelicidins have been identified in cattle, horse, pig, sheep, deer,
chicken and many more species; however, recently new short buffalo cathelicidin peptides with
potent bactericidal properties non-toxic to mammalian cells were identified. These cathelicidins
v
may have potential translational applications for the development of novel antibiotics. However,
most natural peptides are susceptible to enzyme degradation when introduced to the human body.
To counter this issue, studies have shown that the introduction of non-natural amino acids into
peptide sequences inhibit or delay the recognition and subsequent degradation of the peptides by
human enzymes12.
In the development of peptides, studying their environment, structure, location and
properties fluorescent emission characteristics of its amino acid components have been used.
Tryptophan (Trp) has been very useful due to it being excited at different wavelengths than other
amino acids13. However, its emission is highly influenced by its environment which may cause
an issue in the accuracy of certain investigations. This issue14 has also been recently countered
using a non-natural amino acid, β-(1-azulenyl)-L-alanine, (AzAla). AzAla maintains constant
fluorescence properties in various environments and has even more distinct excitation
wavelength than the other natural amino acids. Furthermore, this amino acid mimics
tryptophan’s structure very closely, making it highly probable that if it is used to replace Trp it
may not change the peptide’s original structure15.
Using the newly found buffalo cathelicidins’ most conserved peptide sequences16, a novel
potentially antimicrobial peptide was designed, and AzAla was used to replace Trp at different
positions to investigate if resistance to enzymatic degradation and the fluorescent advantages of
AzAla could be incorporated into this design without altering its structure-dependent
antimicrobial function.
vi
Table of Contents
Abstract……………………………………….……………….………………………………. ii Executive Summary………………………….……………….………………………………. iii Acknowledgements...….…….………………………………………………………………... vii Chapter 1: Introduction………………………...……………………………………………… 1 Chapter 2: Rational Design of a Myoglobin Based Enzyme………………………...……… 10 Chapter 3: Azulenyl-Alanine Modified Antimicrobial Peptide…………………….…….… 22 Chapter 4: Material and Methods.……………….………………………………………...… 29 Chapter 5: Discussion………………………….……………………………………………… 37 References.………………………………...…………………………………………………… 41
vii
Acknowledgements
Research has been a huge part of my time here at Syracuse. I first fell in love with the
laboratory when I started working on the evolution of Myoglobin protein using a novel technique
in order to introduce a new enzymatic function. After my first semester in the lab, I couldn’t get
enough of it. For this I have to first thank Dr. Olga Makhlynets, my PI and mentor who has
welcomed me to work in her Laboratory and has introduced me to the various topics of research
conducted in her lab. I want to thank Dr. Areetha Renita D'Souza, my Post Doc mentor who has
guided me over the years and taught me most of what I know today about lab-etiquette and
research techniques. I also want to thank Mr. Sagar Bhattacharya and Ms. Jennifer Halim Yoon
who have introduced me into the wonders of the biochemistry laboratory when I first started, and
who have collaborated with and supported me throughout various projects. Furthermore, I
would like to thank Dr. Ivan Korendovych and the members of the Korendovych Lab, which
have also guided me throughout my time working in the Laboratory, for their constructive
feedback during lab meetings, and the summers we have spent together in the lab. Finally, I
would like to thank the Honors, Biology and Chemistry Departments for their support, and for
giving me the opportunity to put my research work together and present it in my Biochemistry
thesis.
1
Chapter 1: Introduction
Enzymes are biological protein catalysts defined by their ability to catalyze biochemical
reactions by accelerating the conversion of substrates into products in the active site of the
enzyme. Enzymes have many functional attributes and often evolve from a common ancestor-
sharing sequence and structure similarity. The Enzyme Commission (EC) of the Nomenclature
Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB),
initiated in 1956 by experts in the field, presented a framework for enzyme classification17.
According to the EC enzymes are classified into six different classes according to the type of
chemistry being carried out: 1) oxidoreductases which catalyze oxidation/reduction reactions, 2)
transferases, which transfer a chemical group, for example, a methyl or glycosyl moiety, 3)
hydrolases perform hydrolysis of chemical bonds, 4) lyases which also cleave chemical bonds by
other means than by oxidation or hydrolysis, 5) isomerases that catalyze geometric and structural
changes between isomers and 6), ligases which join two compounds with associated hydrolysis
of a nucleoside triphosphate molecule17. These EC classes are further divided into subclasses and
sub-subclasses in line with a variety of criteria such as the chemical bond cleaved or formed, the
reaction center, the transferred chemical group, and the cofactor used for catalysis. The final
level of classification defines substrate specificity1.
Enzyme evolution is crucial for the ability of organisms to adapt to the changing
environmental conditions and sustenance of their survival and reproduction. The ability to adapt
to chemical changes in the environment drives the gradual modification of enzyme function
through biological selective forces18. On the other hand, laboratory-guided protein evolution
exploits various techniques for the generation of protein variants and selection of desirable
functions. In the last decades, laboratory-directed protein evolution had a significant impact in
2
the chemical, pharmaceutical, and agricultural industries. Many times, enzymatic catalysts are
superior in many industrial processes because of their high selectivity and minimum energy
requirement6.
Directed enzyme evolution is attempting to address many shortcomings of biological
enzymes including the need for high availability at low cost, overcoming product inhibitions or
even design enzymes that are required for specific reactions that have yet to be identified6. In
order to accelerate directed protein evolution, scientists utilize various techniques for generation
of protein mutants and selection of desirable functions6. Protein engineering has grown
considerably with the development of new advanced recombinant DNA technologies and high-
throughput screening technologies6.
A group of enzymes that have been the target of directed evolution are cytochrome P450
enzymes. Cytochrome P450 enzymes is a very diverse set of heme-containing proteins, that
serve a broad spectrum of functions including catalyzing hydroxylation, N oxidation;
sulfoxidation; epoxidation; N, S, and O dealkylation; peroxidation; deamination; desulfuration;
and dehalogenation. In mammals, they catalyse reactions important in drug metabolism, blood
hemostasis, cholesterol biosynthesis, and steroidogenesis19. One example of directed enzyme
evolution of cytochrome P450 enzymes is the modification of the wild-type P450 BM-3, which
is specific for long-chain fatty acids20. Using X-ray crystallography, eight target amino acids
were identified and used for the creation of libraries by site-specific randomization mutagenesis.
Variants of each residue were screened by a spectroscopic assay using omega-p-
nitrophenoxycarboxylic acids as substrates20. Further analysis of the variants generated, led to
the identification of a variant able to efficiently hydroxylate indole, resulting in the formation of
indigo and indirubin20,21. The same variant was later found to be able to hydroxylate several
3
alkanes and alicyclic, aromatic, and heterocyclic compounds, all of which are non-natural
substrates for the wild-type enzyme 20.
Despite the decades of effort and the progress made in directed protein evolution,
designing efficient artificial enzymes remains a big challenge. Recently, Ehud Gazit and
colleagues showeed that supramolecular assemblies formed by a single amino acid residue in the
presence of zinc form incredibly efficient hydrolytic catalysts 22. Currently, all reported examples
of supramolecular catalysts inherently rely on very basic physico-chemical principles and
subunit structures. In general, enzymes rely on precise positioning of multiple amino acid
residues in space, aided by complex dynamics of substrate binding, product formation and
release to perform their functions23. The ability of successfully predicting the structures of
catalytically active assemblies and, subsequently, the necessary building blocks to achieve the
desired structures are of the ad most importance for further advancements in the field 23.
Using a rational design of protein evolution based on a thorough study of the structural
and mechanistic analysis, Makhlynets et al. modified a Due Ferri (DF) metalloenzyme to a single
metal ion-binding protein with zinc affinity that is higher than any other reported model
systems24. Metalloenzymes often utilize radicals in order to facilitate chemical reactions25.
DeGrado and coworkers discovered that model proteins can efficiently stabilize semiquinone
radical anion produced by oxidation of 3,5-di-tert-butylcatechol (DTBC) in the presence of two
zinc ions26,27. Makhlynets et al. showed that the number and the nature of metal ions have a
relatively minor effect on semiquinone stabilization and that a single metal ion can efficiently
stabilize semiquinone radical anion produced by oxidation of 3,5-di-tert-butylcatechol (DTBC)24.
They also demonstrated that hydrophobic sequestration, and interactions with the hydrophilic
residues in the protein interior are critical to create the right conditions for the catechol couple as
4
compared to bulk aqueous solution24. Makhlynets et al. created a modified single metal ion-
binding protein (4G-UFsc) with pM affinity for zinc that is higher than that of any other reported
model systems and comparable to many natural zinc-containing proteins using site-specific
mutagenesis to widen the channel that leads to the metal center and an in-depth structural
characterization of substrate interaction with the apo protein by NMR24.
Heme proteins are the most abundant and most widely used metalloporphyrins in nature.
They are divided in three main categories: 1) those which transport or store oxygen such as
hemoglobins and myoglobins, 2) those which have enzymatic activity such as cytochrome P450s
and 3) those involved in electron transfer as part of the electron transport chain. Natural heme
enzymes are involved in both reductive and oxidative chemistry28. Among those with oxidative
activity are oxygenases that use O2 to oxidize, usually oxygenate, substrates and peroxidases that
use H2O2 to oxidize substrates28.
Because of the diversity of functions of heme proteins and especially cytochrome P450s,
they have been the center of attention for protein engineering. With the development of better
analysis tools such as X-ray crystallography and NMR Spectroscopy, and molecular site specific
mutagenesis, a large number of new genetically encoded cytochrome P450 enzymes were
designed, many of which were adapted to catalyze unnatural reactions such as cyclopropanation,
N–H insertion, C–H amination, sulfimidation, and aziridination reactions, that no other natural
counterparts can2.
5
Figure 1. Natural reactions catalyzed by cytochrome P450s (blue) and new, non-natural reactions catalyzed by enzymes derived from cytochrome P450 by protein engineering and evolution (red). Adopted from Arnold FH. The nature of chemical innovation: new enzymes by evolution2.
The Kemp Elimination reaction is widely used by chemists as a model reaction for proton
transfer. The reaction involves a base-catalyzed ring opening of a benzisoxazole by cleaving a
N−O bond along with the deprotonation of carbon (C-3), producing o-cyano phenol
derivatives29. In the case where a carboxyl group exists on carbon-3, the corresponding reaction
is referred to as Kemp decarboxylation. This simple, one-step transformation without any
intermediate reaction, is highly sensitive to medium effects being over 108 times faster in an
aprotic dipolar solvent (e.g., CH3CN) than that in an aqueous solution29. Those characteristics
made the Kemp Elimination reaction a favorable model for studying many enzyme-catalyzed
proton transfer reactions, in which the carboxylate, amino, and imidazole group of amino acid
residues (histidine) on enzymes can function as general bases 30 (Figure 2).
6
Figure 2. The mechanism of both the Kemp elimination and Kemp decarboxylation. Adopted from Kemp Elimination. (2010). Comprehensive Organic Name Reactions and Reagents30.
In the first part of this study (Chapter 2), we used a rational design of protein
evolution guided by differences in HSQC NMR spectra of the Myoglobin protein in an
attempt to introduce novel Kemp eliminase activity to a non-enzymatic heme-containing
protein.
William F. DeGrado and his team used a computational and NMR guided minimalistic
approach to design a small (75-residue) de-novo Ca2+ dependent Kemp eliminase (AlleyCat for
ALLostEricallY Controlled cATalyst) by introducing a single mutation F92E in Calmodulin C-
terminus31. Calmodulin, a calcium-binding protein, was selected as the backbone because it had
no enzymatic activity and was small enough to facilitate NMR as well as crystallographic studies
and allow for ease in peptide synthesis or E. coli expression31.
One of the challenges of the study of cytochrome P450 protein is the ability to quantify
the amount of active P450 protein in a particular sample. Assays that measure P450 activity by
measurement of the oxidation of substrate drugs or other surrogate compounds are not practical
7
as they do not apply to all the P450 proteins32. Indirect P450 mRNA, or immunochemical
analysis of the samples may not be representative of the heme bound active protein levels33.
Guengerich et al. have recently described an assay of cytochrome P450 proteins based on the
spectral difference of reduced and carbon monoxide bound P450 proteins which is simple and
practical in routine research settings in the context of the stoichiometry of active P450 per unit
protein33.
Kemp elimination is usually catalyzed through an acid base mechanism, however, Reetz
et al. demonstrated the first example of redox catalyzed mechanism of kemp elimination reaction
using a heme containing protein, Cytochrome P450 34. Inspired by this discovery, we decided to
test our NMR guided approach to introduce this reaction mechanism to some protein which did
not previously perform this function. Cytochrome P450 however is a large protein with over 470
amino acid residues making it difficult to characterize by NMR. We selected myoglobin, as it is
a well-studied, small (156 amino acid residues) monomeric hemeprotein with no enzymatic
activity. Myoglobin is an excellent scaffold for NMR studies due to its small size, well-folded
structure and easy purification.
In the second part of this study (Chapter 3), we attempted to use a rational design
approach to make a novel buffalo cathelicidin-based antimicrobial peptide with proteolytic
resistance and solvent-independent emission properties.
Cathelicidins are an ancient class of small, cationic, antimicrobial peptides (AMPs).
These peptides are part of the innate immune system of many vertebrates. A series of new
homologs of cathelicidin4 (CATHL4), which were structurally diverse in their antimicrobial
domain, was identified in buffalo. AMPs of newly identified buffalo CATHL4s (buCATHL4s)
8
displayed potent antimicrobial activity against selected Gram positive (G+) and Gram negative
(G-) bacteria16. The peptides took the form of different secondary structure conformations in
aqueous and membrane-mimicking environments. Simulation studies suggested that the
amphipathic design of buCATHL4 allowed water permeation following membrane disruption.
Biswajit Brahma’s et al. study suggests short buffalo cathelicidin peptides with potent
bactericidal properties and low cytotoxicity have potential translational applications for the
development of novel antibiotics and antimicrobial peptidomimetics16.
Most peptides with natural amino acid sequences are readily degraded by proteolytic
enzymes. A challenge with the use of AMPs as antimicrobial therapeutics in the context of
chronic wounds or eye infections is their susceptibility to proteolytic degradation35. Tryptophan
substitution reduce, through amidation and acetylation, the proteolytic susceptibility of Catenin
derived peptides while also improving antimicrobial performance35. Similarly, the susceptibility
of AMPs to proteolytic degradation as they pass through the gastrointestinal path hinder the
possible use of AMPs in orally administered therapeutic applications35.
Fluorescence spectroscopy is an important tool in the study of proteins and their
interactions with their local environment. Emission characteristics of a protein depends on its
amino acid content and to a great extent on Tryptophan’s (Trp) ability to be excited separately
from all other amino acids36. The fact that Trp emission is highly dependent on solvent exposure
and is influenced by its interactions with neighboring amino acid side chains, complicates the use
of fluorescence spectroscopy in certain experimental settings. Experimental condition shifts due
to solvent changes or peptide interactions can disturb the spectral properties of the protein under
investigation15. It was recently demonstrated that the replacement of Tryptophan with β-(1-
azulenyl)-L-alanine can overcome the solvent sensitivity of spectral emission and introduce a
9
more favorable spectral properties (stable excitation at 342 nm) while retaining the native
protein’s characteristics36.
10
Chapter 2: Rational Design of a Myoglobin Based Enzyme
Replicating Reetz et al. CYT-P450-BM3 data34:
E. coli BL21(DE3) cells were transformed with the plasmids prsfDuet-1 (Novagen)
containing CYT-P450-BM3 and P450-BM3 A82F variant genes. CYT-P450-BM3 and P450-
BM3 A82F variant were produced and purified for biochemical characterization (Figure 3).
Figure 3. SDS (10%) page analysis indicating successful expression of protein expected at 120kDa. SDS page analysis of CYT-P450-BM3 purification using imidazole gradient: Lane 1: marker, 2: after induction, 3: load,4: flow through, 5: 25mM wash (100ml), 6: 50mM wash (100ml), 7-10: 100mM imidazole fractions, 11-14: 250mM imidazole fractions. (Left) SDS page analysis of CYT-P450-BM3 purification using imidazole gradient: 1: marker, 2: after induction, 3: after sonication, supernatant, 4: after sonication pellet, 5: flow-through, 6: wash-50mM imidazole, 7: A82F-250mM imidazole (right). Samples with persistent impurities were further purified through a Q-Sepharose column using FPLC.
To determine the concentration of each protein, carbon monoxide difference spectrum
was performed. Only the reduced ferrous form of the hemeprotein reacted with carbon monoxide
(CO) to form a complex that specifically produces a spectrum with a wavelength maximum at
450 nm. This would serve as an accurate way to establish the amount of functional P450 protein
for kinetic assays.
The reduced state will have a greater absorbance at 450 nm whereas the oxidized state
that will not bind to CO will have greater absorbance at 420 nm evident in both protein in Figure
11
4. Absorbances between 400 nm and 490 nm of the reduced and oxidized proteins were
subtracted and a ferrous CO versus ferrous difference spectrum diagram is prepared (Figure 5).
Figure 4. UV-Vis spectroscopy of CYT-P450-BM3 (right) and P450-BM3 A82F (left) in 50 mM potassium phosphate solution at pH 8.0 (Buffer), oxidized protein with buffer (A82F or P450-BM3 ox), oxidized protein with CO (A82F_CO or P450-BM ox_CO), and reduced protein in buffer with CO (A82F_CO_reduced or P4500BM3 red_CO). 1/10 diluted protein sample used in anaerobic cuvettes, bubbled CO using balloon attached to a syringe and needle, syringe was rinsed with 20mM sodium dithionite (freshly prepared), syringe was used to mix protein with reducing agent.
By subtracting the differences in absorbance at 490 nm of the ferrous CO and ferrous
proteins (ΔA490) from the differences between the two at 450 nm (ΔA450) and divide the answer
by 0.091 nmol/ml (extinction coefficient) the concentrations of each protein were estimated33.
Alternatively, using the ferrous CO versus ferrous difference spectrum (Figure 5), the amplitude
of the positive peak at 450 nm can be used and divided by 0.091nmol/ml to calculate the
concentration of each protein33.
12
Figure 5. Ferrous · CO versus ferrous difference spectrum used for the quantitation of active CYT-P450-BM3 (right) and P450-BM3 A82F (left) concentrations.
In their paper, Reetz et al. had established that CYT-P450-BM3 undergoes Kemp
elimination and then attempted to reshape the large substrate binding pocket of P450-BM3 to
better match the small size of the substrate34. Since the A82F mutation in P450-BM3 was
previously shown to greatly enhance the enzyme’s binding affinity for small molecules, they
decided to introduce the mutation into WT P450-BM3 which showed 129-fold improvement in
the catalytic efficiency (kcat/Km)34. We also found an increase in catalytic efficiency (Figure 6),
a tenfold improvement, which is lower but a significant one non-the-less. The inconsistency
could be due to their WT kinetic values being estimated due to having difficulty in solubilizing
the substrate for that reaction.
13
Table 1. Summary of kinetic parameters for P450 and variant catalyzing Kemp elimination generated by our laboratory and by Reetz et al. (25o C, 50 mM sodium phosphate buffer pH 8.0, 100 mM NaCl, 0.25 mM NADPH, 100 nM or 300 nM purified enzyme, 5% acetonitrile, pH 7.0)34.
Protein kcat (s-1) KM (mM) kcat/KM (M-1s-1)
P450-BM3 WT 0.48 ± 0.08 0.124 ± 0.05 3871
P450-BM3 WT,
Reetz et al. >1.5 >6 240 ± 60
P450-BM3 A82F 7.16 ± 2.02 0.255 ± 0.12 38990
P450-BM3 A82F,
Reetz et al. 8.4 ± 0.4 0.27± 0.03 31,000 ± 1,500
Figure 6. Kemp elimination of P450-BM3 A82F vs P450-BM3 WT.
14
Wildtype Myoglobin Expression and Purification:
Wild-type (WT) myoglobin (mb) cloned into pEXP5-NT/TOPO expression vector, was
expressed in BL21 (DE3) E. coli cells and purified using a nickel affinity column. The
polyhistidine-tag was expressed attached to the protein as it is used to purify the protein from the
bacteria lysate by affinity chromatography due to the histidine’s imidazole ring selectively
binding to the Nickel column. Although, according to literature this method could be used for
the purification of Myoglobin, after several attempts, it was found that the his-tag tagged
Myoglobin protein was difficult to elute from the column. It was speculated that the twelve
surface histidine residues in Myoglobin’s structure (Figure 7) might be sufficient, without a
polyhistidine-tag, for the protein to attach to the nickel affinity column. Therefore, the WT-mb
gene was cloned into a pET-28a(+) vector (without a his-tag) and competent BL21 E. coli cells
were used for transformation and expression of the protein.
Figure 7. Myoglobin with twelve surface histidines (blue) can bind to the nickel column without any His6-tag requirement. (PDB ID: 1MBN)
15
Myoglobin H64V Mutagenesis
The first mutation introduced to WT-mb was a valine substitution at position 64 (Figure
8). His64 is a distal histidine that helps myoglobin stabilize bound oxygen molecules. Thus, in
order to inhibit mb from performing its current activity –storing oxygen by binding it to heme-
we wanted to free up the heme containing site. Furthermore, valine being a smaller, hydrophobic
residue was selected to provide easier access of the substrate to the heme binding site.
Figure 8. Myoglobin H64V in pET28a (+) Colony screen. 1%agarose, 150V, TaqPCR protocol, Lane (1): 200 bp O’range Ruler DNA marker. Lane (2)-(16): Colonies mixed with GoTaQ-Green Master Mix. Colonies in lanes (6)-(16) show successful expression of H64V-mb gene. Colony represented in Lane (14) was selected for transformation.
Upon protein purification of the WT-mb and H64V mutant UV Vis absorbance was used
to measure concentrations. The spectrum of reduced Mb has prominent peaks (Q bands) in the
visible range at about 540 and 580 nm, and its Soret band is at 417 nm with ε417 = 128,000 M−1
cm−1 37. It was observed however that the heme iron was oxidized from Fe(II) to Fe(III) from
the sharp soret band at 409 nm, Q bands at 504 and 535 nm, and a CT1 band at 634 nm,
characteristic of a six-coordinated high-spin heme with a histidine residue (His-93) and a water
molecule bound at the fifth and the sixth coordination position of the iron atom, respectively37.
For the redox mediated kemp elimination Fe (II) iron is required. To reverse iron oxidation and
reduce it back to Fe (II), dithionite was used as a reducing agent as indicated by the red shifts of
16
the soret band from 409 nm 417 nm (Figure 9). Even though 3 equivalence of dithionite was
sufficient to completely reduce WT-mb, H64V proved to be very easily self-oxidized upon
exiting the glove box thus ascorbic acid was used as a co-reducing agent to stabilize it during the
progression of Kemp elimination assay. Kemp elimination failed to show any significant
catalytic activity in either the WT-mb (Kcat/KM of 13 M-1s-1 ) or the H64 mutant (kcat/KM of 18 M-
1s-1 ) (Table 2).
Figure 9. UV-Vis analyses of oxidized (red circles) and reduced (blue squares) forms of WT myoglobin (left). UV-Vis analyses of oxidized (red circles) and reduced (blue squares) forms of Mb-H64V variant (right). Myoglobin H64V/V68A Mutagenesis:
Without serving any significant catalytic activity in either WT or H64V mb, a second
mutation was introduced by SOE PCR at position 68 changing the wild type valine into an
alanine. Egeberg et al. (1990) proposed that the naturally occurring Val residue indirectly
stabilizes bound O2 by orienting the ligand for more efficient hydrogen bonding to His6438.
Steric considerations indicate that this residue itself actually hinders oxygen binding by
restricting the rotational freedom of the bound O2 and distorting the Fe-O-O bonds38. The latter
17
effects serve to explain why the V68A substitution would further increase the size of the binding
pocket allowing for easier access of the substrate. The V68A was introduced at to the H64V
mutant creating a H64V/V68A Double mutant Myoglobin.
Kemp Elimination of Myoglobin H64V vs H64V/V68A:
Upon successful purification and reduction of the H64V/V68A double mutant by
dithionite, a Kemp elimination assay was performed as described in the materials and methods.
Preliminary results demonstrated an improvement in increased Kemp elimination rate of the
double mutant compared to the H64V mutant and the V68A mutant (Figure 10).
Figure 10. Kemp elimination of Myoglobin H64V vs H64V/V68A. HSQC NMR titration of H64V/V68A:
HSQC NMR titration was performed using 1H-15N double-labeled H64V/V68A-mb
protein expressed in BL21 (DE3) E. coli cells. Comparison between the free and bound state (to
a non-catalyzable substrate analog of Kemp) showed some shifts in residue peaks, indicating
18
possible interaction between the substrate and some residues. After assignment of the amino
acid residues, it was observed that the residues with greatest chemical shifts were T95 and K96
(Figure 11).
Figure 11. HSQC NMR titration of Myoglobin H64V variant (reduced form) in absence of substrate analog (blue) and Myoglobin in presence of two equivalents of analog (red). (left) T95 and K96 residues presented the greatest shifts (right).
After further investigation of the 3D structure of the protein, it was found that the T95
and K96 residues were situated around the outside of the heme binding pocket (Figure 12). This
displays the incompatibility of the heme binding site with the Kemp substrate supporting the low
Kemp elimination results.
19
Figure 12. Residues (T95 and K96) with the greatest shifts from NMR titration (red) shown in sticks (right) are located on the outside of the binding pocked as shown on the surface representation (blue) of Myoglobin (left) (PDB ID: 1MBN). Myoglobin T95 and K96 Random Mutagenesis:
Using the H64V/V68A double mutated Myoglobin plasmid as template, the codons of the
T95 and K96 residues were randomized using an NNK library, to investigate if a different
residue at that position could provide a substrate interaction that would improve the proteins
ability for Kemp elimination.
Bacterial colonies generated from the NNK library were picked to perform a crude
extract screening to test the speed of the production of Kemp catalysis product. The crude
screening of the T95 NNK library indicated two colonies which demonstrated a significantly
greater Kemp elimination activity compared to the H64V/ V68A controls (Figure 13). On the
contrary none of the K96 showed any significant activity increase.
After sequencing, both T95 NNK library “hits,” they turned out to both be valine mutants
at the 95th position. Thus, the triple mutant H64V/V68A/T95V protein was then expressed and
purified, reduced and tested for a Kemp elimination assay. It was found that the triple mutant
had a ten-fold increase in activity compared to the H64V/V68A double mutant (Table 2).
20
Figure 13. Representative sequencing results of the NNK libraries of Myoglobin, showing the randomization of the codons (black:G, Blue:C, Red:C, Green:A) for the selected residues (right). T95NNK library crude lysate screening: Column 1: Blank, 11 (A,B,C,D): Sacrificed, 2 (A,B,C,D): H64V/V68A (control), Rest: T95NNK Clones (left). Table 2. Kinetic parameters of the Myoglobin variants (Kcat, KM and kcat/KM values are reported at pH 7.0).
Mb-Variants WT H64V H64V/V68A H64V/V68A/T95V
kcat/KM, M-1s-1 (pH 7.0) 13 18 199 2660
Mb-Variants kcat (s-1) KM (M) kcat/KM (M-1s-1)
H64V/V68A 0.394 0.00198 199 H64V/V68A/T95V 1.011 0.00038 2660
Conclusions:
Firstly, through generating comparable Kemp assay data for CYT-P450-BM3 and P450-
BM3 A82F, we were able to replicate and confirm Reetz et al.’s findings34 that heme containing
proteins can undergo directed evolution towards improved Kemp catalytic activity. Furthermore,
in our paper we present the first ever report on Kemp eliminase activity of myoglobin, and the
second example of redox-mediated Kemp eliminase. Furthermore, we have shown how NMR
successfully predicted T95 as an important residue for directed evolution, a residue that is found
outside the binding pocket and could not have been otherwise predicted by any other method. By
21
evolving the identified residue and producing the T95V containing triple mutant we have showed
more than ten-fold improvement in catalysis (Figure 14), a clear display of the capabilities of our
novel NMR-guidance method as a simple, logical and powerful strategy for Directed Evolution.
Figure 14. Kemp elimination catalyzed by Myoglobin H64V/V68A (red) and H64V/V68A/T95V (blue) variants. Conditions: 20 mM HEPES; pH 7.0; 0.25 μM protein.
22
Chapter 3: Azulenyl-Alanine Modified Antimicrobial Peptide
Reasoning for the chosen sequence:
Based on consensus determined after phylogenetic analysis of the conserved CATHL4
AMPs sequence (Figure 1) the peptide sequence A-I-P-W-I-W-I-W-R-L-L-R-K-G (AG14) was
selected (in collaboration with Gregory A. Caputo and his group) for the basis of the
development of a potentially novel antimicrobial therapeutic agent (Figure 15).
Figure 15: Phylogenetic relationship and consensus of amino acid residues of the 12 CATHL4 AMPs. The Neighbor-joining tree construction was based on complete nucleotide sequences of the CATHL4 variants (NCBI Accession ID: KJ173930—KJ173976). The compiled most conserved amino acid sequence reads as: AIPWIL(/W)IWW(/R)LLF(/R)KG.
Most peptide with natural amino acid sequences are readily degraded by proteolytic
enzymes. The susceptibility of peptide-based materials to proteolytic degradation as they pass
through the gastrointestinal path are a challenge for therapeutic applications. Thus, we will be
investigating the introduction of artificial amino acids in order to reduce the susceptibility of the
potential therapeutic peptide to trypsin proteolytic degradation without compromising its
structure and therefore without its function being altered.
AzulenylAlanine substitution of tryptophan-containing peptides:
The tryptophan (Trp) residues in the initial sequence of the peptide AG14 to
accommodate a tryptophan mimic: AzulenylAlanine (AzAla, Z) (Figure 16). Substitution of Trp
to AzAla has been shown to have a minimal effect on properties of peptides. Furthermore,
23
AzAla also has a rich absorption and fluorescence spectra (340nm) with bands distinctly
different from those of tryptophan (280 nm) useful for monitoring protein interactions.
Figure 16. Structures of Trp and AzAla.
Three different AG14 variants, AG14W4Z, AG14W6Z and AG14W8Z (in which in the
fourth, sixth and eighth position respectively tryptophan was substituted with AzAla) were
synthesized as per the materials and methods (Figure 17).
Figure 17. Primary structures of AG14 peptide and its Az-Ala variants.
In order to check if there is a change in the protease susceptibility of the peptide before
and after the introduction of AzAla enzyme digestion experiments were performed using HPLC
(High Performance Liquid Chromatography) and MALDI-TOF (Matrix-assisted laser
desorption/ionization - time-of-flight mass spectrometer) to closely monitor the rate digestion of
each (Figure 18).
A-I-P-W-I-W-I-W-R-L-L-R-K-G (AG14) A-I-P-Z-I-W-I-W-R-L-L-R-K-G (AG14W4Z) A-I-P-W-I-Z-I-W-R-L-L-R-K-G (AG14W6Z) A-I-P-W-I-W-I-Z-R-L-L-R-K-G (AG14W8Z)
24
Figure 18. Overlay of chromatograms acquired at various time points after peptide and trypsin were mixed. The inset graph shows the percentage of undigested peptide peak area monitored over time. Identity of peaks was established by MALDI-TOF using the sample that was digested for 4 hours (top). MALDI-TOF analysis of trypsin digest product at retention time of 8.1 min for AG14 collected after 4 hours. The peak at 1237 Da corresponds to AIPWIWIWR peptide segment (grey). MALDI-TOF analysis of the trypsin digest product at retention time of 8.4 min collected for AG14 after 4 hours. The peak at 1801.6 Da corresponds to the undigested peptide(pink). MALDI-TOF analysis of the trypsin digest product at retention time of 8.7 min for AG14 collected after 4 hours. The peak at 1617.9 Da corresponds to AIPWIWIWRLLR peptide segment (blue). Table 3. MALDI-TOF signal and expected mass of digested peptide sequences.
Peptide sequence Position of the peak in
the chromatogram
MALDI-TOF
signal, Da
Expected
mass, Da
AIPWIWIWR 1 1237 1240
AIPWIWIWRLLRKG* 2 1801 1806
AIPWIWIWRLLR 3 1618 1622 * amide at C-term
25
Trypsin is an endopeptidase the carboxyl side (or "C-terminal side") of the amino acids
lysine and arginine except when either is bound to a C terminal proline. The aspartate residue
(Asp 189) located in the catalytic pocket (S1) of trypsin is responsible for attracting and
stabilizing positively charged lysine and/or arginine, and is, thus, responsible for the specificity
of the enzyme.
AG14 and AzAla variants were incubated with trypsin and samples were taken at regular
time points and identified by HPLC and MALDI. The mass spectrometry results suggest that the
peptide (AG14) was cleaved between R9 and L10 (AIPWIWIWR- 1244Da, LLRKG 587 Da)
and between R12 and K13 (AIPWIWIWRKKR- 1628 Da). Trypsin digestion of the AzAla
variants suggests that the Z substitution served as a way to reduce the proteolytic susceptibility
of AG14 as shown by AG14 (Figure 18). The greatest decrease in susceptibility was observed in
AG14W8Z where the location of the unnatural amino acid is closest to the expected sites of
cleavage (Figure 19).
Figure 19. Absorbance of AG14 and AzAla variants monitored over time on the analytical HPLC in the presence the lower decrease of the integrated HPLC of trypsin (0.1 µg) for 2.5 hours, integrated for the peptides at peptide peaks over 2.5 hours compared to Abs 220 nm.
26
In order to test whether the structure of AG14 was retained by the modified peptides as
well, which could play an important role in their potential antimicrobial properties, Circular
Dichroism (CD) spectroscopy experiment was performed. Briefly, CD is defined as the unequal
absorption of left-handed and right-handed circularly polarized light. Based on the polarization
pattern observed we can determine information regarding the secondary structure of the peptide.
Figure 20. CD Spectra of AG14 and AzAla variants in 5mM phosphate, 10mM NaCl pH 7.0 (left). CD Spectra of AG14 and AzAla variants in POPC vesicles (250uM) (right). The pathlength was 1cm for both.
Circular Dichroism (CD) spectroscopy of AG14 and its AzAla variants was performed in
aqueous conditions as well as within POPC vesicles. The CD spectra showed the replacement of
W with Z at the aforementioned positions did not alter the original structure of AG14, and the
peptides maintained the same structure they had in the aqueous buffer, within the vesicles as
well. Furthermore, the absorption patterns of the peptides: negative bands at 222 nm and 208 nm
and a positive band at 193 nm, are characteristic of the expected alpha helical structure (Figure
20).
27
Figure 21. AzAla: Fluorescent spectra of excitation wavelength 342 nm and emission recorded over the range of 355 nm − 455 nm (left). Tryptophan: Fluorescent spectra of excitation wavelength 280 nm and emission recorded over the range of 300-400 nm (right).
Going from Z4 to Z6 and Z8 we observe a red shift in the peak of Trp’s spectral
maximum emissions. This is consistent with changing the microenvironment of Trp towards
more polar conditions. Based on this observation (Figure 21) we can postulate the possibility that
Trp’s in Z4 are found more buried and inside the peptides’ conformation and being somewhat
isolated from the hydrophilic solvent. Conversely, the AzAla spectral fluorescence indicate
possible quenching of signal emissions from the AzAla amino acid of Z4 as indicated by the
lowest peak (Figure 21). This supports that the AzAla amino acid of Z4 may be found on the
outer side of the peptide exposed by possible surrounding quenchers. This speculation is in
accordance with the earlier one giving us possible insight to the folding of the peptide.
Conclusions:
Introduction of AzAla in the place of Trp has decreased the peptide’s susceptibility to
trypsin digestion with the greatest effect observed in AG14W8Z where the location of the
unnatural amino acid is closest to the expected sites of cleavage. Through CD analysis, we have
28
observed that the replacement of Trp by AzAla did not cause change in the peptide structure
compared with the original AG14, nor in hydrophilic (5mM phosphate), neither in hydrophobic
(POPC vesicles) environments, thus it is expected that the antimicrobial function of the peptide
will also remain intact.
29
Chapter 4: Material and Methods (adopted from the Makhlynets Lab)
Cloning and mutagenesis:
WT Myoglobin gene (cloned into pET22b(+) vector from Novagen) was a generous gift
from the Fasan lab (University of Rochester). All myoglobin mutants reported here were cloned
into pET28a vector using NcoI and XhoI restriction enzymes. pET28a contains both N-terminal
and C-terminal His6-tag. However, mutant genes were cloned into that expression vector without
any His6-tag since the gene with twelve histidine residues was found to bind to Ni-NTA column
itself without any requirement of His6-tag. Saturation mutagenesis to generate all possible
mutants at a specific position (NNK libraries) was done using megaprimer protocol with Phusion
DNA polymerase (New England Biolabs).
Megaprimer protocol is based on two-step PCR. First PCR with T7 forward or T7 reverse
and mutagenic primer combination (as appropriate) generates the megaprimer which comprises
the desired NNK library. Temperature gradient is decided based on the melting temperatures of
the primers used. Generally, annealing at a temperature 5ºC above the lower Tm primer for 30 s
works very well. Extension is performed at 72ºC for 60 s. Once the megaprimer is generated
successfully, it can be used directly for the second PCR without any purification step. Annealing
is performed with temperature gradient from 75ºC to 65ºC for 30 s. Extension time generally
varies from 5 min to 7 min at 72ºC. After the PCR reaction, the template was digested with
restriction enzyme Dpn-I (1 L per 25 μL reaction, incubation for 6-7 h at 37ºC) to eliminate
parental DNA and then transformed into E. coli commercial C2987H cell (NEB5). Sequences of
the mutated genes were confirmed by Sanger sequencing (Genewiz, Inc.).
Site-directed mutagenesis to generate a specific mutant at a specific site was performed
by using SOE PCR protocol instead of megaprimer PCR. The second step of megaprimer PCR
30
did not work efficiently for Myoglobin (DNA template size: ~6000 bp). SOE PCR is also a two-
step PCR (like megaprimer PCR already discussed above). In this protocol, first PCR generates
two fragments, which comprise the mutation at desired position. Annealing temperature is
decided based on the melting temperatures of the primer pair and following GoTaq polymerase
protocol. Generally, annealing at 60ºC for 30 s works well for the first step. Extension is
performed at 72ºC for 45 s. Once the mutant fragments are designed successfully, they act as
templates for the second PCR, where the fragments are mixed well in the same PCR tube.
Annealing is performed for 30 s with step-gradient from 58°C to 63°C with 1°C increment of
temperature. Rest of the conditions for extension remain the same as for the first PCR. Purified
PCR products are directly used for cloning into pET28a vector using NcoI and XhoI restriction
enzymes and cloning mixture is transformed into E. coli commercial C2987H cell (NEB5α).
Colony screening is done with at least ten colonies to find positive hits. Sanger sequencing
(Genewiz, Inc.) confirmed sequences of the mutated genes.
Library screening:
Individual clones obtained after transformation of the NNK libraries into the BL21
(DE3) pLysS cells were inoculated into 200 μL of Luria Bertani (LB) supplemented with 50
μg/mL of kanamycin and 34 μg/mL of chloramphenicol in 96-well plates and grown for 6-7
hours at 37°C with shaking at 230 rpm. Aliquots of 20 μL of the resulting cultures were added to
400 μL of the 1X LB medium (supplemented with 50 μg/mL of kanamycin, 34 μg/mL of
chrolamphenicol) in deep well plates. 0.3 mM of δ-aminolevulinic acid was added as heme
precursor and overexpression was induced by adding 0.25 mM IPTG (isopropyl-β-D-1-
thiogalactopyranoside) when OD600 of cultures reached ~0.8. The cultures were grown at 25ºC
for 20 h and centrifuged at 2,000 g for 15 min; the pellets were lysed with a buffer containing 20
31
mM HEPES, pH 7.0, 0.5% triton X and centrifuged again for 15 min to clarify the lysates. The
activities of the crude lysates were tested with Kemp (0.4 mM) in 20 mM HEPES, pH 7.0 buffer
containing 1 mM of ascorbate, 1 μM of SOD and 20 nM of catalase maintaining reducing
condition. Absorbance of the product was monitored at 380 nm. The plasmids for the most active
clones were extracted and re-transformed into BL21 pLysS. The positive hits were confirmed in
triplicate in confirmation screening under the same conditions as described above. The plasmids
for the clones with higher activity were extracted and sequenced using T7 or T7 term primers (as
appropriate).
Protein expression:
A plasmid encoding a gene with the desired protein sequence was transformed into E.
coli BL21 (DE3) cells and grown overnight at 37°C on an agar plate containing kanamycin (50
μg/mL). Growth media described below was also supplemented with kanamycin. The next day, a
single colony from a fresh agar plate was grown in 50 mL Luria Bertani (LB) medium overnight.
The next morning 10 mL of the overnight culture was diluted with 1 L of 1X LB media and
incubated at 37°C until OD600 reached 0.8. Then the culture was cooled down to room
temperature on ice and δ-aminolevulinic acid (0.3 mM) was added as heme precursor. After 15
min, IPTG (isopropyl-β-D-1-thiogalactopyranoside) was added to a final concentration of 0.25
mM for induction. After incubation at 25ºC for 20 h, the cells were harvested by centrifugation
(4,000 g for 30 min). The typical yield of wet cell paste was ~5.0 g per 1 L of culture. The cell
paste was flash frozen in liquid nitrogen and preserved at -80°C until further use.
Protein purification:
Cells were resuspended in buffer containing 25 mM TRIS (pH 8.0) on ice with a protease
inhibitor phenylmethylsulfonyl fluoride (PMSF) (0.5 mM final concentration) added. Cells were
32
lysed by sonication on ice for 10 min (20 s pulse, 20 s rest). The crude cell lysate was
centrifuged at 20,000 g for 30 min and the lysate was loaded onto Ni-NTA column (2 mL,
Clontech) pre-equilibrated with resuspension buffer and the resin was then washed with 50 mL
of the same buffer. The protein was eluted with 25 mM TRIS (pH 8.0) buffer with gradient
addition of imidazole from 20 mM to 250 mM. Fractions containing the protein were identified
by the UV-Vis analysis, combined and exchanged into a buffer containing 20 mM HEPES (pH
7.0) using a desalting column (BioRad, Econo-Pac 10G). Protein samples were concentrated
down to 500-700 µM using 5 MWCO spin filter (CORNING>>431482).
The reduction of protein samples was performed by bringing the degassed, pure protein
samples into the glovebox (MBRAUN) and addition of five equivalents of freshly prepared
dithionite solution under anaerobic condition. Protein was degassed using a Schlenk line and
three one-minute cycles.
For standardization of dithionite, 20-30 mg of solid dithionite was brought into the glovebox.
Next, 1 mL of degassed milliQ water was added to that. Following the same way, potassium
ferricyanide solution was prepared. Dithionite stock was diluted 10-fold with milliQ water. Next,
two solutions were prepared in the following way:
(a) 990 μL water + 10 μL potassium ferricyanide
(b) 980 μL water + 10 μL potassium ferricyanide + 10 μL 1/10 sodium dithionite
Absorbances of both the solutions were measured at ε= 420 nm using diode array instrument.
Since, the extinction coefficient of potassium ferricyanide is ε420 = 1020 M-1cm-1, ΔA420/1020
gives the concentration of reduced iron. From this information, the reducing equivalence of 1/10
dithionite solution was calculated.
33
Kinetic assays of Kemp elimination:
Measurement of enzymatic activities of all the Myoglobin variants discussed here, were
performed at 22°C in a BioTek Eon3 plate reader monitoring absorbance of the product at 380
nm using 96-well plates. The molar extinction coefficient for the product 2-hydroxybenzonitrile
at 380 nm (15,800 M-1cm-1) was taken from the literature. Eight different concentrations of the
Kemp substrate between 50 and 1000 μM were used in these assays while maintaining a constant
2% concentration of acetonitrile. The reaction was monitored in triplicate for 900 s with pure,
reduced protein in 200 μL of buffer (20 mM HEPES, pH 7.0). To maintain reducing condition,
ascorbate (1 mM), SOD (1 µM) and catalase (10 nM) were added to the buffer before the
dilution of reduced protein to the desired concentration. Since, this reducing buffer is stable
enough in aerobic condition at least for 1 h, ascorbate, SOD and catalase are added immediately
before the assay experiment. Next, a 50 μL portion of reduced protein solution in buffer was
added first to each of wells of the 96-well plate and subsequently, 150 μL of substrate solution
was added. The obtained kinetic traces were analyzed using the Michaelis-Menten kinetic
equation {v0=kcat[E]0[S]0/(KM+[S]0)} to obtain kcat and KM values. Individual kcat and KM
parameters could not be obtained for some variants as KM values were exceedingly high. In those
cases, the data points were fit to a linear regime of the Michaelis-Menten model using the
equation {v0=kcat[E]0[S]0/(KM)}.
Expression and purification of isotopically labelled protein:
Transformation was done as described in the “protein expression” section. Overnight
culture (10 mL) was used to inoculate 2L of 1X Terrific Broth (TB) media containing the
antibiotics and incubated at 37°C until OD600 reached 0.8. Then the cells were harvested by
centrifugation (4,000 g for 30 min) at room temperature and washed with unlabelled M9 minimal
34
medium (30 mL).The cell pellet was then resuspended in M9 medium (1 L) containing 15NH4Cl
(1 g) as a nitrogen source and 13C-labelled glucose (2.5 g) as the carbon source. The culture was
incubated at 37ºC for 3 hours to allow for recovery of growth. δ-aminolevulinic acid (0.3 mM)
was added as heme precursor and after 15 mins protein expression was induced by addition of
0.25 mM IPTG and the culture was further incubated at 25ºC for 20 h. Cells were harvested by
centrifugation at 4˚C, 4,000 rpm for 30 min with typical yield of 2 g of wet cell pellet per 1 L of
culture. Pellets were flash frozen in liquid nitrogen and preserved at -80ºC until needed.
Purification of isotopically labelled protein was done as described under “Protein purification”
section.
NMR experiments:
All NMR experiments were carried out at 25°C on a Bruker AVANCE III HD 800 MHz,
equipped with a Triple Resonance Inverse (TCI) Detection CryoProbe.
Spectral data were acquired and processed using Topspin 3.0 (Bruker), NMRPipe and
analysed using Sparky 3.113 and CCPN. For reduced myoglobin H64V (850 μM), backbone
resonance assignments were obtained using a series of three-dimensional NMR experiments:
HNCACB, HN(CO)CACB, HNCO and HN(CA)CO. Chemical shifts were directly/indirectly
referenced to DSS (4,4-dimethyl-4-silapentane-1-sulfonic acid).
Interaction studies of Myoglobin H64V (350 µM) were performed by acquiring 15N-1H
HSQC spectra with either Kemp substrate/inhibitor at 1:0, 1:0.5, 1:1 and 1:2 stoichiometric
ratios. Both Kemp substrate (5-nitrobenzisoxazole) and inhibitor (5-nitrobenzotriazole) solutions
were prepared by dissolving solid substrates in 100% acetonitrile.
Trypsin digestion:
35
Pure lyophilized AG14 peptides were dissolved in MilliQ water and syringe filtered (0.2-
micron, 6500 rpm 10 mins) prior to checking concentration. The concentration of AG14 peptide
was determined by measuring absorbance at 280 nm on an UV-Vis spectrophotometer using
ɛ280= 16,500 M-1cm-1. The concentrations of AzAla variants were determined by measuring
absorbance at 342 nm on an UV-Vis spectrophotometer using ɛ342= 4,212 M-1cm-1. The peptide
stocks of 2mM concentration were prepared in buffer (50 mM sodium phosphate, 100mM NaCl,
pH 7.0). Trypsin (1 mg/ml, Ameresco) stock was prepared fresh in buffer and serial dilutions
were done: 0.1 mg/ml, 0.01 mg/ml, 0.001 mg/ml for the assay. The reaction samples (500 µL)
were prepared in triplicate by mixing AG14 (500 µM final concentration), trypsin (50 µL of
0.001 mg/ml solution) in buffer (50 mM sodium phosphate, 100mM NaCl, pH 7.0). The trypsin
digest was monitored on the HPLC every 15 min at room temperature by following the peak of
undigested peptide. Identity of various peaks in HPLC chromatogram were identified by
MALDI-TOF. For this, fractions containing peptides corresponding to different peaks on HPLC
were collected, lyophilised and re-dissolved in 10 µL of solvent B; 10 µL of solvent A, and 2 µL
of CHCA matrix was added (1:10 proportion) and the mixture was loaded onto the MALDI
target.
Preparation of Vesicles:
Solid POPC (12.5 mg, Avanti Polar Lipids) was dissolved in chloroform (0.5 mL) to
make a lipid stock (16.5 mM). This stock (303 µL) was pipetted into a glass vial and dried with
nitrogen gas and then under a vacuum for 1 hr. The resulting film was resuspended with PBS
buffer (5 mM sodium phosphate, 10 mM NaCl, pH 7.0) containing peptide (final concentration -
2 µM).
36
Circular dichroism spectroscopy:
The CD spectra were acquired on the Jasco J-715 CD spectrometer collecting 64 scans (4
s averaging time) for each spectrum and using a quartz cuvette with a 1 cm path length. The
measurements were performed on samples containing peptides (10 µM) in buffer (5 mM
phosphate,10 mM NaCl, pH 7.0) in the presence and in the absence of vesicles (250 µM of 100%
POPC). Care was taken that the sample absorbance never exceeded 1.5 at all wavelengths to
produce reliable ellipticity values. Mean residue ellipticity (MRE, deg*cm2*dmol-1) values were
calculated using the following equation, where q is ellipticity (mdeg), l is pathlength (cm), C is
peptide concentration (M), N is number of residues. MRE=θ/(10*C*l*N)
Fluorescence:
Fluorescence data for the peptides were obtained on a JY-Horiba fluoromax-2 spectro-
fluorometer at room temperature. Emission spectra measurements were taken in Spectrosil quartz
cuvettes (1 cm path length cuvette, sample volume 2 ml) using a 5 mm excitation slit width and 5
mm emission slit width for spectra. The measurements were performed on samples containing
peptides (2 µM) in buffer (50 mM sodium phosphate,150 mM NaCl, pH 7.0). The excitation
wavelength used to excite AzAla was 342 nm. Fluorescence emission spectra were recorded over
the range of 355 nm − 455 nm. To excite tryptophan, the excitation wavelength used was 280
nm. Fluorescence emission spectra were recorded over the range of 300-400 nm.
37
Chapter 5: Discussion
Protein evolution is a continuous process in nature as a result of random mutation
acquisition and Darwinian39 selection of the fittest in a constantly changing environment,
however in nature it may take a long period of time for an advantageous mutation to appear by
chance. Manmade directed evolution is a rather faster method, and it did not actually first start in
the laboratory. Since the introduction of agriculture and farming40, humans were indirectly
practicing selection of mutants through crop selection based on the quality and quantity of grains
produced and selective crossbreeding animals for many generations for better meat or milk
production or even better hunting dog skills resulted in systematic selection of favorable
combinations of genetic mutants41. Protein mutagenesis in nature however should not only be
seen as in the light of a positive natural selection pathway, as it is also responsible for the
pathogenesis of cancer where many mutants in oncogenes result in their constitutive activation
and give a growth advantage at the cellular level but with devastating effect at the organism
level. Mutants that are resistant to chemotherapeutic agents are often selected as an adverse
response to cancer treatment. In different types of cancer, such as leukemia, translocation of
different genes creates novel chimeric proteins with catastrophic effects42.
In the laboratory even faster, more sophisticated and effective directed evolution
techniques have been developed6, and in this paper we have shown how this new NMR-guided
approach allowed the evolution of a scaffold with no previous Kemp catalytic capabilities, to
incorporate and be efficient in conducting this novel catalytic function. In the laboratory,
directed protein evolution, which is the manipulation and selection of specific molecules instead
of organisms, has developed over the years and had a great impact in many industries including
agriculture, the pharmaceutical industry, the chemical industry and others. Scientists employed
38
many strategies to introduce and select favorable single or combinations of mutant proteins with
new or improved phenotypes and physical properties43. Random mutagenesis and selection of
specific favorable mutants, DNA shuffling of libraries of mutants and combination of related
proteins, exon shuffling of non-homologous genes or even whole genomic shuffling of different
strains are just a few of the methods used6. The improvement and modification of many nucleic-
acid modification enzymes such as polymerases, restriction endonucleases and many other
biological reagents led to the explosive development of molecular biology. In turn, the
development of molecular biology techniques and especially genomics and proteomics
accelerated the industry of directed protein evolution44 with an incredible impact in
pharmacology through the design of protein pharmaceuticals, antibodies, vaccines and many
therapeutic chemicals.
In this study we used a combination of rational design-based site-specific mutagenesis of
myoglobin to modify the heme binding pocket to inhibit its previous activity and make the heme
more easily accessible. Its low catalytic efficiency even after the initial mutations (kcat/KM WT:
13 M-1s-1, H64V: 18 M-1s-1) demonstrated how myoglobin’s original scaffold was not intended
for Kemp catalysis. We then used HSQC NMR-analysis followed by random mutagenesis and
enzymatic screening to establish and enhance the enzymatic activity of the protein (H64V/V68A:
199 M-1s-1, H64V/V68A/T95V: 2660 M-1s-1). We have therefore demonstrated that HSQC NMR
guided evolution can be used successfully in the introduction of a desired novel enzymatic
activity to non-protein scaffolds and provide a direction for improving the catalytic efficiency of
an enzyme.
Briefly, after successful expression of WT mb, we used side specific mutagenesis to
accommodate access of the substrate to the heme binding site. The first mutation introduced was
39
the substitution of His64 with valine, a smaller hydrophobic residue. His64 is a distal histidine
that helps myoglobin stabilize bound oxygen molecules. To further increase the size of the
binding pocket allowing for easier access of the substrate a second mutation was introduced in
position 68, replacing valine into an alanine. A Kemp elimination assay demonstrated a modest
increase in the Kemp elimination activity of the H64V/V68A double mutant compared to the
H64V and V68A mutants.
In order to determine which residues are involved in protein substrate interaction, we
performed HSQC NMR analysis of the H64V/V68A double mutant protein in the presence and
absence of a non-catalyzable substrate. HSQC NMR studies identified residues at the positions
T95 and K96 as the ones having the greatest chemical shifts as a result of substrate binding. In
an attempt to further enhance substrate binding and increase enzymatic activity, targeted random
mutagenesis of these two residues and crude lysate screening with Kemp elimination was carried
out. Random mutagenesis screening resulted in the identification of two identical constructs
with a T95V substitution which had increased enzymatic activity. Enzymatic studies of the
triple mutant H64V/V68A/T95V protein demonstrated a ten-fold increase in activity compared
to the H64V/V68A double mutant.
Since this novel NMR-guided approach has successfully predicted important residues to
be modified outside of the binding pocket, which would not have been speculated by any other
rational design method, we have established this technique as a powerful new approach in the
quest for designing and improving protein catalysts. Furthermore, we postulate that myoglobin, a
small, heme containing protein can serve as a suitable template for NMR-guided evolution of a
multitude of novel enzyme catalysts with different substrate specificities and enzymatic
activities.
40
Through incorporation of the non-natural amino acid into the small antimicrobial
peptides, we have demonstrated successful introduction of the proteolytic resistance without
altering the peptide conformation and have shown the unique ability of AzAla to serve as an
environment-insensitive and non-disruptive biochemical probe that can be selectively excited at
342 nm. The minimal changes in quantum yield and emission maxima in different
microenvironments allowed for observations of subtle effects caused by weak quenchers
inherently present in proteins.
The next step for the antimicrobial peptides, would be a comparison between AG14 and
AG14WZ fluorescent spectra in different environments to further investigate the structures of the
different peptides at various conditions and compare what information they can provide
regarding the microenvironment they are located in. Furthermore, the effectiveness of the
antimicrobial properties of AG14WZ peptides should be compared with the original AG14
peptides to further look into how well they retain their antimicrobial function. If the peptides
pass cytotoxicity testing in mammalian cells, further proteolytic analysis should be performed to
see how they would hold up against other endogenous human enzymes in addition to studies
regarding methods of oral or blood administration and the lifetime and tracking of the peptides
inside the human body.
Finally, it would be interesting to investigate the incorporation of AzAla into larger
proteins such as enzymes as an approach to introduce proteolytic resistance and spectral
enhancement without altering their structure and their natural properties. Combining our
understanding of protein enzyme structure and function in combination with new molecular
biology techniques will lead to an explosion of discoveries and the development of many
designer proteins for a multitude of applications.
41
References
1. Martínez Cuesta, S., Rahman, S. A., Furnham, N. & Thornton, J. M. The Classification
and Evolution of Enzyme Function. Biophysical Journal (2015)
doi:10.1016/j.bpj.2015.04.020.
2. Arnold, F. H. The nature of chemical innovation: New enzymes by evolution. Quarterly
Reviews of Biophysics (2015) doi:10.1017/S003358351500013X.
3. Herger, M. et al. Synthesis of β-Branched Tryptophan Analogues Using an Engineered
Subunit of Tryptophan Synthase. J. Am. Chem. Soc. (2016) doi:10.1021/jacs.6b04836.
4. Boville, C. E., Romney, D. K., Almhjell, P. J., Sieben, M. & Arnold, F. H. Improved
Synthesis of 4-Cyanotryptophan and Other Tryptophan Analogues in Aqueous Solvent
Using Variants of TrpB from Thermotoga maritima. J. Org. Chem. (2018)
doi:10.1021/acs.joc.8b00517.
5. Buller, A. R. et al. Directed Evolution Mimics Allosteric Activation by Stepwise Tuning
of the Conformational Ensemble. J. Am. Chem. Soc. (2018) doi:10.1021/jacs.8b03490.
6. Yuan, L., Kurek, I., English, J. & Keenan, R. Laboratory-Directed Protein Evolution.
Microbiol. Mol. Biol. Rev. (2005) doi:10.1128/mmbr.69.3.373-392.2005.
7. Arnold, F. H. Design by Directed Evolution. Acc. Chem. Res. (1998)
doi:10.1021/ar960017f.
8. Arnold, F. H. & Georgiou, G. Directed enzyme evolution : Screening and selection
methods . Methods (2003) doi:10.1110/ps.04754204.diverse.
9. Dwyer, M. A., Looger, L. L. & Hellinga, H. W. Computational design of a biologically
active enzyme. Science (80-. ). (2004) doi:10.1126/science.1098432.
10. Bodenhausen, G. & Ruben, D. J. Natural abundance nitrogen-15 NMR by enhanced
42
heteronuclear spectroscopy. Chem. Phys. Lett. (1980) doi:10.1016/0009-2614(80)80041-8.
11. Kościuczuk, E. M. et al. Cathelicidins: family of antimicrobial peptides. A review.
Molecular biology reports (2012) doi:10.1007/s11033-012-1997-x.
12. Weinstock, M. T., Francis, J. N., Redman, J. S. & Kay, M. S. Protease-resistant peptide
design-empowering nature’s fragile warriors against HIV. Biopolymers (2012)
doi:10.1002/bip.22073.
13. Sinkeldam, R. W., Greco, N. J. & Tor, Y. Fluorescent analogs of biomolecular building
blocks: Design, properties, and applications. Chem. Rev. (2010) doi:10.1021/cr900301e.
14. Callis, P. R. & Vivian, J. T. Understanding the variable fluorescence quantum yield of
tryptophan in proteins using QM-MM simulations. Quenching by charge transfer to the
peptide backbone. Chem. Phys. Lett. (2003) doi:10.1016/S0009-2614(02)02046-8.
15. Moroz, Y. S., Binder, W., Nygren, P., Caputo, G. A. & Korendovych, I. V. Painting
proteins blue: β-(1-azulenyl)-l-alanine as a probe for studying protein-protein interactions.
Chem. Commun. (2013) doi:10.1039/c2cc37550h.
16. Brahma, B. et al. Diversity, antimicrobial action and structure- activity relationship of
buffalo cathelicidins. PLoS One (2015) doi:10.1371/journal.pone.0144741.
17. Tipton, K. & Boyce, S. History of the enzyme nomenclature system. Bioinformatics
(2000) doi:10.1093/bioinformatics/16.1.34.
18. Copley, S. D. Toward a systems biology perspective on enzyme evolution. Journal of
Biological Chemistry (2012) doi:10.1074/jbc.R111.254714.
19. Anthony-Cahill, S. & Magliery, T. Expanding the Natural Repertoire of Protein Structure
and Function. Curr. Pharm. Biotechnol. (2005) doi:10.2174/1389201023378102.
20. Appel, D., Lutz-Wahl, S., Fischer, P., Schwaneberg, U. & Schmid, R. D. A P450 BM-3
43
mutant hydroxylates alkanes, cycloalkanes, arenes and heteroarenes. J. Biotechnol. (2001)
doi:10.1016/S0168-1656(01)00249-8.
21. Li, Q. S., Schwaneberg, U., Fischer, P. & Schmid, R. D. Directed evolution of the fatty-
acid hydroxylase P450 BM-3 into an indole-hydroxylating catalyst. Chem. - A Eur. J.
(2000) doi:10.1002/(SICI)1521-3765(20000502)6:9<1531::AID-CHEM1531>3.3.CO;2-4.
22. Makam, P. et al. Non-proteinaceous hydrolase comprised of a phenylalanine metallo-
supramolecular amyloid-like structure. Nat. Catal. (2019) doi:10.1038/s41929-019-0348-
x.
23. Makhlynets, O. V. & Korendovych, I. V. A single amino acid enzyme. Nat. Catal. (2019)
doi:10.1038/s41929-019-0379-3.
24. Yoon, J. H. et al. Uno Ferro, a de novo Designed Protein, Binds Transition Metals with
High Affinity and Stabilizes Semiquinone Radical Anion. Chem. - A Eur. J. (2019)
doi:10.1002/chem.201904020.
25. Ren, J. et al. Structural insights into mechanisms of non-nucleoside drug resistance for
HIV-1 reverse transcriptases mutated at codons 101 or 138. FEBS J. (2006)
doi:10.1111/j.1742-4658.2006.05392.x.
26. Calhoun, J. R. et al. Computational Design and Characterization of a Monomeric Helical
Dinuclear Metalloprotein. J. Mol. Biol. (2003) doi:10.1016/j.jmb.2003.10.004.
27. Calhoun, J. R. et al. Solution NMR Structure of a Designed Metalloprotein and
Complementary Molecular Dynamics Refinement. Structure (2008)
doi:10.1016/j.str.2007.11.011.
28. Poulos, T. L. Heme enzyme structure and function. Chemical Reviews (2014)
doi:10.1021/cr400415k.
44
29. Kemp, D. S., Cox, D. D. & Paul, K. G. The Physical Organic Chemistry of
Benzisoxazoles. IV. The Origins and Catalytic Nature of the Solvent Rate Acceleration for
the Decarboxylation of 3-Carboxybenzisoxazoles. J. Am. Chem. Soc. (1975)
doi:10.1021/ja00858a018.
30. Wang, Z. Kemp Elimination. in Comprehensive Organic Name Reactions and Reagents
(2010). doi:10.1002/9780470638859.conrr357.
31. Korendovych, I. V. et al. Design of a switchable eliminase. Proc. Natl. Acad. Sci. U. S. A.
(2011) doi:10.1073/pnas.1018191108.
32. Sohl, C. D., Cheng, Q. & Guengerich, F. P. Chromatographic assays of drug oxidation by
human cytochrome P450 3A4. Nat. Protoc. (2009) doi:10.1038/nprot.2009.122.
33. Guengerich, F. P., Martin, M. V., Sohl, C. D. & Cheng, Q. Measurement of cytochrome
P450 and NADPH-cytochrome P450 reductase. Nat. Protoc. (2009)
doi:10.1038/nprot.2009.121.
34. Li, A. et al. A redox-mediated Kemp eliminase. Nat. Commun. (2017)
doi:10.1038/ncomms14876.
35. Strömstedt, A. A., Pasupuleti, M., Schmidtchen, A. & Malmsten, M. Evaluation of
strategies for improving proteolytic resistance of antimicrobial peptides by using variants
of EFK17, an internal segment of LL-37. Antimicrob. Agents Chemother. (2009)
doi:10.1128/AAC.00477-08.
36. Gosavi, P. M., Moroz, Y. S. & Korendovych, I. V. β-(1-Azulenyl)-l-alanine-a functional
probe for determination of pKa of histidine residues. Chem. Commun. (2015)
doi:10.1039/c4cc08720h.
37. Tofani, L. et al. Spectroscopic and interfacial properties of myoglobin/surfactant
45
complexes. Biophys. J. (2004) doi:10.1529/biophysj.104.041731.
38. Rohlfs, R. J. et al. The effects of amino acid substitution at position E7 (residue 64) on the
kinetics of ligand binding to sperm whale myoglobin. J. Biol. Chem. (1990).
39. Dimijian, G. G. Darwinian Natural Selection: Its Enduring Explanatory Power. Baylor
Univ. Med. Cent. Proc. (2012) doi:10.1080/08998280.2012.11928811.
40. Purugganan, M. D. & Fuller, D. Q. The nature of selection during plant domestication.
Nature (2009) doi:10.1038/nature07895.
41. Gjedrem, T. Improvement of productivity through breeding schemes. GeoJournal (1985)
doi:10.1007/BF00462124.
42. Balmain, A., Gray, J. & Ponder, B. The genetics and genomics of cancer. Nature Genetics
(2003) doi:10.1038/ng1107.
43. Garland, T. & Rose, M. R. Experimental evolution: Concepts, methods, and applications
of selection experiments. Experimental Evolution: Concepts, Methods, and Applications of
Selection Experiments (2009). doi:10.5860/choice.47-6844.
44. Packer, M. S. & Liu, D. R. Methods for the directed evolution of proteins. Nature Reviews
Genetics (2015) doi:10.1038/nrg3927.