protein folding - simbacthe folding mechanism: a polypeptide chain has an almost unfathomable number...
TRANSCRIPT
![Page 1: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/1.jpg)
Protein folding
![Page 2: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/2.jpg)
Polypeptide
‘Cartoon’ representation
beta-sheet alpha-helix‘Surface’ representation
Amino acid (Phenylalanine)
Backbone
Sidechain
Basic protein structure
![Page 3: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/3.jpg)
What happens if proteins misfold?Mad Cow disease
– In a bovine epidemic that struck the UK starting in 1986, cows appeared to be mad: they drooled and staggered, were extremely nervous, or bizarrely aggressive. They all died. As the brains of the dead “mad” cows resembled a sponge, the disease was called bovine spongiform encephalopathy, or BSE.
Prions are proteins that are found in the nerve cells of all mammals. Many abnormally-shaped prions are found in the brains of BSE-infected cows.
http://www.uvm.edu/~wschaeff/101PrionsMadCow.html
Called (variant) Creutzfeldt–Jakob disease in humans
![Page 4: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/4.jpg)
What happens if proteins misfold?• Alzheimer's disease (AD)
– Worsens as it progresses, eventually leading to death
– The Aβ peptides are believed to be involved in AD: they can oligomerize and be released into the interstitial fluid of brain, where soluble oligomers may diffuse into synaptic clefts and interfere with synaptic function by unknown mechanisms.
– multiple drugs failing in Phase III trials (even as of two weeks ago!).
http://www.nature.com/ncb/journal/v6/n11/full/ncb1104-1054.html
![Page 5: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/5.jpg)
What happens if proteins misfold?• Alzheimer's disease (AD)
– Worsens as it progresses, eventually leading to death
– The Aβ peptides are believed to be involved in AD: they can oligomerize and be released into the interstitial fluid of brain, where soluble oligomers may diffuse into synaptic clefts and interfere with synaptic function by unknown mechanisms.
– multiple drugs failing in Phase III trials (even as of two weeks ago!).
http://www.nature.com/ncb/journal/v6/n11/full/ncb1104-1054.html
https://www.the-scientist.com/news-opinion/biogen--eisai-end-two-late-stage-trials-for-alzheimers-treatment-66431
![Page 6: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/6.jpg)
Protein Folding: Three Questions
“The protein-folding problem came to be three main questions:
• The Protein-Folding Problem, 50 Years On. Ken A. Dill and Justin L. MacCallum Science 338, 1042 (2012);
3. Predicting protein structures using computers: Can we devise a computer algorithm to predict a protein’s native structure from its amino acid sequence?
2. The folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast?
1. The physical folding code: How is the 3D native structure of a protein determined by the physicochemical properties that are encoded in its 1D amino-acid sequence? (Anfisen’s dogma - 1972 Nobel Prize)
![Page 7: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/7.jpg)
Q1 The Physical Folding Code
Forces Governing Protein Folding: 1. Hydrogen bonds 2. van der Waals interactions 3. Backbone angle preferences 4. Electrostatic interactions. 5. Hydrophobic interactions. 6. Chain entropy.
![Page 8: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/8.jpg)
Q2 The Folding Mechanism
• Levinthal’s paradox –The conformation of a protein
is largely determined by its backbone angles: ϕ and ψ.
– If we only focus on ψ, and assume that it can adopt one of three possible values. How long does it take for a protein with 101 amino acids to fold? • Possible conformations: 3100 ≈ 5x1047.
![Page 9: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/9.jpg)
Dawkins’ weasel
Given enough time, a monkey bashing away at random on a typewriter could produce all the works of Shakespeare.
“Let us limit the task facing our monkey somewhat. Suppose that he has to produce, not the complete works of Shakespeare but just the short sentence 'Methinks it is like a weasel', and we shall make it relatively easy by giving him a typewriter with a restricted keyboard, one with just the 26 (capital) letters, and a space bar. How long will he take to write this one little sentence?”
Richard Dawkins
‘Methinks it is like a weasel’ Hamlet
2728 = 1040; 1 letter/second ➔ 1032 years!
Original used to explain evolution by random mutation
Dawkins’ resolved this by allowing for cumulative selection
![Page 10: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/10.jpg)
Solution
• Dawkins’ weasel –Restrictions for the monkey:
• not allowed to change those letters that are already correctly in place.
This solution only offers a mathematical explanation of why protein folding is possible; it doesn’t actually
solve the problem.
How does it know what the target is?
R. Zwanzig, et al., PNAS, 89:20-22, 1992.
• Levinthal’s paradox –Consider amino acid interactions: native-fold
interactions maintained as they form
![Page 11: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/11.jpg)
Protein Folding Funnel
• Protein folding landscapes are narrower at the bottom; there are few low-energy, native-like conformations and many more open unfolded structures.
• A protein folds by taking random steps that are mostly incrementally downhill in energy.
• Different molecules of the same protein sequence may each follow microscopically different routes to the same native structure.
• A protein appears to first develop local structures in the chain (such as helices and turns) followed by growth into more global structures. Even though the folding process is blind, nevertheless it can be fast because native states can be reached by this divide-and-conquer, local-to-global process.
Wolynes, Onuchic, Thirumalai. Navigating the folding routes. Science 267: 1619-1620 (1995).
![Page 12: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/12.jpg)
10-Ala helix (in vacuum) end-to-end distance (ξ) a common RC
14 Å
32 Å
C. Chipot and J. Hénin, Exploring the free energy landscape of a short peptide using an average force, J. Chem. Phys. 123, 244906 (2005)
done in vacuum
A 1D “funnel” - folding of deca-alanine helix
![Page 13: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/13.jpg)
Free energy surface
bottom of the “funnel”
end-to-end distance (Å)
α-he
lical
con
tent PM
F (kcal/mol)
Hazel, Chipot, and Gumbart. Thermodynamics of deca-alanine folding in water. J. Chem. Theory Comput. 10: 2836-2844 (2014)
calculated free energy shows minima in two different states
![Page 14: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/14.jpg)
Hazel, Chipot, and Gumbart. Thermodynamics of deca-alanine folding in water. J. Chem. Theory Comput. 10: 2836-2844 (2014)
![Page 15: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/15.jpg)
Lattice models for protein folding
HP model: assigns residues into Hydrophobic (H) or Polar (P) classes, Assumes hydrophobic collapse dominates folding free energy
Only permit folding on a lattice (unoccupied sites are solvent)
Assign an energy penalty for any H-P or H-S contacts
dark - H; light - P
HPHPHP
PHPPHP
PBoC 8.4.3
![Page 16: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/16.jpg)
Lattice models for protein folding
HP model: assigns residues into Hydrophobic (H) or Polar (P) classes, Assumes hydrophobic collapse dominates folding free energy
Only permit folding on a lattice (unoccupied sites are solvent)
Assign an energy penalty for any H-P or H-S contacts
dark - H; light - P
HPHPHP
PHPPHP
not a good protein, all states have the same energy!
PBoC 8.4.3
![Page 17: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/17.jpg)
Lattice models for protein folding
HP model: assigns residues into Hydrophobic (H) or Polar (P) classes, Assumes hydrophobic collapse dominates folding free energy
Only permit folding on a lattice (unoccupied sites are solvent)
Assign an energy penalty for any H-P or H-S contacts
dark - H; light - P
HPHPHP
PHPPHP
not a good protein, all states have the same energy!
Here, a lowest energy minimum state exists
PBoC 8.4.3
![Page 18: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/18.jpg)
Lattice models for protein folding
pfold =e−2βϵ
e−2βϵ + 2e−4βϵ
PBoC 8.4.3
PHPPHP
![Page 19: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/19.jpg)
Gō (lattice) modelIn its simplest form, developed in 1975, assigns a favorable energy ε for native contacts and 0 for non-native contacts Requires knowledge of final structure, but permits one to examine folding kinetics
Ueda, Y., Taketomi H. & Go N. Studies on protein folding, unfolding, and fluctuations by computer simulation I. Int. J. Peptide. Res. 7, 445-459 (1975).
Lysozyme (structure) Lysozyme (lattice)
using Monte Carlo simulations, can enumerate different folding pathways examination of simulation results allows identification of common intermediate states still used in various forms today!!!
![Page 20: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/20.jpg)
Complications to the funnel
F Ulrich Hartl & Manajit Hayer-Hartl. Converging concepts of protein folding in vitro and in vivo. Nat. Struct. Mol. Bio. 16:574-581 (2009).
true free-energy landscape is much more complex than a single funnel
proteins may sample a number of intermediates without native-like structure on the folding pathway
Bowman, G. R.; Voelz, V. A.; Pande, V. S.Taming the complexity of protein folding Curr. Opin. Struct. Biol. 2011, 21, 4– 11
![Page 21: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/21.jpg)
Complications to the funnel
F Ulrich Hartl & Manajit Hayer-Hartl. Converging concepts of protein folding in vitro and in vivo. Nat. Struct. Mol. Bio. 16:574-581 (2009).
many disease states on the right-hand side - how to avoid them?
protein chaperones prevent aggregation
proteins already start folding during synthesis!
PL Clark. Protein folding in the cell: reshaping the folding funnel. Trends Biochem. Sci. 29:527-534 (2004).
![Page 22: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/22.jpg)
many in vivo factors alter the folding process, e.g.,
H-X Zhou http://pubs.acs.org/cen/coverstory/88/8848cover.html?featured=1
macromolecular crowding
chaperones such as Trigger Factor
Deeng, Chan, van der Sluis, Berninghausen, Han, Gumbart, Schulten, Beatrix, Beckmann. Dynamic behavior of Trigger Factor on the ribosome. (2016) JMB 428:3588-3602.
Complications to the funnel
![Page 23: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/23.jpg)
Q3 Protein Structure Prediction
• We know up to 1000x more sequences than structures. – There is considerable value in methods that could accurately
predict structures from sequences.
• Initiated in 1994: Critical Assessment of protein Structure Prediction (CASP). – Held every second summer, CASP is a community-wide blind
competition in which typically more than 100 different “target sequences” (of proteins whose structures are known but not yet publicly available) are made available to a community that numbers more than 100 research groups around the world.
![Page 24: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/24.jpg)
The Progress of CASP
• Currently, all successful structure-prediction algorithms are based on assuming that similar sequences lead to similar structures.
• PDB: 156,000 structures but mostly redundant (4000 structural families and 1400 folds - may be all that exist?).
![Page 25: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/25.jpg)
Remaining Challenges in CASP• When there is no protein in the PDB with a sequence
resembling the target’s, accurately predicting the structure of the target is much more difficult (free modeling, or ab initio, de novo prediction).
• Substantial improvements have been observed for free-modeling targets shorter than 100 amino acids, although no single group yet consistently produces accurate models.
• Winner in CASP12: Rosetta+MD from David Baker lab (2016)
http://robetta.bakerlab.org/structure prediction webserver
![Page 26: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/26.jpg)
Remaining Challenges in CASP• Winner in CASP13:
AlphaFold (2018) • AlphaFold is product
of DeepMind, an AI company in the UK acquired by Google in 2014 (also made AlphaGo)
https://www.sciencemag.org/news/2018/12/google-s-deepmind-aces-protein-folding
average margin of 15% accuracy improvement over other groups on the toughest 43 tests
For protein sequences for which no other information was known—43 of the 90—AlphaFold made the most accurate prediction 25 times. That far outpaced the second place finisher, which won three of the 43 tests.
![Page 27: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/27.jpg)
AlphaFold algorithm
• deep neural networks predict: (a) the distances between pairs of amino acids and (b) the [ɸ/ψ] angles between chemical bonds that connect those amino acids
• trained a generative neural network to invent new fragments, which were used to continually improve the score of the proposed protein structure
• optimized scores through gradient descent (a type of minimization)
https://deepmind.com/blog/article/alphafold
![Page 28: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/28.jpg)
MD Simulation of Protein Folding
• Challenges –Timescale
• Specialized supercomputers –Force field
• Improvement made to classical MD FF:
–AMBER; CHARMM; GROMOS, etc.
• New FF with polarizability is increasingly used.
IBM ‘Blue Gene’
D.E. Shaw Research ‘Anton’
![Page 29: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/29.jpg)
MD Simulations of Protein Folding
Using Anton, Shaw and co-workers observed reversible folding and unfolding in more than 400 events across 12 small proteins to structures within 4.5 Å of the experimental structure.
Lindorff-Larsen, et al., Science, 334:517, 2011.
![Page 30: Protein folding - SimBacThe folding mechanism: A polypeptide chain has an almost unfathomable number of possible conformations. How can proteins fold so fast? 1. The physical folding](https://reader034.vdocument.in/reader034/viewer/2022042418/5f34d68fbdb0325e1023590e/html5/thumbnails/30.jpg)
http://www.nature.com/nature/journal/vaop/ncurrent/full/nature12443.html
Beyond prediction: designer proteins
Tinberg, C. E. et al. Nature http://dx.doi.org/10.1038/nature12443 (2013).
PRINCIPLES FOR DESIGNING IDEAL PROTEIN
STRUCTURES Koga, N., Tasumi-Koga R., et al., Nature. 491(7423), 222-227.
(2012)
Simple rules create idealized structures, verified experimentally
Beyond structure, engineer protein for specific functions
http://depts.washington.edu/bakerpg/drupal/David Baker Lab