lecture 4' - intro to protein struct i spr08

Upload: curlicue

Post on 30-May-2018

212 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    1/34

    Lecture 4 Introduction toProtein Structure (1)

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    2/34

    Introduction

    Proteins are the functional forms ofpolypeptides. represent all levels of the hierarchy of

    macromolecular structure (1o

    to 4o)

    protein structure defined by: chemical properties of the polypeptide chain.

    the environment.

    Several distinct classes of proteins:

    1. globular proteins (water-soluble).2. fibrous proteins (water-insoluble).

    3. proteins that associate with membranes.

    These differ by tendencies in amino acid sequenceand composition, but can all be described using the same basic principles.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    3/34

    ro e ns u rom m noAcids

    There are 20 common amino acids. all are -amino acids:

    amino and carboxylic acid groupsseparated by a single, C carbon.

    all are L-amino acids (except Glycine). predominantly zwitterions, at pH 7. distinguished by chemical nature ofR,

    the side chain.

    Proteins also have other

    components: D-amino acids.

    e.g., bacterial antibiotics, such asgramicidin.

    Covalent modifications, following

    synthesis. Disulfide bonds common in eukaryotic

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    4/34

    Adoption of the L-formStructurally Significant

    Consider a natural protein, such asRubredoxin: protein in sulfur-metabolizing bacteria.

    Fe-S complex shown in green andyellow.

    constructed of L-amino acids.

    Will Rubredoxin made of D-formamino acids have an inverted

    structure? in 1993, L and D-forms of Rubredoxin

    were synthesized. structures: X-ray crystallography.

    they are exact mirror images.

    L- and D- HIV protease also

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    5/34

    Amino Acids

    Distinguished by chemical nature of the side-chain. size and shape.

    charge.

    hydrogen-bonding ability.

    ability to form disulfide-bridges, etc.

    Amino acids can be broadly classified into 5groups: Aliphatic: R = hydrocarbon side-chain.

    Nonpolar: R = other hydrophobic side-chain.

    Aromatic: R = aromatic ring.

    Polar: R = uncharged, polar group.

    Charged: R carries a charge in solution, at pH 7.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    6/34

    The Hydrophobic Effect

    Most proteins are amphipathic: they include both hydrophobic and hydrophilic

    residues

    Folding generally results in a partitioning of

    residues: b/wAq and non-Aq environments. hydrophobic residues

    will each desire to avoid water: tend to reside in a membrane or protein interior.

    hydrophilic residues will each desire to interact with water: tend to remain hydrated, reside on a protein exterior.

    This partitioning b/w Aq and non-Aqenvironments: leads the hydrophobic effect, which drives protein

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    7/34

    The Partition Coefficient

    The Partition Coefficient, P: measures the partitioning of a residue betweenAq and

    non-Aq environments. Consider a two-solvent system

    with separateAq and non-Aq environments;

    that are in contact, at equal conditions (e.g.,Temperature).

    P answers the question, how much of a given amino acid will reside in each

    environment?

    For a given amino acid, P is measured by:

    P = nonaq /aq, where

    i = mole fraction residing in environment i.

    conceptually simple, but there are some practical

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    8/34

    Hydrophobicity Scale

    Numerous scales of amino acid hydrophobicityhave been proposed. most based on the G

    oto transfer from water to

    octanol.

    which is best is controversial, but one popular scaleis

    The Hydrophobicity of Fouchere and Pliska(1983): hydrophobicity parameters, derived from:

    transfer from water to octanol. N-acetyl-amino acid amides.

    Hydrophobicity parameter for a given amino acid:

    = ln P = ln (nonaq / aq)

    Then: Hydrophobic: > 0; Hydrophilic: < 0

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    9/34

    Aliphatic Amino Acids

    These amino acids have alkyl side-chains: so that R is a hydrocarbon.

    All are hydrophobic (P > 1).

    hence, have hydrophobicity, = ln P

    > 0.Hydrophobicity increases with side-chainlength: = 0.31, 1.22, 1.70, 1.80 for Ala, Val, Leu, and Ile,

    respectively.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    10/34

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    11/34

    Aromatic Amino Acids

    All highly hydrophobic. Phe has a hydrophobicity of = 1.79. Tyr is less hydrophobic, at = 0.96,

    due to its reactive hydroxyl group.

    Trp is the most hydrophobic residue ( = 2.25).Rings bulky and tend to interact with otherrings. due to pi-pi interaction. in Aq. solution, rings

    perpindicular. entropically favored.

    This is in contrast with

    the stacked rings in DNA minimizes solvent

    exposure.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    12/34

    Polar Amino Acids

    Each contains groups with partial charges, and therefore, tend to form hydrogen bonds...

    Thus, much less averse to water: Asn, Gln, Ser all have negative hydrophobicity values.

    = -0.60, -0.22, and 0.04, respectively.

    Thr is slightly hydrophobic, at = 0.26. in some scales, Thr is hydrophilic (e.g., in the

    hydropathy scale).

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    13/34

    Charged Amino Acids

    Amino acids that carry a charge very hydrophilic: Lys and Arg (+) charged at pH 7

    very hydrophilic ( = -0.99 and = 1.01, respectively).

    His can also be (+) charged at pH 7 (environment-

    sensitive). intermediate hydrophobicity ( = 0.13).

    Aspartic acid and Glutamic acid (-) charged at pH 7 very hydrophilic ( = -0.77 and 0.64, respectively).

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    14/34

    Overall Charge of a Protein

    In passing, we note that the overall proteincharge: depends on the number of acidic and basic

    residues...

    at the experimental pH of interest.

    The Isoelectric Point (pI) = the pH at which the total charge of a protein is

    zero.

    Protein charge density can be estimated from pI:

    c = (pI-pH)/MW

    here, MW is the molecular weight of the protein. For pH > pI, overall charge negative (deprotonation).

    For pH < pI, overall charge positive.

    + +

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    15/34

    Protein Structure

    Proteins may have up to 4 levels of structure: Primary structure:

    the N to C amino acid sequence of the polypeptide.

    Secondary structure: helices resulting from local folding.

    Tertiary structure: global folding of secondary structures into a larger

    structure.

    Quaternary structure: association of several, independent polypeptides.

    We will look at each, in detail 1o and 2oStructure (this Lecture)

    3o

    and 4o

    structure (next Lecture)

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    16/34

    Protein Primary Structure

    A polypeptide - covalently linked chain of aminoacids. each linked amino acid called a residue. each pair of residues connected by a peptide bond:

    The N-terminal to C-terminal sequence ofresidues: Is the primary structure (1

    o) of the encoded protein.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    17/34

    Anfinsens Principle

    Anfinsens Principle is basic to biochemistry: The information needed to fold a macromolecule into

    its native, 3-D structure is contained in its sequence. Denatured ribonuclease spontaneously refolds into the

    enzymatically active form, in vitro (Anfinsen, 1963).

    1o

    structure then specifies the higher structure: Each sequence corresponds to a well-defined 3-D

    structure. or to a family of closely related structures with activity.

    the native state.

    On the other hand a unique structure does notrequire a unique sequence. level of sequence homology required for similar

    structure is only about 25%-30%.

    non-homologous sequences can also have similar

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    18/34

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    19/34

    Traditional Names for Helices

    Determined by the nature of the repeating H-bond: N = number of residues in 1 helix turn.

    d = number of atoms in the ring formed by each H-

    bond donor (amino-H) and acceptor (keto-O).

    each helix assigned the name, Nd.

    e.g., in a 310 helix,

    Each turn contains N=3 residues;

    each H-bonded donor/acceptor pair form a ring of d =10 atoms.

    This notation has several shortcomings: a -sheet cannot be described in these terms:

    each strand is a 2-fold helix, butH-bonds between

    strands.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    20/34

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    21/34

    The Standard Terminology

    For displacements down the helical axis: total rise/turn = pitch, P.

    rise/step = helical rise, h.

    steps/turn = helical repeat, c (= n).

    P = c h

    For angular displacements about the axis: angle/step = = helical angle, or twist.

    = 2/c = 2 h/P

    For helical symmetry about the z-axis, Positions of adjacent steps then related by:

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    22/34

    Helix Handedness

    The Symmetry Matrix does not uniquely specify ahelix: a helix can be left or right-handed.

    in principle, handedness given by : right-handed: > 0.

    left-handed: < 0.

    but we have the convention: >= 0. all rotations defined as right-handed.

    Our notation is degenerateWe distinguish right and left helices by: the axial displacement: P or h.

    For right-handed helices, P and h > 0.

    For left handed helices, P and h < 0.

    The helix shown here is right-handed.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    23/34

    Naming Helices by Symmetry

    Helical symmetry is denoted by NT:

    N denotes its N-fold rotation operator. T = translation generated by the symmetry

    operator, in repeats (monomers).

    compare with Nd notation.

    Example: Take the helix at right... Each residue rotated by +120

    o.

    3-fold helical symmetry (N = 3).

    1 application of the symmetry operator: translates a residue by +1 repeat, or P/3.

    thus, T = 1.

    This helix has 31

    symmetry.

    note that it is a right-handed helix.

    ri ht-handed helices with inte er N are N helices.

    e e x as

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    24/34

    e 10 e x as 1Symmetry

    Example: The 310 Helix

    one of the two common helices inproteins.

    Commonly named by Nd notation:

    N=3 residues/turn.

    d=10 atoms in each H-bond closedring.

    note: green line includes a sharedH.

    ith keto-O H-bonded with (i+3)thNitrogen.

    The 310 helix has 31 helical

    symmetry.

    Each residue rotated by +120o.

    T e e x as 18

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    25/34

    T e - e x as 185

    Symmetry

    The most common helix in globularproteins. i

    thketo-O H-bonded with the (i+4)

    thNitrogen.

    13 atoms b/w H-bond donor and acceptor.

    The -helix has 3.61 helical symmetry.

    Each residue rotated by +100o.

    3.6 residues/turn (N = 3.6).

    1 application of the symmetry operator:

    translates a residue by +1 repeat, or P/3.6. thus, a right-handed helix, with = 1.

    For helices with non-integral symmetry: N and T converted to integers

    -helix said to have 185 symmetry. 18 residues in 5 full turns.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    26/34

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    27/34

    The trans-conformation is a 21

    helix

    Consider the fully extendedpolypeptide: keto-O and amino-H of each Peptide bond in

    the trans configuration, and

    For each residue, both and = 180o.

    remember our convention: Polymer chemistry: cis = 0

    o.

    Biopolymer in a fully trans conformation

    Somewhat expanded use of the term, trans. since cis/trans defined for confi urations.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    28/34

    The -strand is also a 21 Helix

    Differs from the trans-conformation: a -strand is pleated, like a curtain.

    shorter (smaller rise, h).

    In order to be stable, -strandscombine to form sheets: by strand-to-strand hydrogen

    bonding.

    two general types: anti-parallel sheets (A).

    parallel sheets (P).

    Generally, these sheets are

    twisted.

    Standard Helices of

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    29/34

    Standard Helices ofBiopolymers

    these include all of the standard, 2ostructures of

    biopolymers.

    Thus, our original contention is correct: the only symmetric building block formed by a chain

    of chiral units is a helix.

    Question: can the backbone adopt thesestructures?

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    30/34

    The Peptide Bond

    The peptide bond links each pair of adjacent residues; is an amide linkage,

    with a partial double-bond character.

    this bond not freely rotating: 6 atoms constrained to a plane:

    Ci, Ci, Oi, Ni+1,Hi+1,Ci+1.

    The peptide bond can adopt one of 2configurations: based on the positions of Ci and Ci+1. The trans-configuration (i= 180

    o):

    energetically favoredusually adopted.

    The cis-configuration: (i= 0o):

    sterically hinderedenergetically unfavorable. exception: Proline (cis and trans nearly

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    31/34

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    32/34

    The Ramachandran Plot

    The Ni-Ci bond, and Ci-Ci bond are single bonds: in principle, each may rotate freely

    and could then assume any values b/w +/- 180o.

    however, this is true only for Glycine (R = H).

    For other R-groups, non-covalent interactions b/wadjacent side-chains: place energetic constraints on and .

    thus, some conformations (,) are sterically disallowed.

    Sasisekharan and Ramachandran (1968): first plotted the van der Waals energies of interaction vs.

    (,). using poly-L-Alanine

    R = Me, the minimally constrained group (with a C-Carbon).

    with the trans-configuration for each peptide bond.

    the resulting plot is a Ramachandran Plot.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    33/34

    The Ramachandran Plot(cont.)

    Plot shown in terms of allowed regions here, c = helical repeat, with +/- = right/left-handed.

    dark regions = sterically allowed. lightly shaded regions = moderately disfavored.

    others = disallowed.Torsion angles of helices:

    all in allowed regions. Note: L- left handed -helix.

    Glycine only (achiral).

    Not a good model for: Glycine or Proline. residues w/ bulky side-chains. more on this in Lecture 8.

    esp. sequence effects.

  • 8/14/2019 Lecture 4' - Intro to Protein Struct I Spr08

    34/34

    Conclusion

    In this Lecture, we have discussed: Amino Acid Residues

    Characteristics and Hydrophobicities

    Protein 2o

    structure:

    The local, helical structures adopted by polypeptides.

    We will continue our discussion in Lecture 4: The intermediate level structure of Proteins:

    Super-2o

    and Domainal structure of Proteins. Methods of Visualization of the overall 3-D structures

    of Proteins. Protein 3

    ostructure:

    contact plots.

    Protein 4o

    structure.