protein structure bioch301.1-2

Upload: john-anderson

Post on 02-Jun-2018

222 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/11/2019 Protein Structure Bioch301.1-2

    1/65

    Biochemistry 301

    Principles of Protein Structure

    Walter Chazin

    5140 BIOSCI/MRBIII

    E-mail: Walter.Chazin

    http://structbio.vanderbilt.edu/chazin

    Jan. 8-10, 2003

  • 8/11/2019 Protein Structure Bioch301.1-2

    2/65

    Text Books

    Branden and Tooze

    Introduction to Protein Structure

    Voet, Voet and Pratt

    Fundamentals of Biochemistry

    Stryer

    Biochemistry

  • 8/11/2019 Protein Structure Bioch301.1-2

    3/65

    Proteins: Polymers of Amino Acids

    20 different amino acids: many combinations

    Proteins are made in the RIBOSOME

  • 8/11/2019 Protein Structure Bioch301.1-2

    4/65

    Amino Acid Chemistry

    NH2 Ca

    R1

    CO

    H

    NH Ca

    R2

    COOH

    H

    NH2 Ca

    R

    COOH

    H

    amino acid

    20 different types

    Amino acid Polypeptide Protein

    NH2 Ca

    R1

    COOH

    H

    NH2 Ca

    R2

    COOH

    H

  • 8/11/2019 Protein Structure Bioch301.1-2

    5/65

    Amino Acid Chemistry

    NH2 Ca

    R

    COOH

    H

    amino acid

    The free amino and carboxylic acid groups have pKas

    COOH COO-

    pKa ~ 2.2

    NH2NH3+

    pKa ~ 9.4

    At physiological pH, amino acids are zwitterions

    +NH3 Ca

    R

    COO-

    H

  • 8/11/2019 Protein Structure Bioch301.1-2

    6/65

    Amino Acid Chemistry

    Note the axesAlso titratable

    groups in side chain

  • 8/11/2019 Protein Structure Bioch301.1-2

    7/65

    Glycine Gly - G 2.4 9.8

    Alanine Ala - A 2.4 9.9

    Valine Val - V 2.2 9.7

    Leucine Leu - L 2.3 9.7

    Isoleucine Ile - I 2.3 9.8

    Amino Acids with Aliphatic R-Groups

    pKas

  • 8/11/2019 Protein Structure Bioch301.1-2

    8/65

    Amino Acids with Polar R-GroupsNon-Aromatic Amino Acids with Hydroxyl R-Groups

    Serine Ser - S 2.2 9.2 ~13

    Threonine Thr - T 2.1 9.1 ~13

    Amino Acids with Sulfur-Containing R-Groups

    Cysteine Cys - C 1.9 10.8 8.3

    Methionine Met-M 2.1 9.3

  • 8/11/2019 Protein Structure Bioch301.1-2

    9/65

    Aspartic Acid Asp - D 2.0 9.9 3.9

    Asparagine Asn - N 2.1 8.8

    Glutamic Acid Glu - E 2.1 9.5 4.1

    Glutamine Gln - Q 2.2 9.1

    Acidic Amino Acids and Amide Conjugates

  • 8/11/2019 Protein Structure Bioch301.1-2

    10/65

    Basic Amino Acids

    Arginine Arg - R 1.8 9.0 12.5

    Lysine Lys - K 2.2 9.2 10.8

    Histidine His - H 1.8 9.2 6.0

  • 8/11/2019 Protein Structure Bioch301.1-2

    11/65

    Aromatic Amino Acids and Proline

    Phenylalanine Phe - F 2.2 9.2

    Tyrosine Tyr - Y 2.2 9.1 10.1

    Tryptophan Trp-W 2.4 9.4

    Proline Pro - P 2.0 10.6

  • 8/11/2019 Protein Structure Bioch301.1-2

    12/65

    Hierarchy of Protein Structure

    20 different amino acids: many combinations

    The order of amino acids: Protein sequence

    Primary Structure

    Local conformation, depends on sequenceSecondary Structure

    Overall structure of the chain(s) in full 3D

    Tertiary/Quaternary Structure

  • 8/11/2019 Protein Structure Bioch301.1-2

    13/65

    Beyond Primary Structure:

    The Peptide Bond

    -C - N-

    O=

    -H

    -C = N-

    O--

    -H

    Resonance structures

    Peptide plane is flatwangle ~180

    Partial double-bond:

    Peptide bond

  • 8/11/2019 Protein Structure Bioch301.1-2

    14/65

    Implications of Peptide Planes

    wangle varies little, f andangles vary alot

    Many f/combinations cause atoms to collide

    Each residue is sandwiched between two planes

    Ca

    HRf

    Peptide planes

    Ca

    H R

    Ca

  • 8/11/2019 Protein Structure Bioch301.1-2

    15/65

    Polypeptide Backbone

    Backbone restricted limited conformations

    Collisions with side chain groups further limit f/combinations

    Ca

    HRf

    Ca

    H R

    Ca

    H R

  • 8/11/2019 Protein Structure Bioch301.1-2

    16/65

    Secondary StructureLocal Conformation of Consecutive Residues

    Three low energy backbone f combinations

    1. Right-hand helix: a-helix (-40

    , -60

    )

    2. Extended: antiparallelb

    -sheet (140

    , -

    140

    )

    3. Left-hand helix (rare): a-helix (45, 45)

    Glycine: special it has no side chain!

    Hydrogen bondsbetween backbone atoms

    provides stability to secondary structures

    Amino acids have specific preferences

  • 8/11/2019 Protein Structure Bioch301.1-2

    17/65

    Secondary Structure-a

    Helix

    H-bond

  • 8/11/2019 Protein Structure Bioch301.1-2

    18/65

  • 8/11/2019 Protein Structure Bioch301.1-2

    19/65

    Secondary Structure-b

    Turn

    1

    43

    2

    Reverses direction of the chain

  • 8/11/2019 Protein Structure Bioch301.1-2

    20/65

    Ribbon and Topology DiagramsRepresentations of Secondary Structures

    Sheets (arrows), Helices (cylinders)

    B/T- Figure 2.17

  • 8/11/2019 Protein Structure Bioch301.1-2

    21/65

    Ribbon and Topology DiagramsOrganization of Secondary Structures

    helix

    B/T- Figure 2.11

  • 8/11/2019 Protein Structure Bioch301.1-2

    22/65

    Beyond Secondary Structure

    Supersecondary structure (motifs): small,discrete, commonly observed aggregates ofsecondary structures

    bsheet

    helix-loop-helix

    bab

    Domains: independent units of structure

    bbarrelfour-helix bundle

    *Domains and motifs sometimes interchanged*

  • 8/11/2019 Protein Structure Bioch301.1-2

    23/65

    Protein Motifs

    V/V/P- Figure 6.28

  • 8/11/2019 Protein Structure Bioch301.1-2

    24/65

    Hairpin Motif

    B/T- Figure 2.14

  • 8/11/2019 Protein Structure Bioch301.1-2

    25/65

    Helix-Loop-Helix (H-L-H) Motif

    B/T- Figure 2.12

  • 8/11/2019 Protein Structure Bioch301.1-2

    26/65

    EF-Hand H-L-H Motif

    B/T- Figure 2.13

  • 8/11/2019 Protein Structure Bioch301.1-2

    27/65

    Greek Key Motif

    B/T- Figure 2.15

  • 8/11/2019 Protein Structure Bioch301.1-2

    28/65

    Multi-Domain (Modular) Proteins

    EGF

    Protease

    Kringle

    Ca-binding

    Protein

    Domain

  • 8/11/2019 Protein Structure Bioch301.1-2

    29/65

    Tertiary Structure

    Definition: Overall 3D form of a molecule

    Organization of the secondary structures/

    motifs/domains

    Optimization of interactions between residues

    A specific 3D structure is formed

    All proteins have multiple secondary

    structures, almost always multiple motifs, and

    in some cases multiple domains

  • 8/11/2019 Protein Structure Bioch301.1-2

    30/65

    Tertiary Structure

    Specific structures result from long-range interactions

    Electrostatic (charged) interactions

    Hydrogen bonds (OH, NH, S H)

    Hydrophobic interactions

    Soluble proteins have an inside (core) and outside

    Folding driven by water- hydrophilic/phobic

    Side chain properties specify core/exteriorSome interactions inside, others outside

  • 8/11/2019 Protein Structure Bioch301.1-2

    31/65

    Tertiary Structure

    I. Ionic Interactions (exterior)

    Forms between 2 charged side chains:

    1 NegativeGlu,Asp 1 PositiveLys,Arg,His

    Also called salt bridges.

    Ionic interactions are pH-dependent (pKa).

    Occurs at the exterior

    NOTE: pKs for in the interior of a protein may be

    very different from free amino acid.

  • 8/11/2019 Protein Structure Bioch301.1-2

    32/65

    Tertiary Structure

    II. Hydrogen bonds (interior and exterior)

    Forms between side chains/backbone/water:

    Charged side chains: Glu,Asp,His,Lys,Arg

    Polar chains: Ser,Thr,Cys,Asn,Gln,[Tyr,Trp]

    Not a specific covalent bondlower energy.

    Occurs inside, at the exterior, and with water.

  • 8/11/2019 Protein Structure Bioch301.1-2

    33/65

    Tertiary Structure

    III. Hydrophobic Interactions (interior)

    Forms between side chains of non-polar residues:

    Aliphatic (Ala,Val,Leu,Ile,Pro,Met)

    Aromatic (Phe,Trp,[Tyr])

    Clusters of side chains- but no requirement fora specific orientation like an H-bond

    In the protein interior, away from waterNot pH dependent

  • 8/11/2019 Protein Structure Bioch301.1-2

    34/65

    Tertiary Structure

    IV. Disulfide Bonds (interior and exterior)

    Forms between Cys residues:

    Cys-SH + HS-Cys Cys-S-S-Cys

    Catalyzed by specific enzymes, oxidizing agents

    Restricts flexibility of the protein

    Usually within a protein, less for linking proteins

  • 8/11/2019 Protein Structure Bioch301.1-2

    35/65

    Disulfide Bonding

    V/V/P- Figure 16.6

  • 8/11/2019 Protein Structure Bioch301.1-2

    36/65

    Quaternary Structure

    Definition: Organization of multiple chain associations

    Oligomerization- Homo (self), Hetero (different)

    Used in organizing single proteins and protein

    machines

    Specific structures result from long-range interactions

    Electrostatic (charged) interactions

    Hydrogen bonds (OH, NH, S H)Hydrophobic interactions

    Disulfides only VERY infrequently

  • 8/11/2019 Protein Structure Bioch301.1-2

    37/65

    Quaternary Structure

    The classic example- hemoglobin a2-b2

    B/T- Figure 3.7 END OF PART 1

  • 8/11/2019 Protein Structure Bioch301.1-2

    38/65

    Protein Structure from Sequence

    The pattern of amino acid side chains determinesthe local conformation and the global structure

    *Pattern is more impo rtant than exact sequence*

    A T V R L L E W E D L

    Reporting/Comparing Protein Sequences

    A T V R L L E Y K D L

    5 10

    h-CaM

    b-CaM

    conservative non-conservative

  • 8/11/2019 Protein Structure Bioch301.1-2

    39/65

  • 8/11/2019 Protein Structure Bioch301.1-2

    40/65

    How Does a Protein Find Its Fold?

    A protein of n residues: 20npossible sequences!

    100 residue protein has 10020possibilities 1.3 X 10130!

    The latest estimates indicate < 40,000

    sequences in the human genomeTHERE MUST BE RULES!

    20 different amino acids: many combinations

    N C

    1 2 3 4

    Amino terminus Carboxyl terminus

    Residue number

  • 8/11/2019 Protein Structure Bioch301.1-2

    41/65

    Limitations on Protein Sequence

    Minimum length based on ability to perform a

    biochemical function: ~40 residues (e.g. inhibitors)

    Maximum length based on complexity of assembly:

    Conversion of DNA code and production of proteins

    is carried out by molecular machines that are not

    perfect. If the sequence gets too long, too manyerrors will build up.

    *Leng th is general ly 100-1000 resid ues*

  • 8/11/2019 Protein Structure Bioch301.1-2

    42/65

    Protein Folding

    The hydrophobic effect is the major driving forceHydrophobic side chains cluster/exclude water

    Release of water cages in unfolded state

    Other forces providing stability to the folded stateHydrogen bonds

    Electrostatic interactions

    Chemical cross links- Disulfides, metal ions

  • 8/11/2019 Protein Structure Bioch301.1-2

    43/65

    Protein Folding

    Random folding has too many possibilities

    Backbone restricted but side chains not

    A 100 residue protein would require 1087s to

    search all conformations (age of universe < 1018s)

    Most proteins fold in less than 10 s!!

    Proteins must fold along specific pathways!!

  • 8/11/2019 Protein Structure Bioch301.1-2

    44/65

    Protein Folding Pathways

    Usual order of folding eventsSecondary structures formed quickly (local)

    Secondary structures aggregate to form motifs

    Hydrophobic collapse to form domains

    Coalescence of domains

    Molecular chaperones assist folding in-v ivo

    Complexity of large chains/multi-domains

    Cellular environment is rich in interactingmoleculesChaperones sequester proteins andallow time to fold

  • 8/11/2019 Protein Structure Bioch301.1-2

    45/65

    Progressive Folding of ProteinsFrom Disordered to Native State

    Protein Folding Funnel

    V/V/P- Figures 6.37/38

  • 8/11/2019 Protein Structure Bioch301.1-2

    46/65

    Functional Classes of Proteins

    Receptors- sense stimuli, e.g. in neurons

    Channels- control cell contents

    Transport- e.g. hemoglobin in blood Storage- e.g. ferritin in liver

    Enzyme- catalyze biochemical reactions

    Cell function- multi-protein machines

    Structural- collagen in skin

    Immune response- antibodies

  • 8/11/2019 Protein Structure Bioch301.1-2

    47/65

    Structural Classes of Proteins

    1. Globular proteins (enzymes, molecular machines)

    Variety of secondary structures

    Approximately spherical shape

    Water soluble

    Function in dynamic roles (e.g. catalysis,

    regulation, transport, gene processing)

  • 8/11/2019 Protein Structure Bioch301.1-2

    48/65

    Globular Proteins

    V/V/P- Figure 6.27

    Hemoglobin a Conconavalin A Triose Phosphate isomerase

  • 8/11/2019 Protein Structure Bioch301.1-2

    49/65

    Structural Classes of Proteins

    2. Fibrous Proteins (fibrils, structural proteins)

    One dominating secondary structure

    Typically narrow, rod-like shape

    Poor water solubility

    Function in structural roles (e.g. cytoskeleton,

    bone, skin)

  • 8/11/2019 Protein Structure Bioch301.1-2

    50/65

  • 8/11/2019 Protein Structure Bioch301.1-2

    51/65

  • 8/11/2019 Protein Structure Bioch301.1-2

    52/65

    Photosynthetic Reaction Center

    B/T Figure 13.6

    Extracellular

    Intracellular(cytoplasmic)

    Membrane-spanning

  • 8/11/2019 Protein Structure Bioch301.1-2

    53/65

    I n the physical sense, the

    progression of l iving organisms

    results from the communication

    between molecules.

    I nteraction between molecules is

    determined by binding aff inities.

  • 8/11/2019 Protein Structure Bioch301.1-2

    54/65

    Binding Classification of Proteins

    Structural- other structural proteins

    Receptors- regulatory proteins, transmitters

    Toxins- receptors

    Transport- O2/CO2, cholesterol, metals, sugars

    Storage- metals, amino acids,

    Enzymes- substrates, inhibitors, co-factors

    Cell function- proteins, RNA, DNA, metals, ions

    Immune response- foreign matter (antigens)

  • 8/11/2019 Protein Structure Bioch301.1-2

    55/65

    Surface Determines What Binds

    1. Steric access

    2. Shape

    3. Hydrophobic

    accessible surface

    4. Electrostatic surface

    Sequence and structure optimized to generatesurface properties for requisite binding event(s)

  • 8/11/2019 Protein Structure Bioch301.1-2

    56/65

    Determinants of Protein Surface

    Function requires specific amino acid properties

    Not all amino acids are equally useful

    Abundant: Leu, Ala, Gly, Ser, Val, Glu

    Rare: Trp, Cys, Met, His

    Post-translational modifications

    Addition of co-factors- metals, hemes, etc.

    Chemical modification- phosphorylation,glycosylation, acetylation, ubiquination,

    sumoylation

    Bi di Alt P t i St t

  • 8/11/2019 Protein Structure Bioch301.1-2

    57/65

    Binding Alters Protein Structure

    Mechanisms of Achieving Functional Properties

    1. Allosteric Control- binding at one site effects changes

    in conformation or chemistry at a point distant in space

    2. Stimulation/inhibition by control factors- proteins, ions,

    metals control progression of a biochemical process(e.g. controlling access to active site)

    3. Reversible covalent modification- chemical bonding,

    e.g. phosphorylation (kinase/phosphatase)4. Proteolytic activation/inactivation- irreversible, involves

    cleavage of one or more peptide bonds

    Calcium Signal Transduction

  • 8/11/2019 Protein Structure Bioch301.1-2

    58/65

    Calcium Signal TransductionAl lostery & Stimulation by Control Factor

    Target

    Ca2+

    Calmodulin

  • 8/11/2019 Protein Structure Bioch301.1-2

    59/65

    SequenceStructureFunction

    Many sequences can give same structure

    Side chain pattern more important thansequence

    When homology is high (>50%), likely to have samestructure and function (Structural Genomics)

    Cores conserved

    Surfaces and loops more variable

    *3-D shape more conserved than sequence*

    *There are a limited number of structural frameworks*

  • 8/11/2019 Protein Structure Bioch301.1-2

    60/65

    I. Homologous: similar sequence (cytochrome c)

    Same structure

    Same function

    Modeling structure from homology

    Varied Relationships Between

    Sequence, Structure and Function

  • 8/11/2019 Protein Structure Bioch301.1-2

    61/65

    V/V/P Figure 6.31

    C-Type CytochromesSame structure/function- Different Sequence

    Heme

    Constant structural elements and basic architecture

  • 8/11/2019 Protein Structure Bioch301.1-2

    62/65

    Varied Relationships Between

    Sequence, Structure and Function

    I. Homologous: very similar sequence (cytochrome c)

    Same structure

    Same function

    Modeling structure from homology

    II. Similar function- different sequence (dehydrogenases)

    One domain same structure

    One domain different

  • 8/11/2019 Protein Structure Bioch301.1-2

    63/65

    B/T Figure 10.8

    NAD-Binding DomainsConserved Domains/Functional Elements

    Lactate DehydrogenaseAlcohol Dehydrogenase

    V i d R l i hi B

  • 8/11/2019 Protein Structure Bioch301.1-2

    64/65

    Varied Relationships Between

    Sequence, Structure and Function

    I. Homologous: very similar sequence (cytochrome c)

    Same structure

    Same function

    Modeling structure from homology

    II. Similar function- different sequence (dehydrogenases)

    One domain same structure

    One domain different

    III. Similar structure- different function (cf. thioredoxin)

    Same 3-D structure

    Not same function

    A i i

  • 8/11/2019 Protein Structure Bioch301.1-2

    65/65

    NADH-Binding and RedoxSame structure- Different Function

    Alcohol Dehydrogenase Lactate Dehydrogenase

    Thioredoxin