protein modifications - different types, including co-translational and post-translational - inteins...

15
Protein modifications - different types, including co-translational and post-translational - inteins 16-1 Protein modification Protein modification It is estimated that the human proteome consists of ~300,000 different proteins, or about 10X more than the number of genes (!) - protein modifications - differential splicing

Post on 18-Dec-2015

224 views

Category:

Documents


1 download

TRANSCRIPT

Protein modifications- different types, including co-translational and post-translational- inteins

16-1

Protein modificationProtein modification

It is estimated that the human proteome consists of ~300,000 different proteins, or about 10X more than the number of genes (!)

- protein modifications - differential splicing

disulfide bond formation - covered

addition of a small moiety

phosphorylation, glycosylation, methylation, hydroxylation, etc. - today

truncation

self-cleavage

- viral capsid protease (SCP) - covered

- intramolecular chaperones - covered

cleavage of signal sequences - signal peptidases

protein splicing: inteins – covered in this lecture

intermolecular cleavage (e.g., by proteasome) - presentation

addition of ubiquitin or ubiquitin-like protein - presentation

16-2

Protein modifications: Protein modifications: formation or breakage of covalent bondsformation or breakage of covalent bonds

usually specified by amino acids at certain positions, or

various degenerate or specific motifs found in proteins

frequently, the N-terminal methionine is not present in mature proteins

Acylation includes acetylation, formylation, pyroglutamylation, myristoylation

~80% eukaryotic cytosolic proteins are acetylated at their N-termini makes N-terminal (Edman) sequencing difficult without special treatment

- requires enzymatic removal or treatment with chemicals that may cleave labile peptide bonds

formyl group occurs mostly as modification of the initiator methionine in bacteria pyroglutamate represents a cyclic amide generated from an N-terminal glutamic acid or glutamine residue

- can be generated by spontaneous cyclization but could also be an artifact of protein isolation under slightly acidic conditions

myristoylation is a co-translational lipid modification that is common to many signalling proteins; occurs only on N-terminal glycine residues

16-3

N- and C-terminal modifications: N- and C-terminal modifications: acylation, methylation, amidationacylation, methylation, amidation

peptide deformylase

(PDF)

formyl transferase

(FMT)

methionine aminopeptidase

(MAP)

Methylation of N-terminal amino groups is rare; different methylases do modify specific proteins, including ribosomal proteins

methylation of ribosomes affects their function

Amidation of peptides (e.g., hormones) sometimes occurs at the C-terminus

16-4

N- and C-terminal modifications: N- and C-terminal modifications: methylation and amidationmethylation and amidation

Phosphorylation phosphorylation can affect the activity and structure of proteins perhaps as many as 1 in 8 proteins are phosphorylated too many examples to list: e.g. HSF activity is modulated by phosphorylation; cell-signalling molecules are best characterized

Glycosylation glycosylation takes place in the ER, golgi by a variety of enzymes glycosylated proteins often found on the surface of cells or are secreted folding/assembly of glycosylated proteins requires ER molecular chaperones

16-5

Modification of individual side chains:Modification of individual side chains:phosphorylation and glycosylationphosphorylation and glycosylation

addition of GlcNAc (beta-O-linked N-acetylglucosamine) residues occurs in the cytoplasm and nucleus

- modifications are carried out by O-linked GlcNac transferases (OGTs)

- proteins modified by O-GlcNAc include:

cytoskeletal proteins, hormone receptors, kinases & other signalling molecules, nuclear pore proteins, oncogenes, transcription factors, tumor suppresors, transcriptional & translational machinery, viral proteins

Prenylation, fatty acid acylation proteins without major hydrophobic (transmembrane) domains can be directed to membranes by prenylation of their C-terminal cysteine residue

Hydroxylation and oxidation, carboxylation a variety of derivatives are known; e.g.,

hydroxyamino acids (hydroxyproline)

are very common in collagen

Selenocysteine/selenomethionine modification essentially all selenium in cells occurs as selenocysteine selenomethionine is a useful too for protein crystallography: can grow cells in the presence of the modified amino acid and produce protein containing se-Met; can deduce ‘phase’ of protein this way

16-6

Modification of individual side chains:Modification of individual side chains:various othersvarious others

- mass spectrometry is the most common today:- can identify modifying group with great precision, especiallyin combination with proteolytic digestion of proteins andHPLC analysis

- incorporation of radioactive groups by addition to growing cellse.g., 75Se-labeling and chromatographic isolation of proteinsNote that most of the modifiers can be purchased in radiolabeled form

- antibody cross-reactivity:e.g., antibody against phosphotyrosine

small-molecule modifications can affect not only the activity, but also the structure of proteins, much as ligands such as ATP can affect the activity and structure of proteins

16-7Identification of modificationsIdentification of modifications

use 2D gel electrophoresis to detect modified proteins in whole-cell (or partly purified) lysates

Fig. 1. O-GlcNAc is an abundant modification of nucleocytoplasmic proteins. Nucleo-cytoplasmic proteins from HeLa cells were immunopurified with an O-GlcNAc-specific antibody and stringently washed, and the O-GlcNAc-containing proteins were specifically eluted with free GlcNAc. The resulting proteins were separated on two-dimensional gels and visualized by silver staining. pI, isoelectric point; MW, molecular weight.From Wells et al. (2001) Science 291, 2376-8.

Lin et al. (1998) Cotranslational biogenesis of NF-kappaB p50 by the 26S proteasome. Cell 92, 819-828.

16-8

Presentation:Presentation: proteasome-mediated proteasome-mediated co-translational protein biogenesisco-translational protein biogenesis

Inteins represent the protein equivalent to the genomic DNA intron: elements that are spliced out of the final (mature) product

unlike introns, inteins are self-splicing through the use of an endonuclease

used commercially in biochemical applications (explanation on board)

Legend:Schematic illustration of protein splicing (upper part) and intein structure (lower part). The two terminal regions of the intein sequence form the splicing domain of a typical bifunctional intein. Six conserved sequence motifs (A to G) are shown. The intein sequence begins with the first amino acid of motif A and ends with the second last amino acid of motif G. Motifs C and E are the dodecapeptide motifs of endonuclease. A star (*) stands for hydrophobic amino acids (V, L, I, M). A dot (.) stands for a nonconserved position.adapted from Liu, X.-Q. (2000) Annu. Rev. Gen. 34, 61.

- inteins are 134-608 aa- the mini-inteins do not have the endonuclease domain but have other characteristics of inteins

16-9

Protein splicing: Protein splicing: inteinsinteins

Legend:Scenarios of intein evolution. A, loss of the endonuclease domain. B, breaking the intein sequence. C, loss of the splicing function. D, replacing the C-extein with a cholesterol molecule (green dot). E, loss of C-terminal cleavage and splicing. F, loss of N-terminal cleavage and splicing. G, placing intein fragments on two ends of a protein. H, breaking intein into 3 fragments. I, separating a middle fragment of the intein from the rest. J, presence of two different split inteins.

Scenarios A to D are based on examples observed in nature.

Scenarios E to G are based on engineered artificial inteins.

Scenarios H to J are purely hypothetical.

16-10Protein splicing: Protein splicing: inteinsinteins

N-extein represents the N-terminal polypeptide segment that is retained

C-extein represents the C-terminal segment that is retained

the Intein is what is spliced out (much as a genomic DNA intron)

Cys1, Asn154 and Ser155 represent conserved residues involved in the splicing reaction

16-11

Intein protein splicing: Intein protein splicing: mechanismmechanism

Hedgehog (Hh)Hedgehog (Hh)processingprocessing

- modification of SH by cholesterol is required for its activation- process similar to intein splicing

cholesterol

Secreted signaling proteins encoded by the hedgehog gene family induce specific patterns of differentiation in a variety of tissues and structures during vertebrate and invertebrate development. All known signaling activities of Hh proteins reside in Hh-N; Hh-C is responsible for both the peptide bond cleavage and cholesterol transfer components of the autoprocessing reaction.

Hall et al. (1997) Crystal Structure of a Hedgehog Autoprocessing Domain: Homology between Hedgehog and Self-Splicing Proteins. Cell 91, 85-97.

(A) catalytic residues of the Hh protein and(B) the crystal structure of the spliced protein

16-12

Intramolecular Autoprocessing Reactions of Hh and Self-Splicing Proteins

(A) Schematic drawing of a two-step mechanism for Hh autoprocessing (Porter et al., 1996b ). Aided by deprotonation by either solvent or a base (B1), the thiol group of Cys-258 initiates a nucleophilic attack on the carbonyl carbon of the preceding residue, Gly-257. This attack results in replacement of the peptide bond between Gly-257 and Cys-258 by a thioester linkage (step 1). The emerging -amino group of Cys-258 likely becomes protonated, and an acid (A) is shown donating a proton. The thioester is subject to a second nucleophilic attack from the 3 hydroxyl group of a cholesterol molecule, shown here facilitated by a second base (B2), resulting in a cholesterol-modified amino-terminal domain and a free carboxy-terminal domain. In vitro cleavage reactions may also be stimulated by addition of small nucleophiles including DTT, glutathione, and hydroxylamine.

Legend to right-hand Figure, part (A)

16-13