10 - rna modification

20
BIO 375: Genetics and Molecular Biology 1 10 - RNA Modifications After the RNA molecule is produced by transcription (Part 9), the structure of the RNA is often modified prior to being translated into a protein. These modifications to the RNA molecule are called RNA modifications or posttranscriptional modifications. Most RNA modifications apply only to eukaryotic RNA transcripts. Key Questions Which group of organisms modify their RNA transcripts? RNA Modifications The modifications to eukaryotic RNA transcripts include the following: RNA processing. RNA processing involves cutting large RNA transcripts into smaller ones. RNA processing can involve both exonucleases (removing nucleotides from the ends of the RNA transcript) or endonucleases (cleaving the RNA transcript at internal sites). The ribosomal RNA (rRNA) molecules that are essential components within ribosomes (see Part 11) commonly experience RNA processing. RNA splicing. Most eukaryotic genes are split genes. For split genes, initial transcription in the nucleus produces a precursor mRNA (pre-mRNA) This pre-mRNA is then spliced, meaning that internal segments within the pre-mRNA called introns (or intervening sequences) are removed and

Upload: others

Post on 18-Dec-2021

3 views

Category:

Documents


0 download

TRANSCRIPT

BIO 375: Genetics and Molecular Biology 1

10 - RNA Modifications

After the RNA molecule is produced by transcription (Part 9), thestructure of the RNA is often modified prior to being translated into aprotein. These modifications to the RNA molecule are called RNAmodifications or posttranscriptional modifications. Most RNAmodifications apply only to eukaryotic RNA transcripts.

Key Questions

Which group of organisms modify their RNA transcripts?

RNA Modifications

The modifications to eukaryotic RNA transcripts include the following:

RNA processing. RNA processing involves cutting large RNAtranscripts into smaller ones. RNA processing can involve bothexonucleases (removing nucleotides from the ends of the RNAtranscript) or endonucleases (cleaving the RNA transcript atinternal sites). The ribosomal RNA (rRNA) molecules that areessential components within ribosomes (see Part 11) commonlyexperience RNA processing.RNA splicing. Most eukaryotic genes are split genes. Forsplit genes, initial transcription in the nucleus produces aprecursor mRNA (pre-mRNA) This pre-mRNA is then spliced,meaning that internal segments within the pre-mRNA calledintrons (or intervening sequences) are removed and

BIO 375: Genetics and Molecular Biology 2

discarded (see Figure 10.1). The remaining RNA segmentscalled exons (or expressed sequences) are spliced together toproduce a mature mRNA molecule that is transported to thecytoplasm of the cell for translation.5’ end capping. The 5’ end capping process involves theattachment of a modified nucleotide called 7-methylguanosine (7-mG) to the 5’ end of pre-mRNAmolecules. The added 7-mG is sometimes called the 5’ cap.3’ end polyadenylation. 3’ end polyadenylation involves theaddition of many adenine (A) nucleotides to the 3’ end of thepre-mRNA molecule. The added string of A nucleotides is calledthe polyA tail.RNA editing. RNA editing involves changing the nucleotidesequence of a mRNA molecule prior to translation.Base modification. Occasionally, nitrogenous bases within theRNA transcript are covalently modified by the addition ofchemical groups, including methyl groups.

Figure 10.1 RNA Modifications Overview --- Image used from OpenStax (access for free athttps://books.byui.edu/-vuzA

BIO 375: Genetics and Molecular Biology 3

Key Questions

What is the difference between a pre-mRNA and a maturemRNA?What is the difference between an intron and an exon?What is meant by 5’ end capping?What is meant by 3’ end polyadenylation?

5’ End Capping

The 5’ end of the pre-mRNA molecule is modified by the addition of a7-methylguanosine (7-mG) nucleotide. The process of adding the 7-mG to the pre-mRNA is called 5’ end capping. 5’ end capping occursduring the transcription of the pre-mRNA and is the first RNAmodification that occurs. 5’ end capping (see Figure 10.2) involvesthe following enzymes:

RNA 5’-triphosphatase removes one of the three phosphates1.from the nucleotide at the 5’ end of the pre-mRNA transcript.Guanylyltransferase cleaves GTP to produce GMP and2.pyrophosphate (PPi). Guanylyltransferase then attaches the 5’carbon of the GMP molecule to the 5’ carbon on the nucleotideat the 5’ end of the pre-mRNA transcript. It is important to notethat an unusual 5’ to 5’ covalent bond is formed. There are nowthree phosphate groups between these two adjacentnucleotides.Methyltransferase attaches a methyl group to the added3.guanine base, producing the 7-mG cap.

The 7-mG cap functions to do the following:

Serve as a binding site for proteins that transport the mRNAfrom the nucleus to the cytoplasm of the cell.Serve as a recognition site for translation factor proteins thathelp the ribosome bind to the mRNA. Once the ribosome binds

BIO 375: Genetics and Molecular Biology 4

to the mRNA, translation begins (see Part 11).Protect the 5’ end of the mRNA transcript from degradation byexonucleases.Regulate RNA splicing.

Figure 10.2 5' end capping mechanism --- Image created by JET

Key Questions

How does the 7-mG structure contribute to translation?Which nucleotide provides the energy for 5’ end capping?What is unusual about the covalent bond between 7-mG and therest of the pre-mRNA molecule?

3’ End Polyadenylation

The 3’ end of the pre-mRNA is modified by the addition of a polyAtail, a string of approximately 250 adenine (A) nucleotides. Theprocess of adding a polyA tail to the mRNA transcript (see Figure10.3), called 3’ end polyadenylation, involves:

The detection of two recognition sequences, called1.

BIO 375: Genetics and Molecular Biology 5

polyadenylation signal sequences. Both polyadenylationsignal sequences are located near the 3’ end of the pre-mRNA.

The first polyadenylation signal sequence is 5’-AAUAAA-3’. This polyadenylation signal sequence isrecognized by the endonuclease cleavage andpolyadenylation specificity factor (CPSF).The second polyadenylation signal sequence, enriched inguanine and uracil bases, is called the GU-richsequence. This GU-rich sequence is the binding site forthe cleavage stimulatory factor (CstF) protein.Once CPSF is bound to the 5’-AAUAAA-3’ site and CstF isbound to the GU-rich region, then both protein factorsinteract.

CPSF cleaves the pre-mRNA 10-35 nucleotides downstream2.(farther towards the 3’ end) of the 5’-AAUAAA-3’ sequence. Thisfree 3’ end of the pre-mRNA is then available for the addition ofadenine nucleotides.Poly(A)-polymerase (PAP) attaches many (as many as 250)3.adenine nucleotides to the 3’ end of the pre-mRNA transcript.

The polyA tail added by 3’ end polyadenylation functions as follows:

The polyA tail protects the 3’ end of the pre-mRNA transcriptfrom degradation by exonucleases.The polyA tail facilitates the transport of the mRNA from thenucleus to the cytoplasm of the cell.The polyA tail enhances the recognition of the mRNA transcriptby the ribosome during translation.

The 3’ end polyadenylation process occurs after 5’ end capping, butprior to RNA splicing. In fact, 3’ end polyadenylation assists interminating transcription in eukaryotes by the torpedo model (seePart 9).

BIO 375: Genetics and Molecular Biology 6

Figure 10.3 3' end polyadenylation mechanism --- Image created by SL

Key Questions

How does 3’ end polyadenylation contribute to transcriptiontermination in eukaryotes?Describe the functions of CPSF, CstF, and Poly(A)-polymerase.

Splicing of Group I and Group II Introns

There are three general mechanisms to remove introns from RNAmolecules. The group I and group II mechanisms are limited tocertain eukaryotes or certain organelles in the cell. For example, thegroup I mechanism removes the introns found in ribosomal RNA(rRNA) molecules in certain protozoa. The group II mechanism

BIO 375: Genetics and Molecular Biology 7

removes the introns found in the mRNA and transfer RNA (tRNA)transcripts produced by mitochondrial and chloroplast genes. Thespliceosome mechanism is the major mechanism to remove intronsfrom pre-mRNA transcripts. The spliceosome mechanism removesintrons from the pre-mRNAs produced by most structural genes in thenucleus of eukaryotic cells.

Removing group I introns. RNA splicing of group I intronsoccurs by self-splicing, meaning that the precursor RNAmolecule catalyzes the removal of its own intron (see Figure10.4). These catalytic precursor RNAs molecules are also calledribozymes. The self-splicing of group I introns involves thefollowing mechanism:

A free guanosine nucleoside (guanine nitrogenous1.base covalently linked to ribose) binds to a pocket withinthe intron. The guanosine nucleoside bound to the intronserves as a cofactor for the remaining steps in thereaction.A break forms at the junction between the 3' end of the2.first exon and the 5’ end of the intron. The guanosinenucleoside becomes attached to the 5’ end of the intron.The released exon cleaves the junction between the 3’3.end of the intron and the second exon.A phosphodiester bond is formed that links the first and4.second exons, generating a mature RNA transcript. Theintron is released and degraded.

BIO 375: Genetics and Molecular Biology 8

Figure 10.4 Removing group I introns --- Image created by SL

Removing group II introns. RNA splicing of group II intronsalso occurs by self-splicing, meaning that the precursor RNAremoves its own intron (see Figure 10.5). In other words, pre-mRNAs containing group II introns are also ribozymes. The self-splicing of group II introns involves the following:

The 2’-OH group of an adenine nucleotide within the1.intron helps to cleave the junction between the 3’ end ofthe first exon and the 5’ end of the intron. In thisreaction, the adenine nucleotide serves as a cofactor forthe enzymatic reaction.The released exon cleaves the junction between the 3’2.end of the intron and the second exon.A phosphodiester bond is formed that links the first and3.second exons, generating a mature RNA transcript. Theintron is released and degraded.

Figure 10.5 Removing group II introns --- Image created by SL

BIO 375: Genetics and Molecular Biology 9

Key Questions

What is a ribozyme?Describe the major events that occur in the group I and groupII splicing mechanisms.What molecules serve as cofactors in the group I and group IIsplicing mechanisms?

Removal of Introns by Spliceosomes

Transcription of most structural genes in the nucleus of eukaryoticcells produces a pre-mRNA molecule. Splicing of nuclear pre-mRNAsto remove the introns involves a large multi-subunit complex calledthe spliceosome.

The spliceosome binds to recognition sequences within the intronRNA sequence (see Figure 10.6). These recognition sequences withinthe intron include:

The 5’ splice site. The 5’ splice site (also called the donorsequence) is a 5’-GU-3’ at the 5’ end of the intron RNAsequence.The branch site. The branch site is an adenine nucleotide (A)near the middle of the intron RNA sequence.The 3’ splice site. The 3’ splice site (also called the acceptorsequence) is a 5’-AG-3’ at the 3’ end of the intron RNAsequence.

The subunits of the spliceosome are called small nuclearribonucleoproteins or snRNPs (“snurps”). snRNPs are composed ofuracil-rich small nuclear RNAs (snRNAs) that act as RNA enzymes(ribozymes) to remove the introns from nuclear pre-mRNA molecules.snRNPs are also composed of protein subunits that function tostabilize the spliceosome.

BIO 375: Genetics and Molecular Biology 10

The spliceosome splicing mechanism occurs as follows:

The U1 snRNP binds to the 5’ splice site, and the U2 snRNP1.binds to the branch site adenine within the intron RNAsequence.Additional snRNPs called U4, U5, and U6 bind to the intron.2.These five snRNPs (U1, U2, U4, U5, and U6) assemble to formthe spliceosome complex.The intron loops out bringing the two exon sequences close3.together. The U1 snRNP is now adjacent to U2.The 5’ splice site is cut by U1, and the 5’ end of the intron is4.covalently linked to the 2’-OH group of the branch site adenine,forming an RNA structure called a lariat.The U1 and U4 snRNPs are released.5.The 3’ splice site is cut by the U5 snRNP.6.A phosphodiester bond is formed that links the two exons7.together to form the mature mRNA. The intron is releasedalong with U2, U5, and U6.

BIO 375: Genetics and Molecular Biology 11

Figure 10.6 Spliceosome splicing --- Image created by SL

Key Questions

Which two splicing mechanisms are found in humans?What are the names of the three consensus sequences foundwithin spliceosome introns?Which spliceosome components are ribozymes?What are the functions of the U1 and U5 snRNPs?

Identifying Introns Using R Loop Experiments

Introns were initially identified within the β-globin and ovalbumingenes by performing R-loop (hybridization) experiments. Theseexperiments relied on denaturing a double-stranded DNA molecule

BIO 375: Genetics and Molecular Biology 12

that contains a gene, allowing a mature mRNA molecule to formhydrogen bonds (hybridize) with the template DNA strand, adding thecoding strand DNA, and finally, examining the resulting nucleic acidstructure in an electron microscope. What would such a molecule looklike at the end of the R-loop (hybridization) experiment?

Gene hybridized to the pre-mRNA. The pre-mRNA that hasformed hydrogen bonds with the template DNA strand preventsthe coding DNA strand from binding. Because the coding DNAstrand fails to bind to the template DNA strand, the codingDNA strand loop outs from the RNA-DNA hybrid region. Thisloop where the coding DNA strand cannot bind to the templateDNA strand is called an RNA displacement loop or R loop (seeFigure 10.7).Gene hybridized to the mature mRNA. When the DNAcontains introns, hybridization between the template DNAstrand and the mature mRNA forces the intron DNA sequencesin the template DNA strand to loop out, because the maturemRNA lacks a corresponding sequence that can bind to thetemplate strand introns. Adding the coding DNA strandproduces R-loops with an intervening region of double-strandedDNA (i.e., the intron sequences within the template and thecoding DNA strands from hydrogen bonds) called an intronloop.

BIO 375: Genetics and Molecular Biology 13

Figure 10.7 R-Loop Results --- Image created by SL

Key Questions

Suppose a gene contains four introns. How many R loops wouldbe observed in the electron microscope at the end of an R loopexperiment? How many intron loops would be observed?

Identifying Introns by Comparing gDNA with cDNA

Introns within genes can also be identified by comparing the length ofa genomic DNA (gDNA) version of a gene to the complementaryDNA (cDNA) version of the same gene. gDNA is the version of a genefound in the genome; the gDNA version of a gene contains bothintrons and exons. cDNA is produced in the laboratory by reversetranscription (see Part 8). Reverse transcription converts mature

BIO 375: Genetics and Molecular Biology 14

mRNA into a cDNA molecule using the viral enzyme reversetranscriptase. Since the cDNA molecule is produced from the maturemRNA, cDNA molecules contain exons but lack introns. The gDNAversion of the gene, which contains introns, will be longer than thecDNA version of the same gene, which lacks introns.

The polymerase chain reaction (PCR) technique (see Part 8) can beused to make billions of copies of the gDNA and the cDNA versions ofany gene of interest. The gDNA and cDNA PCR products are thenseparated by size using agarose gel electrophoresis (see Part 8).The size difference between the gDNA and the cDNA copy of the genecan be easily observed on an agarose gel (see Figure 3.2).

Figure 10.8 Comparing cDNA to gDNA to identify introns --- Image provided by K. Mark DeWall

BIO 375: Genetics and Molecular Biology 15

Key Questions

What is the difference between gDNA and cDNA?How can comparing gDNA to cDNA on an agarose gel help youdetermine that a gene contains introns?

Alternative Splicing

Alternative splicing involves splicing a single type of pre-mRNAmolecule in various ways to produce different mature mRNAmolecules (see Figure 10.9). Each of these mature mRNAs can thenproduce a slightly different protein upon translation. These distinct,yet related protein isoforms, all derived from a single gene, can havespecialized functions.

Alternative splicing is beneficial in that it allows eukaryotes to carryfewer genes in the genome, permitting a relatively small number ofgenes the flexibility to encode a vast array of proteins. In humans, it isestimated that 30–60% of the genes in the genome are alternativelyspliced. As a result, the human genome which contains approximately23,000 structural genes can produce at least ten times that number ofprotein products.

One example of alternative splicing involves the splicing of the pre-mRNA molecule for α-tropomyosin, a protein involved in musclecontraction. The α-tropomyosin gene contains 14 exons (13 introns).There are two types of exons within the α-tropomyosin pre-mRNA:

Constitutive exons. Constitutive exons are exons that arealways included in the mature α-tropomyosin mRNAs productsof alternative splicing. These exons likely encode amino acidsequences that maintain the general three-dimensionalstructure of the protein.Alternative exons. Alternative exons vary between mature α-tropomyosin mRNAs. In one cell type, one combinations of

BIO 375: Genetics and Molecular Biology 16

alternative exons are spliced together, in another cell type adifferent combination of alternative exons are spliced together.The result is two related proteins that have slightly differentfunctions to meet the unique needs of these two cell types.

Figure 10.9 Alternative splicing allows one gene to produce three proteins. --- DNA Alternative Splicing byNational Human Genome Research Institue and is used under CC0

Key Questions

Why is alternative splicing advantageous to eukaryotic cells?What are protein isoforms?What is the difference between a constitutive exon and analternative exon?

Patterns of Alternative Splicing

Alternative splicing is regulated by splicing factor proteins. Thesesplicing factor proteins help the spliceosome choose which splice sitesto use during RNA splicing. Different cell types have different splicingfactor proteins, allowing different RNA splicing patterns to occur ineach cell type. The SR proteins are an example of a group of splicingfactor proteins found in animals, including humans.

BIO 375: Genetics and Molecular Biology 17

Here are some common alternative splicing patterns observed ineukaryotic cells:

Exon Skipping. Some splicing factor proteins act as splicerepressors. Splice repressor proteins prevent the spliceosomefrom recognizing a particular splice site (see Figure 10.10)within an intron. If a splice repressor protein blocks a 3’ splicesite within an intron, the 3’ splice site in the next intron ischosen instead, and the intervening exon is removed duringRNA splicing (exon skipping).Alternative 5' and 3' Splice Sites. In addition to the 5’ splicesite, the branch site, and the 3’ splice sites discussed earlier,there are other pre-mRNA sequences involved in splicing.These additional sequence elements, often located within anearby exon, can promote the use of a particular 5’ or 3’ splicesite. For example, some potential 5’ or 3’ splice sites in the pre-mRNA are poorly recognized by the spliceosome. In certain celltypes, the binding of a splice activator protein to a spliceenhancer sequence within a nearby exon promotes the use ofthese otherwise poorly recognized 5’ or 3’ splice sites, resultingin a new splicing pattern (see Figure 10.10).Mutually Exclusive Exons. In some cases, splicing events arecoordinated to ensure that unique protein isoforms areproduced from a single gene in different cell types. Forexample, suppose there are four exons (three introns) in a pre-mRNA molecule. During splicing in one cell type, exon two isconsistently retained in the mature mRNA, while exon three isspliced out. In a different cell type, exon two is always splicedout while exon three is retained in the mature mRNA. Exonsone and four are found in the mature mRNAs in both cell types(constitutive exons). If exons two and three happen to be thesame length, then the final protein product will be the samesize in both cells; however, the amino acid sequence (andprotein function) will differ slightly because exons two andthree encode different amino acids.

BIO 375: Genetics and Molecular Biology 18

We know little about the true complexity of alternative splicing.Scientists believe that the alternative splicing patterns are cell-typeand developmental stage specific. Moreover, mutations in humangenes often lead to aberrant splicing. This aberrant splicing producesabnormal protein isoforms and ultimately, disease phenotypes.

Figure 10.10 Splicing repressor and activator proteins --- Image created by SL

Key Questions

What happens when a splice repressor protein binds to the 3’splice site within an intron?What effect would a splicing activator protein binding to asplicing enhancer sequence have on alternative splicing?What is meant by the term mutually exclusive exons?

BIO 375: Genetics and Molecular Biology 19

Review QuestionsFill in the blank:

__________________ is an endonuclease that releases the pre-1.mRNA from RNA polymerase II during transcription.____________________ is an enzyme that attaches two nucleotides2.together via a 5’ to 5’ covalent bond.One function of the 7-mG cap is to3._______________________________________.A __________________________ protein prevents the spliceosome4.from binding to a 3’ splice site.________________________ is an enzyme that adds adenine5.nucleotides to the 3' end of a pre-mRNA. These are added inthe 5' to 3' direction.The Group I intron splicing mechanism uses the nucleoside6.______________ as a cofactor during catalysis, while the_______________ intron splicing mechanism uses an adeninenucleotide as a cofactor during catalysis.The U2 snRNP binds to the ________________________ site of the7.pre-mRNA.Spliceosome subunits are composed of two components:8.proteins and _________________.______________________ is a pattern of alternative splicing where9.one exon is always retained in one cell while that same exon isalways skipped in another cell.

BIO 375: Genetics and Molecular Biology 20

Dewall, M. (n.d.). BIO 375: Genetics and MolecularBiology. BYU-I Books.https://books.byui.edu/genetics_and_molecul