rna$seq(introduc1on( - github pages ·...
TRANSCRIPT
![Page 1: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/1.jpg)
RNA-‐seq Introduc1on
Promises and pi7alls
![Page 2: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/2.jpg)
DNA is the same in all cells but which RNAs that is present is different in all
cells
![Page 3: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/3.jpg)
There is a wide variety of different func1onal RNAs
![Page 4: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/4.jpg)
Which RNAs (and some1mes then translated to proteins) varies between
samples -‐Tissues -‐ Cell types
-‐ Cell states
-‐Individuals -‐Cells
![Page 5: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/5.jpg)
RNA gives informa1on on which genes that are expressed
How DNA get transcribed to RNA (and some1mes then translated to proteins) varies between e. g. -‐Tissues -‐ Cell types
-‐ Cell states
-‐Individuals
![Page 6: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/6.jpg)
ENCODE, the Encyclopedia of DNA Elements, is a project funded by the Na1onal Human Genome Research Ins1tute to iden1fy all regions of transcrip1on, transcrip1on factor associa1on, chroma1n structure and histone modifica1on in the human genome sequence.
![Page 7: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/7.jpg)
ENCyclopedia Of Dna Elements
![Page 8: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/8.jpg)
Different kind of RNAs have different expression values
Landscape of transcrip/on in human cells, S Djebali et al. Nature 2012
![Page 9: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/9.jpg)
What defines RNA depends on how you look at it
Variants
Adapted from Landscape of transcrip/on in human cells, S Djebali et al. Nature 2012
Abundance House keeping RNAs
mRNAs
Regulatory RNAs
Novel intergenic
None
Coverage
![Page 10: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/10.jpg)
One gene many different mRNAs RNA-seq: alternative splicing
![Page 11: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/11.jpg)
Defining func1onal DNA elements in
the human genome • Statement
– A priori, we should not expect the transcriptome to consist exclusively of func1onal RNAs.
• Why is that – Zero tolerance for errant
transcripts would come at high cost in the proofreading machinery needed to perfectly gate RNA polymerase and splicing ac1vi1es, or to instantly eliminate spurious transcripts.
– In general, sequences encoding RNAs transcribed by noisy transcrip1onal machinery are expected to be less constrained, which is consistent with data shown here for very low abundance RNA
• Consequence – Thus, one should have high
confidence that the subset of the genome with large signals for RNA or chroma1n signatures coupled with strong conserva1on is func1onal and will be supported by appropriate gene1c tests.
– In contrast, the larger propor1on of genome with reproducible but low biochemical signal strength and less evolu1onary conserva1on is challenging to parse between specific func1ons and biological noise.
![Page 12: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/12.jpg)
Defining functional DNA elements in the human genome Kellis M et al. PNAS 2014;111:6131-6138
Biochemical evidence not enough to iden1fy func1onal RNAs
![Page 13: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/13.jpg)
• RNA seq course
![Page 14: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/14.jpg)
The RNA seq course • From RNA seq to reads (Introduc1on) • Mapping reads programs (Monday) • Transcriptome reconstruc1on using reference (Monday) • Transcriptome reconstruc1on without reference (Monday) • QC analysis (Tuesday) • Differen1al expression analysis (Tuesday) • miRNA analysis (Tuesday) • Gene set analysis (Wednesday) • Long RNA seq analysis(Thursday) • Single cell analysis (Wednesday)
![Page 15: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/15.jpg)
How are RNA-‐seq data generated?
Sampling process
![Page 16: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/16.jpg)
Depending on the different steps you will get different results
AAAAAAAA
enrichments -‐>
reads -‐>
library -‐>
RNA-‐> PolyA (mRNA) RiboMinus (-‐ rRNA) Size <50 nt (miRNA ) …..
Size of fragment Strand specific 5’ end specific 3’ end specific …..
Single end (1 read per fragment) Paired end (2 reads per fragment)
![Page 17: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/17.jpg)
![Page 18: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/18.jpg)
Promises and pi7alls
Long reads • Low throughput (-‐) • Complete transcripts (+) • Only highly expressed
genes (-‐-‐) • Expensive (-‐) • Low background noise (+) • Easy downstream analysis
(+)
short reads • High throughput (+) • Frac1ons of transcripts (-‐) • Full dynamic range (+-‐) • Unlimited dynamic range (+) • Cheap (+) • Low background noise (+) • Strand specificity (+) • Re-‐sequencing (+)
1
10
100
1000
10000
1 10 100 1000 10000 100000 1000000
Signal
# trancripts/cell
EST
MicroArray
RNAseq
![Page 19: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/19.jpg)
RNA seq reads correspond directly to abundance of RNAs in the sample
![Page 20: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/20.jpg)
Map reads to reference
![Page 21: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/21.jpg)
Transcriptome assembly using reference
![Page 22: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/22.jpg)
Transcriptome assembly without reference
![Page 23: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/23.jpg)
![Page 24: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/24.jpg)
Quality control -‐samples might not be what you think they are • Experiments go wrong – 30 samples with 5 steps from samples to reads has 150 poten1al steps for errors
– Error rate 1/100 with 5 steps suggest that one of every 20 samples the reads does not represent the sample
• Mixing samples – 30 samples with 5 steps from samples to reads has ~24M poten1al mix ups of samples
– Error rate 1/ 100 with 5 steps suggest that one of every 20 sample is mislabeled
• Combine the two steps and approximately one of every 10 samples are wrong
![Page 25: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/25.jpg)
RNA QC Read quality
Transcript quality
Mapping sta1s1cs
Compare between samples
![Page 26: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/26.jpg)
Differen1al expression analysis using univariate analysis
Typically univariate analysis (one gene at a 1me) – even though we know that genes are not independent
![Page 27: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/27.jpg)
Gene set analysis and data integra1on
26
RNase Y and virulence
- 38% of total virulence factors are affected by RNase Y deletion (88% downregulated) - 18% cleaved by RNase Y
with the help of Cristina Della Beffa
¾ SCP hidden by gene downregulation ?
![Page 28: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/28.jpg)
microRNA analysis (Johan)
(Berezikov et al. Genome Research, 2011.)
![Page 29: RNA$seq(Introduc1on( - GitHub Pages · RNA(gives(informaon(on(which(genes(thatare(expressed(How(DNA(gettranscribed(to(RNA((and(some1mes(then(translated(to(proteins)(varies(between(e.(g](https://reader034.vdocument.in/reader034/viewer/2022051908/5ffcbf59c684d76add57f9b2/html5/thumbnails/29.jpg)
Single cell RNA-‐seq analysis
(Sandberg, Nature Methods 2014)