annotation-agnostic differential expression analysis

25
Annotation-agnostic differential expression analysis Leonardo Collado-Torres @fellgernon #ENAR2016 www.slideshare.net/lcolladotor

Upload: lcolladotor

Post on 15-Apr-2017

264 views

Category:

Science


0 download

TRANSCRIPT

Page 1: Annotation-agnostic differential expression analysis

Annotation-agnostic differential expression analysis

Leonardo Collado-Torres @fellgernon #ENAR2016

www.slideshare.net/lcolladotor

Page 2: Annotation-agnostic differential expression analysis

Mo#va#ngproblem:iden#fyandvalidateregionsofthegenomethatchangeexpressionwhenanalyzing#ssueswithpoten#allyincompletetranscriptomeannota#on

Page 3: Annotation-agnostic differential expression analysis

RNA-seq reads

Genome (DNA)

RNA transcripts (many possible variants)

Measuring gene expression: RNA-seq

Adapted from @jtleek

Page 4: Annotation-agnostic differential expression analysis

Genome (DNA)

Mapped reads

Adapted from @jtleek

Commonanalysispipelines:

• Featurecoun#ng(geneorexonlevel)• Transcriptassembly

Page 5: Annotation-agnostic differential expression analysis

Challenges in counting

hBp://www-huber.embl.de/users/anders/HTSeq/doc/count.html

Page 6: Annotation-agnostic differential expression analysis

Annotation variation

Frazee et al, Biostatistics, 2014

Page 7: Annotation-agnostic differential expression analysis

DER finder approach

•  Findcon#guousbasepairswithDifferen#alExpressionsignalàDERegionsorDERs

•  Findnearestannotatedfeature

Frazee et al, Biostatistics, 2014

Page 8: Annotation-agnostic differential expression analysis

coverage vector 2 6 0 11 6

Genome (DNA)

Read coverage

Adapted from @jtleek

Page 9: Annotation-agnostic differential expression analysis

Jaffe et al, Nat. Neuroscience, 2015

Page 10: Annotation-agnostic differential expression analysis

Finding DERs by expressed-regions

Page 11: Annotation-agnostic differential expression analysis

CBC:28

MD:24STR:28AMY:31HIP:32

DFC:34

Total N samples: 487

BrainSpan data

CoverageDatafromBrainSpan:hBp://download.allenins#tute.org/brainspan/MRF_BigWig_Gencode_v10/

VFC:30 MFC:32 OFC:30 M1C:25

S1C:26 IPC:33 A1C:30 STC:35 ITC:33

V1C:33

Page 12: Annotation-agnostic differential expression analysis
Page 13: Annotation-agnostic differential expression analysis
Page 14: Annotation-agnostic differential expression analysis

• Data:3#ssues(liver,tes#s,heart),8sampleseach• Alignwith

• Iden#fyexpressedregionswithderfinder– Adjustcoverage(40million)– Findexpressedregions(cutoff5)– DiscardERs<9bp

GTEx: DERs via expressed-regions

Page 15: Annotation-agnostic differential expression analysis

Presence of intronic ERs

CanstrictlyintronicERsdifferen#ate#ssues?

Page 16: Annotation-agnostic differential expression analysis

PCs differentiate tissues

Page 17: Annotation-agnostic differential expression analysis

Differential intronic ERs | exonic ERs

Page 18: Annotation-agnostic differential expression analysis

Differential intronic ERs | exonic ERs

Page 19: Annotation-agnostic differential expression analysis

Differential intronic ERs | exonic ERs

Page 20: Annotation-agnostic differential expression analysis

Simulation setup 3replicates:

2groups,eachwith5samples~2millionpaired-endreadsforchr171/6high,1/6lowingroup2vsgroup1

Annota#on:

completemissing20%oftranscripts(8.28%exons)

Referenceset:

3868exonsthatoverlaponly1transcript

Page 21: Annotation-agnostic differential expression analysis

Simulation results

•  Similarpowertomethodsthathavecompleteannota#on

•  Methodswithincorrectannota#onlosealotofpower•  HigherempiricalFDR/FPR

Page 22: Annotation-agnostic differential expression analysis

Collado-Torres et al, F1000Research, 2015

regionReport

Page 23: Annotation-agnostic differential expression analysis

Mo#va#ngproblem:iden#fyandvalidateregionsofthegenomethatchangeexpressionwhenanalyzing#ssueswithpoten#allyincompletetranscriptomeannota#onderfinderpermitsdiscoveryofnovelexpressedregions1. weiden#fiedexpressedintronicregionsthat

differen#ate#ssuesindependentlyofthenearestexonicregion

2. wehavedevelopedtoolsforreproducible/shareablerepor#ng

Page 24: Annotation-agnostic differential expression analysis

Acknowledgements

Hopkins Jeffrey Leek Alyssa Frazee Abhinav Nellore Chris Wilks Ben Langmead

LIBD Andrew Jaffe Jooheon Shin Nikolay Ivanov Amy Deep Ran Tao Yankai Jia Thomas Hyde Joel Kleinman Daniel Weinberger

Harvard Rafael Irizarry Michael Love Funding NIH LIBD CONACyT México

Page 25: Annotation-agnostic differential expression analysis

References + software + code •  Collado-Torres, et al. bioRxiv (2015) doi:10.1101/015370

–  http://bioconductor.org/packages/derfinder –  http://lcolladotor.github.io/derSupplement/

•  Collado-Torres, et al. F1000Research (2015) doi:10.12688/f1000research.6379.1

-  http://www.bioconductor.org/packages/regionReport -  http://lcolladotor.github.io/regionReportSupp/

•  Nellore, Collado-Torres, et al. bioRxiv (2015) doi:10.1101/019067

- rail.bio • Nellore, …, Collado-Torres, et al. bioRxiv (2016) doi:10.1101/038224

- intropolis.rail.bio

•  Jaffe, Shin, Collado-Torres, et al. Nat. Neurosci. (2015) doi:10.1038/nn.3898 –  https://github.com/lcolladotor/libd_n36 –  https://github.com/lcolladotor/enrichedRanges

•  Frazee, et al. Biostatistics. (2014) doi:10.1093/biostatistics/kxt053 –  https://github.com/leekgroup/derfinder