signal processing for next-gen sequencing data

SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA RNA-seq CHIP-seq DNAse I-seq FAIRE-seq Peaks Transcripts Gene models Binding sites RIP/CLIP-seq

Upload: fay

Post on 23-Feb-2016

33 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

DESCRIPTION

Peaks. DNAse I- seq. CHIP- seq. SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA. Gene models. RNA- seq. Binding sites. RIP/CLIP- seq. FAIRE- seq. Transcripts. The Power of Next-Gen Sequencing. Chromosome Conformation Capture. + More Experiments. DNA Genome Sequencing - PowerPoint PPT Presentation

TRANSCRIPT

SIGNAL PROCESSING FORNEXT-GEN SEQUENCING DATA

RNA-seq

CHIP-seqDNAse I-seq

FAIRE-seq

Peaks

Transcripts

Gene models

Binding sitesRIP/CLIP-seq

Page 2: SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA

DNAGenome SequencingExome Sequencing

DNAse I hypersensitivity

RNARNA-seq

Protein

Chromosome Conformation Capture

CHART, CHirP, RAP

CLIP, RIP

CRAC PAR-CLIP

iCLIP

CHIP

The Power of Next-Gen Sequencing

+ More Experiments

Page 3: SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA

Next-Gen Sequencing as Signal Data

Map reads (red) to the genome. Whole pieces of DNA are black.

Count # of reads mapping to each DNA base signalRozowsky et al. 2009 Nature Biotech

Page 4: SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA

Read mapping

• Problem: match up to a billion short sequence reads to the genome

• Need sequence alignment algorithm faster than BLAST

Rozowsky et al. 2009 Nature Biotech

Page 5: SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA

Read mapping (sequence alignment)

• Dynamic programming– Optimal, but SLOW

• BLAST– Searches primarily for close matches, still too slow

for high throughput sequence read mapping• Read mapping– Only want very close matches, must be super fast

Page 6: SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA

Index-based short read mappers

Trapnell and Salzberg 2010, Slide adapted from Ray Auerbach

• Similar to BLAST• Map all genomic

locations of all possible short sequences in a hash table

• Check if read subsequences map to adjacent locations in the genome, allowing for up to 1 or 2 mismatches.

• Very memory intensive!

Page 7: SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA

Read Alignment using Burrows-Wheeler Transform

Used in Bowtie, the current most widely used read aligner• Described in Coursera course: Bioinformatics Algorithms (part 1, week 10)

Trapnell and Salzberg 2010, Slide adapted from Ray Auerbach

Page 8: SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA

Read mapping issues

• Multiple mapping• Unmapped reads due to sequencing errors• VERY computationally expensive– Remapping data from The Cancer Genome Atlas

consortium would take 6 CPU years1

• Current methods use heuristics, and are not 100% accurate

• These are open problems

1https://twitter.com/markgerstein/status/396658032169742336

https://twitter.com/markgerstein/status/396658032169742336

Page 9: SIGNAL PROCESSING FOR NEXT-GEN SEQUENCING DATA

Outline

• Finding enriched regions– CHIP-seq: peaks of protein binding

• RNA-seq: going beyond enrichment

• Comparing genome-wide signal information• Application: Predicting gene expression from

transcription factor and histone modification binding

Next gen sequencing translation to clinic

Next Gen Sequencing - · PDF fileAgenda Helping Solve Challenges in NGS DNA Library Prep • Overview of de novo next gen sequencing • Review of DNA fragment library construction

Cytogenetics 101: Clinical Research and Molecular Genetic ... · PDF fileWhat is Next-Gen Sequencing: Brief History • Frederick Sanger (Sanger Sequencing) – “First Generation”

Next-Gen Sequencing in the High School Classroomase.tufts.edu/chemistry/walt/sepa/BosLab/2017-04-11boslab_web.pdf · April 11, 2017 BOSLAB Mark Hartman Next-Gen Sequencing in the

Signal Gen

Next Gen Sequencing based Biodefense Assays: The Need for

Next Gen Sequencing - Lucigen

Signal Processing for DNA Sequencing - Petros T. …boufounos.com/Publications/Boufounos_MEngThesis02.pdf · Signal Processing for DNA Sequencing by ... I am looking forward to their

RNA Sequencing Services - LC Sciences · RNA Sequencing Services 1-888-528-8818 microRNA / Small RNA Sequencing Service Next-gen sequencing is a new method and powerful tool used

Genomics – Next-Gen sequencing and Microarrays Overview By Tim Lawrence

Next-Gen Sequencing Exposes Frequent MED12 Targets in ...mcr.aacrjournals.org/content/molcanres/early/2015/01/14/1541-7786...Next-Gen Sequencing Exposes Frequent MED12 Mutations and

Sequencing Technologies - andrew-michaelson.com · Objectives •Understand Sanger sequencing (first generation) •Understand Next Gen Sequencing •Understand 3rd Generation Nanopore

Signal Processing for DNA Sequencing - RLE at MIT

Next Gen Sequencing Workshop - · PDF fileNext Gen Sequencing Workshop Genome Quebec and Mcgill Innovation Center Maxime Caron and colleagues [email protected] [email protected]

Genome mapping Genome sequencing Next Gen sequencing

Clinical Sequencing by Sanger: State of the Art in a Next-Gen World Aug 2011 Sanger... · 2014-04-10 · Clinical Sequencing by Sanger: State of the Art in a Next-Gen World Elaine

NEXT-GENERATION SEQUENCING AND BIOINFORMATICSmbg.unipv.it/attach/1_next_gen_bioinformatics.pdf · NEXT-GEN SEQUENCING TECHNOLOGIES • Roche/454 FLX • Applied Biosystems SOLiD System

Q UALITY MASSIVELY : EXAMPLES ASPARAGALES POACEAE 1€¦ · and Poaceae. Massively parallel sequencing (MPS, often called next-gen-eration sequencing) technologies have revolutionized

Examining gene expression and methylation with next gen sequencing

Next-Gen Sequencing and data · PDF fileNext-Gen Sequencing and data processing Li-Jun Ma Umass Amherst Fusarium Workshop Asilomar March 15 2011

next-gen sequencing informatics primer - Boston · PDF filenext-gen sequencing informatics primer Adrian Heilbut BU Bioinfo Program Retreat May 22 2010

Clinical Next-Gen Sequencing for Solid Tumors: What, …arup.utah.edu/media/nextGenSolidTumors/Corless-Univ of Utah Grand... · Clinical Next-Gen Sequencing for Solid Tumors: What,

Next Generation Sequencing Technologiessssykim/teaching/f14/slides/NextGenSequencing... · Sequencing • Polymerase ... Mardis, ER; Ann Rev Genom & Hum Gen This can be done with

SMBE 2015: Rapid Identification of Phylogenetically Informative Data from Next-Gen Sequencing

in the era of next-gen genome sequencing: what are we

Lowering Next Gen Sequencing DNA Input Requirements · PDF file Lowering Next Gen Sequencing DNA Input Requirements and Gaining Access to More Samples Rob Brazas, Ph.D. Sr. Product

Sanger vs Next-Gen Sequencing … · Next-Gen Sequencing Workflow Source: Lu and Shen, 2016, Biochemistry, Genetics and Molecular Biology. DOI: 10.5772/61657 • Genome • Whole

Genetic mates SNPedia write-ups Final Next-gen sequencing ...web.stanford.edu/class/gene210/files/lectures/2013/nextgen... · Genetic mates SNPedia write-ups Final Next-gen sequencing

Next Gen. Sequencing Sept. 24, 2008 1 Massively Parallel High Throughput DNA Sequencing: Automation for Microbial Community, Gene Expression and de novo

Ken McGrath - Next Gen Sequencing - Game of Thrones edition

The Future of Next-Gen Sequencing (NGS) · PDF fileIt is my pleasure to introduce Cambridge Healthtech Institute’s The Future of Next-Gen Sequencing market study. For the past 5-6

CT Brown - Doing next-gen sequencing analysis in the cloud

Next Gen sequencing - Denver, · PDF file1/26/15 1 Genome mapping Genome sequencing & Next Gen sequencing Cytogenec Band 5/10*Mb* YACs& ~1&Mb BACs& ~150&Kb& STSmapping ﬁngerprintmapping

Next Gen Sequencing (NGS) Technology Overview

Sequencing Pitfalls - Schatzlabschatzlab.cshl.edu/teaching/2012/2012.URP.Sequencing Lunch.pdf · Zeroth, First, Second Generation 1980s-1990s: 1st Gen Automated Capillary Sequencing