transcriptomics and next generation sequencing · next generation sequencing different flavours + +...

29
Transcriptomics and Next Generation Sequencing Centre for Systems Biology Technical University of Denmark January 2010 Ulrik Plate

Upload: others

Post on 12-Aug-2020

20 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Transcriptomics and Next GenerationSequencing

Centre for Systems BiologyTechnical University of Denmark

January 2010

Ulrik Plate

Page 2: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Outline

1. Introduction and background

2. Next Generation Sequencing Technology

3. Sequencing Data

4. Computer Exercises

Page 3: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

1. INTRODUCTION AND BACKGROUND

Transcriptomics and Next GenerationSequencing

Page 4: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

IntroductionThe system approach

• Reductionist vs. system-wide approach

Page 5: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Evolution of TechnologiesGene Transcription

• Northern Blot ⇒

• RT-PCR ⇒

• H.D. Microarrays ⇒

• Next GenerationSequencing ⇒

Single genes

Multiple genes

Whole genomes

Populations ofgenomes

Page 6: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

IntuitionSequencing for Transcriptomics?

• Sample cells• Purify mRNA*• mRNA ⇒ cDNA• Cleaving into 35-400nt fragments

• Sequence fragments in parallel

* or DNA for genomic sequencing

• PCR amplification

Page 7: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Transcriptomics Next Gen. Sequencing

Unique new possibilities

• Known as well as de novo genomes

• Track specific alleles

• Track SNPs & somatic mutations

• Track multiple species or cell lines

• Analyze alternative splicing

Page 8: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Features of TechnologyMicroarrays & N.G. Sequencing

Microarrays N.G.Sequencing

Known genomes All genomes cheaper getting closer Lots of post-proc. Even more post-processing

Page 9: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Where is it Heading?

• Microarray chips get bigger, better and cheaper, but reagent prices don’tchange

• N.G. Sequencing: Speed increases and cost drops exponentially resp.

Page 10: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Progression in N.G. Sequencing

Page 11: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

A $1000 genome?

Page 12: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

High-Throughput

• Sanger: 96 reads < 800-1000b / run

• Solexa: 1.2*106 reads < 75b / run

Page 13: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

2. Next Generation Sequencing

How it really works.

Transcriptomics and Next GenerationSequencing

Page 14: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

• Illumina/Solexa

• Roche 454

• ABI Solid

• Heliscope

and more….

Next Generation SequencingDifferent flavours

Page 15: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

+

+

Next Generation SequencingDifferent flavours – 3 common elements

Page 16: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

TechnologyOverview

1. (c)DNA is fragmented

2. Adaptors ligated to fragments

3. Several possible protocols yield array of PCRcolonies.

4. Enzymatic extension with fluorescently taggednucleotides.

5. Cyclic readout by imaging the array.

Page 17: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Technology (emulsion PCR)1.

• Fragmentation of DNA• Primers are attached to the surface of a bead

Page 18: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

• Fragments, with adaptors, are PCR amplified within a water drop in oil.• One primer is attached to the surface of a bead• 3’ modification of fragments, to covalently bind bead to chip surface.

Technology2. Bead preparation

Page 19: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Technology3. Cycles of elongation

Page 20: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Image of each chemistry cycleis captured by the instrument.After laser excitation, image data is collectedfor each bead/cluster

Technology4. Cycles of imaging

Page 21: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Sequence read over multiple cyclesRepeated cycles of sequencing to determine thesequence of bases in a given fragment, a single base at a time.

TechnologySequence of images

Page 22: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

TechnologyA single transect image

Page 23: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

3. Sequencing Data

Transcriptomics and Next GenerationSequencing

Page 24: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

DataSequencing output

@SEQ_IDGATTTGGGGTTCAAAGCAGTATCGATCAAATAGTAAATCCATTTGTTCAACTCACAGTTT+!''*((((***+))%%%++)(%%%%).1***-+*''))**55CCF>>>>>>CCCCCCC65

• Fastq

• Ascii conversion table

Page 25: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Base CallsRead quality

• Phred quality:

• Solexa quality:

Page 26: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

• 5 minutes just to count the number of lines in Unix• 9 million reads of length ~75• 0.675 billion bases• 135x coverage @ 5 mio. bp genome• 1.5 billion characters in one fastq file• 1.5 gigabyte of data

DataSequencing output

A single Illumina/Solexa run (paired)

Page 27: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Transcriptomics and Next GenerationSequencing

4. Computer Exercises

Page 28: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

DataIssues with N.G. Sequencing

• Larger genes make up bigger proportions of the fragments

• Constant highly expressed genes dominates the fragment pool

• No upper limit on quantification of a single gene as in microarrays

• 16S can constitute up to 90% of all sequenced reads

• Other specific biases we will deal with in the exercises

Page 29: Transcriptomics and Next Generation Sequencing · Next Generation Sequencing Different flavours + + Next Generation Sequencing Different flavours – 3 common elements. Technology

Thank you