High Throughput Sequencing Technologies

UCD Genome Center Bioinformatics Core

Monday 15 June 2015

Sequencing Explosion

UC Davis Genome Center | Bioinformatics Core | J Fass Sequencing Technologies 2015-06-15

www.genome.gov/sequencingcosts

http://t.co/Ka5cVGhdqo

Sequencing Explosion

adapted from Mardis 2011 Nature 470:198

PacBio ... Oxford Nanopore (2014-15)

2011 2012 2013

Current Sequencing Technologies

● Sanger
● (Roche) 454
● Illumina
● SOLiD
● PacBio
● Ion Torrent
● Complete Genomics (BGI) … “Revolocity”
● Moleculo (Illumina) … “TruSeq Synthetic Long Reads”
● Oxford Nanopore
● BioNano Genomics
● 10X Genomics
● Dovetail Genomics

Sanger Sequencing

● ddNTPs (with fluorescent labels) are incorporated (along with unlabeled dNTPs) in the amplification step, so that some molecules terminate at every position

● Gel / capillary electrophoresis orders molecules by length

● Fluorescent label (color) indicates terminal base identity at each position

● Read colors, in order, to derive sequence
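The four bullets above can be mimicked in a few lines of Python. This is a toy sketch rather than a model of a real instrument: the example strand, the function names `terminated_fragments` and `read_gel`, and the assumption of exactly one terminated fragment per position are all illustrative.

```python
# Toy model of Sanger chain-termination sequencing (illustrative only).
import random


def terminated_fragments(synthesized):
    """Return (length, terminal_base) for one fragment terminated at each position.

    `synthesized` is the strand built by the polymerase; a labeled ddNTP
    stops extension, so each fragment ends in a known, colored base.
    """
    return [(i + 1, base) for i, base in enumerate(synthesized)]


def read_gel(fragments):
    """Order fragments by length (electrophoresis) and read the terminal labels."""
    return "".join(base for _, base in sorted(fragments))


if __name__ == "__main__":
    strand = "GATTACA"      # hypothetical synthesized strand
    frags = terminated_fragments(strand)
    random.shuffle(frags)   # fragments are mixed together in the reaction
    print(read_gel(frags))  # GATTACA
```

Sorting by length stands in for electrophoresis; reading the terminal base of each fragment in that order recovers the sequence.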

2011 2012

http://users.rcn.com/jkimball.ma.ultranet/BiologyPages/D/DNAsequencing.html

http://www.jgi.doe.gov/sequencing/education/how/how_10.html

Illumina

Illumina MiSeq

Illumina HiSeq 2000 / 2500

Illumina

8 "Lanes" (two flow cells per HiSeq) (one-lane FC for MiSeq)

Surface of flow cell is coated with a lawn of oligo pairs ...

Illumina

Cluster Generation: Hybridize Fragments & Extend

Adapter Sequence (contains primer)

● Millions of single molecules hybridize to the lawn of adapters

● dsDNA extended by polymerases

Illumina

Cluster Generation: Denature Double-stranded DNA

● dsDNA is denatured

● Original template fragment washed away

● Newly synthesized strand is covalently bound to flow cell

New strand

Original strand

discard

Illumina

Cluster Generation: Covalently-Bound, Randomly Dispersed Single Molecules

● The resulting DNA fragments are covalently bound to the flow cell surface in a random pattern

Illumina

Cluster Generation: Bridge Amplification

● Single strand flops over to hybridize to an adjacent adapter, forming a bridge

● dsDNA synthesized from primer in hybridized adapter

Illumina

Cluster Generation: Bridge Amplification

● dsDNA bridge now formed

● each strand covalently bound to different adapter

Illumina

Cluster Generation: Bridge Amplification

● dsDNA bridge is denatured

Illumina

Cluster Generation: Bridge Amplification

● Single strands flop over to hybridize to adjacent adapters, forming bridges

● dsDNA synthesized by polymerases

Illumina

Cluster Generation: Bridge Amplification

● Bridge amplification cycles repeated many times
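As a rough upper bound on what repeated cycles achieve, each bridge cycle can at best double the number of bound strands in a cluster. The sketch below assumes that idealized doubling; real clusters are limited by local primer density and incomplete efficiency, and the function name `cluster_size` and its `efficiency` parameter are hypothetical.

```python
# Idealized growth of one cluster under repeated bridge amplification cycles.

def cluster_size(cycles, efficiency=1.0):
    """Return the strand count after `cycles`, starting from a single molecule.

    `efficiency` is the assumed fraction of strands copied per cycle
    (1.0 means perfect doubling, which real chemistry does not reach).
    """
    strands = 1.0
    for _ in range(cycles):
        strands += strands * efficiency  # each copied strand adds one new strand
    return strands


if __name__ == "__main__":
    print(cluster_size(10))  # 1024.0
```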

Illumina

Cluster Generation

● dsDNA bridges denatured

● Strands in one of the orientations cleaved and washed away

Illumina

Cluster Generation

● resulting cluster has ssDNA in only one orientation

Illumina

Cluster Generation

● Free 3'-ends blocked to prevent unwanted priming

Illumina

Sequencing By Synthesis

● Sequencing primer is hybridized to adapter sequence, starting Sequencing By Synthesis

Sequencing primer

Illumina

Sequencing By Synthesis

Cycle four Fl-NTPs + polymerases

Image incorporated Fl-NTPs

Cleave terminator and dye

X 36-150 (HiSeq), 250 (MiSeq) ...
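The cycle above (add labeled terminator nucleotides, image, cleave) can be sketched as a loop that reads one base per cycle. This is an idealization under stated assumptions: a single error-free template, no phasing, and hypothetical names (`sequence_by_synthesis`, `COMPLEMENT`). The mapping from dye color to base is omitted.

```python
# Idealized sequencing-by-synthesis loop: one base incorporated and imaged per cycle.

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def sequence_by_synthesis(template, cycles):
    """Return the read produced after `cycles` rounds of incorporate/image/cleave.

    `template` is the bound strand written in the order the polymerase
    traverses it; the read is the complementary strand being synthesized.
    """
    read = []
    for cycle in range(min(cycles, len(template))):
        incorporated = COMPLEMENT[template[cycle]]  # terminator allows one base only
        read.append(incorporated)                   # "image" the fluorescent label
        # cleaving terminator and dye frees the 3' end for the next cycle
    return "".join(read)


if __name__ == "__main__":
    print(sequence_by_synthesis("TGCATGCA", cycles=5))  # ACGTA
```

The number of cycles sets the read length, which is why the slide quotes a cycle count per instrument.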

Illumina

Paired-end sequencing

● Bridge amplification to generate strands with opposite orientation

Illumina

Paired-end sequencing

● dsDNA bridges denatured

● Strands in already sequenced orientation cleaved and washed away

Illumina

Paired-end sequencing

● strands with uniform orientation, opposite that in first read

Illumina

Paired-end sequencing

● Free 3'-ends blocked to prevent unwanted priming

Illumina

Paired-end sequencing

● Sequencing primer is hybridized to other adapter sequence, starting second read's Sequencing By Synthesis

Sequencing primer
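The paired-end steps imply that the second read comes from the opposite strand, starting at the opposite end of the same fragment. A minimal sketch, assuming an error-free fragment and hypothetical helper names (`reverse_complement`, `paired_end_reads`):

```python
# Minimal paired-end model: read 1 from one end, read 2 from the other strand.

_COMP = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(_COMP)[::-1]


def paired_end_reads(fragment, read_length):
    """Return (read1, read2) for a fragment given in 5'->3' orientation.

    read1 is the first `read_length` bases of the fragment; read2 is the
    first `read_length` bases of the reverse-complement strand.
    """
    read1 = fragment[:read_length]
    read2 = reverse_complement(fragment)[:read_length]
    return read1, read2


if __name__ == "__main__":
    print(paired_end_reads("AACCGGTTAC", 4))  # ('AACC', 'GTAA')
```

When the fragment is longer than twice the read length, the middle of the fragment is never sequenced, but the two reads are known to face each other at a roughly known distance.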

Illumina

http://systems.illumina.com/systems/sequencing.ilmn

PacBio

PacBio RS (Real-time Sequencer)

Polymerase / DNA complex adhered to bottom of imaging well (Zero Mode Waveguide) ... evanescent wave illuminates tiny volume around polymerase.

PacBio RS (Real-time Sequencer)

Fluorescently-tagged nucleotides are only seen (for an appreciable amount of time) when associated with polymerase. Persistent time in the excitation volume can be recognized as a "pulse."
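One way to picture pulse detection is as a threshold on signal duration: freely diffusing nucleotides produce only brief flickers, while a nucleotide held by the polymerase stays in the excitation volume for many frames. The sketch below is an illustration under invented assumptions, not PacBio's actual pulse caller; the per-frame channel labels, the `min_frames` value, and the function name `call_pulses` are all made up for the example.

```python
# Toy pulse caller: keep only runs of the same channel that persist long enough.
from itertools import groupby


def call_pulses(frames, min_frames=3):
    """Return the bases whose signal persists for at least `min_frames` frames.

    `frames` holds one label per camera frame: a base letter for the
    fluorescent channel that is lit, or '-' for background.
    """
    calls = []
    for label, run in groupby(frames):
        duration = sum(1 for _ in run)
        if label != "-" and duration >= min_frames:
            calls.append(label)  # persistent signal: treated as an incorporation
    return "".join(calls)


if __name__ == "__main__":
    trace = "--AAAA-C--GGG-T-TTTT--"
    print(call_pulses(trace))  # AGT
```

The single-frame `C` and `T` flickers are discarded, which mirrors the slide's point that only polymerase-associated nucleotides are seen for an appreciable time.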

PacBio RS (Real-time Sequencer)

Science, 02 January 2009, doi:10.1126/science.1162986

PacBio SMRTbell Construct

PacBio Sequencing

PacBio detection of modified bases

Movie → trace → pulse timing can reveal nucleotide modification, e.g. N6-methyladenosine
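Pulse timing is commonly summarized as the inter-pulse duration (IPD) at each template position, compared against an unmodified control. The sketch below shows that comparison in its simplest form; the durations are made up, the threshold of 2.0 is arbitrary, and the function names `ipd_ratios` and `flag_modified` are hypothetical rather than part of any PacBio software.

```python
# Toy kinetic comparison: ratio of mean inter-pulse durations, sample vs. control.
from statistics import mean


def ipd_ratios(sample_ipds, control_ipds):
    """Return per-position ratio of mean inter-pulse duration (sample / control)."""
    return [mean(s) / mean(c) for s, c in zip(sample_ipds, control_ipds)]


def flag_modified(ratios, threshold=2.0):
    """Return 0-based positions whose ratio meets an arbitrary threshold."""
    return [i for i, r in enumerate(ratios) if r >= threshold]


if __name__ == "__main__":
    # Made-up durations (seconds) from several passes over three positions.
    sample = [[0.2, 0.3], [1.0, 1.4], [0.25, 0.25]]
    control = [[0.25, 0.25], [0.3, 0.3], [0.2, 0.3]]
    print(flag_modified(ipd_ratios(sample, control)))  # [1]
```

A real analysis would need many observations per position and a statistical test rather than a fixed cutoff.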

PacBio accuracy

Oxford Nanopore

Oxford Nanopore MinION

Oxford Nanopore PromethION

Oxford Nanopore GridION

Oxford Nanopore

hairpin

“1D” read (if stopped here)

on its way to becoming a “2D” read

dsDNA (green & purple strands)

pore protein

semi-conductor layer

Oxford Nanopore

hairpin

dsDNA (green & purple strands)

Five or six nucleotides are “read” at once; basecalls depend on a hidden Markov model (HMM)
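Because several nucleotides occupy the pore at once, each signal level corresponds to a k-mer, and consecutive k-mers overlap by all but one base. The sketch below shows only that decomposition and the reverse step of rebuilding a sequence from a path of k-mers; it is not an HMM basecaller, and the function names and the choice of k = 5 are illustrative assumptions.

```python
# Overlapping k-mers: the unit a nanopore signal level corresponds to in this toy model.

def kmers(seq, k=5):
    """Return every overlapping k-mer of `seq`, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def sequence_from_kmers(path):
    """Rebuild a sequence from consecutive k-mers that each advance by one base."""
    if not path:
        return ""
    seq = path[0]
    for prev, cur in zip(path, path[1:]):
        if prev[1:] != cur[:-1]:
            raise ValueError(f"{prev} -> {cur} is not a one-base step")
        seq += cur[-1]
    return seq


if __name__ == "__main__":
    states = kmers("GATTACAGA")
    print(states)                       # ['GATTA', 'ATTAC', 'TTACA', 'TACAG', 'ACAGA']
    print(sequence_from_kmers(states))  # GATTACAGA
```

In an HMM framing, the k-mers play the role of hidden states and the one-base-step constraint restricts which state transitions are allowed.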

Oxford Nanopore

Whole genome shotgun sequence data and assembly of E. coli MG1655 strain, published on GigaDB and bioRxiv by Nick Loman et al.

Mean read length ~6 Kbp

Max read length ~40 Kbp

Errors largely indels … error rate comparable to PacBio in some hands!

Single contig de novo assembly of E. coli MG1655, polished to 99.4% accuracy
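To put the 99.4% figure in familiar units, an accuracy can be converted to a Phred-scaled quality, Q = -10 log10(error rate). The sketch below does that conversion and estimates an error count; it treats the quoted accuracy as a uniform per-base error rate, which is a simplifying assumption, and the 4.6 Mbp genome length is an approximate value supplied for illustration.

```python
# Convert a consensus accuracy into a Phred-scaled quality and an expected error count.
import math


def phred_quality(accuracy):
    """Return Q = -10 * log10(error rate) for an accuracy strictly between 0 and 1."""
    return -10 * math.log10(1 - accuracy)


def expected_errors(genome_length, accuracy):
    """Return the expected number of erroneous bases in an assembly of this length."""
    return genome_length * (1 - accuracy)


if __name__ == "__main__":
    print(round(phred_quality(0.994), 1))  # 22.2
    # 4.6 Mbp is an approximate E. coli genome size, used here as an assumption.
    print(round(expected_errors(4_600_000, 0.994)))
```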

10X Genomics GemCode Platform

Produces up to 750k sets of “linked reads”; each pair of reads with the same barcode came from the same long DNA molecule, allowing haplotype resolution (video).

10X Genomics GemCode Platform

Each gel bead contains many copies of the same barcode, in molecules ready for Illumina library preparation.

Microfluidics encapsulates each bead in a reagent bubble in an oil stream; each bubble ideally contains one long DNA fragment (depends on correct titration).

10X Genomics GemCode Platform

Bead / reagent bubbles are collectively temperature cycled, priming short barcoded PCR fragments from random locations on the trapped long DNA fragments. Each PCR fragment is ready for Illumina library preparation.

After fragment generation, bubbles are burst and Illumina library preparation is completed. Pooled, barcoded fragments are then sequenced on an Illumina instrument.

10X Genomics GemCode Platform

All reads with the same barcode are “linked”; they come from the same original molecule.

Linked reads can thus show haplotype-specific sequence, from SNPs/indels to larger structural variants (SVs).
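The linking idea can be sketched by grouping allele observations by barcode, so that alleles seen far apart on the same molecule end up in the same haplotype. This is a minimal illustration, not the GemCode software: it assumes exactly one molecule per barcode (the ideal case described above) and error-free reads, and the tuple layout and the function name `molecule_haplotypes` are hypothetical.

```python
# Group allele observations by barcode to recover per-molecule haplotypes.
from collections import defaultdict


def molecule_haplotypes(observations):
    """Merge (barcode, site, allele) observations into one allele map per barcode."""
    haplotypes = defaultdict(dict)
    for barcode, site, allele in observations:
        haplotypes[barcode][site] = allele
    return dict(haplotypes)


if __name__ == "__main__":
    # Made-up observations at two heterozygous sites 4.9 kb apart.
    reads = [
        ("BC1", 100, "A"),
        ("BC2", 100, "G"),
        ("BC1", 5000, "T"),
        ("BC2", 5000, "C"),
    ]
    print(molecule_haplotypes(reads))
    # {'BC1': {100: 'A', 5000: 'T'}, 'BC2': {100: 'G', 5000: 'C'}}
```

Short reads alone could not tell whether `A` at site 100 travels with `T` or `C` at site 5000; the shared barcode supplies that phase information.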

Dovetail Genomics

Uses the Hi-C method in vitro to capture clean profiles of chimeric reads, with a frequency that falls off ~logarithmically with distance.

Dovetail Genomics

Long, naked DNA fragments, in vitro, are combined with engineered histones to coil DNA cleanly (without in vivo “signal”). DNA is cross-linked, cut, ends filled in, and ligated to create chimeric molecules, which are then enriched and go into Illumina sequencing.

Dovetail Genomics

cHiCago libraries assess distances well beyond other technologies. Combined with Illumina contigs (DiscoDove = DISCOVAR + Dovetail, or MERACULOUS + HiRise), scaffolds of tens of Mbp have been achieved.

Sequencing Explosion

adapted from Shokralla 2012 Molecular Ecology 21:1794

Oxford Nanopore 2014/5

2013

PacBio RS II

Tech Comparison

Ryan Kim, ~Dec. 2012

Tech Comparison

● Non-technology considerations
  ○ error modes related to application
  ○ single-molecule preferred?
    ■ novel isoforms
    ■ haplotype resolution (phasing)
    ■ base modification

● Local expertise
  ○ library prep
  ○ secondary analysis

● Availability / turnaround time