updated: new high throughput sequencing technologies at the norwegian sequencing centre - and beyond

58
New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond Lex Nederbragt, NSC and CEES [email protected]

Upload: lex-nederbragt

Post on 10-May-2015

2.466 views

Category:

Technology


2 download

DESCRIPTION

Un update of the previous talk with the same title. A talk I gave at the Computational Life Science initiative (University of Oslo) about new High Throughput Sequencing instruments at the Norwegian Sequencing Centre. I also mentioned future upgrades, and the upcoming nanopore sequencing platform of Oxford nanopore.

TRANSCRIPT

Page 1: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

New High Throughput Sequencing technologies at the Norwegian Sequencing Centre

- and beyond

Lex Nederbragt, NSC and [email protected]

Page 2: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Norwegian Sequencing Centre

National technology core facility

Page 3: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Norwegian Sequencing Centre

Offering sequencing services

GS FLX from Roche/454 HiSeq 2000 from Illumina

Page 4: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

1) New instruments:

Benchtop sequencers

Page 5: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Phase 1: more is better

2005 GS20 200 000 reads 100 bp0.02 Gb/run

2011 GS FLX+ 1.2 million reads 750 bp0.7 Gb/run

2006 GA 28 million reads 25 bp0.7 Gb/run

2011 HiSeq 2000 3 billion reads 2x100 bp600 Gb/run

Page 6: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Phase 2: smaller is better

GS Junior from Roche/454

MiSeq from Illumina

PGM from Ion Torrent/Life Technologies

0.04 GB/run400 bp reads

0.7 GB/run700 bp reads

2 GB/run2x150 bp reads

600 GB/run2x100 bp reads

0.01, 0.1 or 1 GB/run100 or 200 bp reads

Page 7: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Why benchtop sequencing instruments?

10 hours

27 hours

23 hours

10 days

3 hours

GS Junior from Roche/454

MiSeq from Illumina

PGM from Ion Torrent/Life Technologies

Page 8: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Why benchtop sequencing instruments?

Affordable price per instrument Small projects

Diagnostics

Fast turn around time

http://pennystockalerts.com/ http://www.highqualitylinkbuildingservice.com/http://www.vetlearn.com/ http://vanillajava.blogspot.com

Page 9: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Benchtop sequencing instruments at the NSC

✖✔

Page 10: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Sequencing technology

Page 11: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Sequencing technology

Reversible dye terminator sequencing-by-synthesis

Reversible terminators

Metzker 2010 Nat Rev Genet.11(1):31-46

Page 12: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Sequencing technology

Pyrosequencing

PPi: pyrophosphate

Page 13: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Sequencing technology

Free proton current shift sequencing-by-synthesis

Page 14: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Error profiles

Substitution errors

ACGTAGCTGATTTTAGGCTTAGCTACGTAGCTGCTTTTAGGCTTAGCT

Homopolymer errors

ACGTAGCTGATTTTAGGCTTAGCTACGTAGCTGATTT-AGGCTTAGCT

Page 15: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Applications

Platform 454 Illumina HiSeq

Illumina MiSeq Ion Torrent

resequencing - +++ ++ +++de novo +++ + + ++

metagenomics +++ ++ + ++

mRNA ++ +++ ++ +

miRNA - +++ +++ -

ChIP - +++ ++ -

DNA meth - +++ + -

Page 16: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Announced upgrades

750 bp reads?

2x 250 bp reads8.5 Gbp

Proton from Ion Torrent

Fall 2012: 10 Gb on Proton I chip, 400 bp2013: 4 x more wells on Proton II chip

HiSeq 20002x150 bp?

HiSeq 2500Rapid run mode 27 hrs

2x150 bp, 90 Gbp

400 bp reads

Page 17: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Which instrument to choose?

Page 18: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Shameless self-promotion (1)

flxlexblog.wordpress.com

Page 19: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Shameless self-promotion (1)

@lexnederbragt

Page 20: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

2) New instruments:

Pacific Biosciences RS

Page 21: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Phase 3: single-molecule

C2 (current) chemistry:Average read length 2500 bp36 000 reads90 MB per ‘run’

Page 22: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Technology

Zero-mode waveguide

Page 23: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Technology

Page 24: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Library preparation

Page 25: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Read length

Page 26: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Raw reads and subreads

‘Subreads’

Page 27: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Raw read quality

85-87% accuracyRandom errors (!)

4 to 5 passes: accuracies in the high 90's %5 passes yields average Q30 (1:1000 chance of error)

Page 28: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Applications

Platform 454 Illumina HiSeq

Illumina MiSeq* Ion Torrent PacBio

resequencing - +++ ++ - +de novo +++ + + +++ +++

metagenomics +++ ++ + +++ +/-

mRNA ++ +++ ++ ++ ++

miRNA - +++ +++ - -

ChIP - +++ ++ - -

DNA meth - +++ + - !!!

SNP validation + - - - ++

Page 29: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio and base modifications

Page 30: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

What use for PacBio?

http://www.walker.co.uk/What-Use-Is-a-Moose-9780744578393.aspx

Page 31: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: uses

Short reads high quality

SNP validationShort tandem repeats!

Page 32: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: uses

Long reads low quality

Uses?

85-87% accuracy

Page 33: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: uses

Can long PacBio reads overcome repeats

during de novo assembly?

http://nopsa.hiit.fi/pmg/viewer/photo.php?id=14134

Page 34: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: long read uses

For de novo

http://schatzlab.cshl.edu/presentations/2012-01-17.PAG.SMRTassembly.pdf

For scaffolding

Page 35: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: long read uses

For de novo Error correct with short reads

http://schatzlab.cshl.edu/presentations/2012-01-17.PAG.SMRTassembly.pdf

Page 36: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: long read uses

For de novo

http://schatzlab.cshl.edu/presentations/2012-01-17.PAG.SMRTassembly.pdf

Page 37: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: long read uses

For de novo

Koren et al, 2012

Page 38: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: long read uses

For de novo

Page 39: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: long read uses

For de novo - alternatives

ALLPATHS_LG

Page 40: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Shameless self-promotion (2)

flxlexblog.wordpress.com

Page 41: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

3) Preliminary results:

Pacific Biosciences RS

Page 42: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: first results

Samples

Atlantic cod Fish X

http://www.drawinghowtodraw.com/http://en.wikipedia.org

Page 43: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: first results

Libraries

4kb and 17b insert sizes

Page 44: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: first results

Raw reads

cod 4kb cod 17kb Fish X 4kb Fish X 4kb Fish X 17kb

Fish X 17kb

0

5,000

10,000

15,000

20,000

25,000

30,000

ZMWs

cod 4kb cod 17kb Fish X 4kb Fish X 4kb Fish X 17kb

Fish X 17kb

0500

1,0001,5002,0002,5003,0003,5004,000

Mean readlength

cod 4kb cod 17kb Fish X 4kb Fish X 4kb Fish X 17kb Fish X 17kb0

5,000

10,000

15,000

20,000

25,000

Longest read

Page 45: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: first results

Length of longest subread for all raw reads

Page 46: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: first results

Length of longest subread for all raw reads

Fish XCod

Page 47: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: first results

Mapping to the cod genome

Long insert – single pass

Short insert – many passes

Page 48: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: first results

Mapping to the cod genome

Long insert – single pass

11.4 kbp subread

Page 49: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: first results

Mapping to the cod genome

Long insert – single pass

10.6 kbp subread

Page 50: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

PacBio: first results

Mapping to the cod genome

Long insert – single pass

10.9 kbp subread

Page 51: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

4) Beyond ‘second generation’ sequencing

Page 52: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

High-throughput sequencing

Phase 1: more is better

Phase 2: smaller is better

Phase 3: single-molecule

Phase 4: the sky is the limit?

Page 53: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Nanopore sequencing

Page 54: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Oxford Nanopore

AGBT conference, February 2012

Page 55: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Oxford Nanopore

AGBT conference, February 2012

100 kbp readsCurrently at 4% error1% error at release

GridION2000 nanopores/nodetens of Gb data per 24 hourRun until…20 nodes 1 human genome in 15 minutes

Page 56: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Oxford Nanopore

AGBT conference, February 2012

MinION

512 nanopores 150mb/hour

Up to 6 hours$900

Page 57: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Nanopore sequencing

Seeing is believing

http://uwgcm.org

Page 58: Updated: New High Throughput Sequencing technologies at the Norwegian Sequencing Centre - and beyond

Nanopore sequencing

Democratization of sequencing