De Novo Sequencing vs. Database Search

Bin Ma
School of Computer Science, University of Waterloo, Ontario, Canada


Post on 22-Feb-2016


Page 1

De Novo Sequencing vs. Database Search

Bin Ma
School of Computer Science
University of Waterloo
Ontario, Canada

Page 2

Two Computational Approaches

• Database search: MS/MS spectra + protein DB → peptides
• De novo sequencing: MS/MS spectra → peptides

Page 3
Page 4

Is This a True Peptide Identification?

Page 5

Take Advantage of Both Approaches

All Spectra → DB search → Assigned?

• Yes → DB peptides
• No → De Novo → novel peptides
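The flow on this slide can be sketched in a few lines of Python. This is only an illustration of the routing logic, not how any particular tool implements it: `route_spectra`, `db_search` and `de_novo` are hypothetical names, and the two search functions are stand-ins supplied by the caller.

```python
from typing import Callable, Dict, Hashable, Optional, Tuple, TypeVar

Spectrum = TypeVar("Spectrum")


def route_spectra(
    spectra: Dict[Hashable, Spectrum],
    db_search: Callable[[Spectrum], Optional[str]],
    de_novo: Callable[[Spectrum], str],
) -> Tuple[Dict[Hashable, str], Dict[Hashable, str]]:
    """Send every spectrum to DB search first; fall back to de novo.

    `db_search` is assumed to return None when it cannot assign a peptide.
    Both callables are hypothetical placeholders for real engines.
    """
    db_peptides: Dict[Hashable, str] = {}
    novel_peptides: Dict[Hashable, str] = {}
    for spectrum_id, spectrum in spectra.items():
        peptide = db_search(spectrum)
        if peptide is not None:
            # "Assigned? Yes": keep the database peptide.
            db_peptides[spectrum_id] = peptide
        else:
            # "Assigned? No": hand the spectrum to de novo sequencing.
            novel_peptides[spectrum_id] = de_novo(spectrum)
    return db_peptides, novel_peptides
```

With toy stand-ins, a call such as `route_spectra({"s1": "a", "s2": "b"}, {"a": "PEPTIDE"}.get, lambda s: "NOVELSEQ")` places `s1` among the DB peptides and `s2` among the novel peptides.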

Page 6

Facts and Fallacies

1. De novo is harder.
• True. It can be regarded as a “universal” DB search.

2. De novo is slow.
• False. PEAKS processes 15 spectra/sec on a PC.

3. De novo is less reliable.
• Depends.

Page 7

Goals of This Talk

• A “fair” comparison
• A “real” combination

Page 8

What Is a Fair Comparison?

• The ability to report a long sequence tag, long enough to uniquely identify the correct peptide from the same database.
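As a rough illustration of what "long enough to uniquely identify" means, the sketch below checks whether a tag occurs in exactly one peptide of a database. The database is assumed to be a plain list of peptide strings, and the function names and example sequences are made up for this example.

```python
from typing import List


def peptides_containing(tag: str, database: List[str]) -> List[str]:
    """Return every database peptide that contains the tag as a substring."""
    return [peptide for peptide in database if tag in peptide]


def is_unique_tag(tag: str, database: List[str]) -> bool:
    """A tag identifies a peptide uniquely if exactly one peptide contains it."""
    return len(peptides_containing(tag, database)) == 1


# Toy database (hypothetical peptides, for illustration only).
toy_database = ["LGEYGFQNALIVR", "DLGEEHFK", "YLYEIAR"]

short_tag_ok = is_unique_tag("GE", toy_database)     # matches two peptides
long_tag_ok = is_unique_tag("GEYGF", toy_database)   # matches only one
```

Here the two-residue tag `GE` is ambiguous, while the five-residue tag `GEYGF` selects a single peptide, which is the intuition behind asking for long tags.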

Page 9

Experiment 1.

• Purpose: to check the length of correct de novo tags on good quality spectra.

• Trypsin digest of human tumour cell; LTQ-Orbitrap CID.

• Mascot DB search; 1% FDR.

• Compare PEAKS de novo sequences with DB search results.
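One simple way to compare a de novo sequence with a DB search result is to measure the longest run of consecutive residues the two share. The slides do not say exactly how "correct" residues were counted, so the sketch below is an assumption: it uses the longest common substring and treats I and L as equal, since leucine and isoleucine have the same mass. A stricter comparison would also check that the matched residues sit at the same mass position.

```python
def normalize(sequence: str) -> str:
    """Upper-case the sequence and merge I into L (isobaric residues)."""
    return sequence.upper().replace("I", "L")


def longest_common_tag(de_novo: str, db_peptide: str) -> int:
    """Length of the longest stretch of consecutive residues shared by both.

    Classic longest-common-substring dynamic programme, keeping one row.
    """
    a, b = normalize(de_novo), normalize(db_peptide)
    best = 0
    previous = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        current = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the run that ended at the previous pair of residues.
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


# Hypothetical example: the first two residues are swapped in the de novo result.
tag_length = longest_common_tag("GLEYGFQNALIVR", "LGEYGFQNALIVR")
```

In this made-up example the de novo result gets the first two residues in the wrong order but the remaining eleven residues agree, so `tag_length` is 11.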

Page 10

Distribution of Correct Tag Length

[Figure: bar chart. x-axis: # of correct amino acids found by PEAKS de novo sequencing (≥5, ≥6, ≥7, ≥8, ≥9, ≥10); y-axis: percentage in Mascot high-confidence PSMs (0% to 100%).]

Page 11

“De Novo Only” Peptides

Page 12

Experiment 1 Conclusion.

• De novo sequencing performs well in terms of finding long sequence tags.

• De novo also finds “de novo only” peptides.

Page 13

Experiment 2.

• Purpose: to test the ability of de novo tags to identify peptides from the database.

• For each de novo sequence, report the “best-matching” peptide from the database.

• Use target-decoy to determine the FDR.

Page 14

Experiment 2 Conclusion

• De novo tags show a better, and complementary, ability to select the correct peptides from the DB.

• This might be due to the choice of the software.

Page 15

Experiment 3. De Novo Assisted DB Search

[Figure: scatter plot. Axes: Mascot Score; # matched amino acids between de novo & DB search. Annotation: best separation line, x+4y.]
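The separation line suggests a combined score of the form x + 4y. The sketch below assumes that x is the Mascot score and y is the number of amino acids matched between the de novo and DB search results, and that matches on or above the line are accepted; the slide does not spell out either point, and the cutoff value is a hypothetical parameter.

```python
def combined_score(mascot_score: float, matched_amino_acids: int) -> float:
    """x + 4y, assuming x = Mascot score and y = # matched amino acids."""
    return mascot_score + 4 * matched_amino_acids


def accept(mascot_score: float, matched_amino_acids: int, cutoff: float) -> bool:
    """Accept a match whose combined score reaches the (hypothetical) cutoff."""
    return combined_score(mascot_score, matched_amino_acids) >= cutoff


# Two made-up matches with the same Mascot score but different de novo agreement.
well_supported = accept(30.0, 5, cutoff=45.0)    # 30 + 4*5 = 50
poorly_supported = accept(30.0, 2, cutoff=45.0)  # 30 + 4*2 = 38
```

Under these assumptions, two matches with the same Mascot score can end up on opposite sides of the line depending on how well the de novo sequence agrees with the database peptide.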

Page 16

[Figure: FDR (0.0% to 2.5%) versus # of PSM (0 to 4000). Series: Mascot, PEAKS DB.]

Page 17

Conclusion

De novo both helps to improve DB search and reports novel peptides.

• MS/MS Spectra → De Novo and DB search; De Novo improves DB search.
• Found? yes → Improved DB search results
• Found? no → De novo only peptides

This is the default workflow of PEAKS 5.3.