spin workshop microbial genomics @nist

34
Nate Olson Genome Scale Measurements Biosystems and Biomaterials Division Microbial Genomics @ NIST

Upload: nathan-olson

Post on 14-Jul-2015

93 views

Category:

Science


3 download

TRANSCRIPT

Page 1: SPIN Workshop Microbial Genomics @NIST

Nate OlsonGenome Scale MeasurementsBiosystems and Biomaterials Division

Microbial Genomics @ NIST

Page 2: SPIN Workshop Microbial Genomics @NIST

● Example Microbial and Genomic Programs○ ERCC spike in controls○ Genome In a Bottle○ Biothreat Detection

● Three Microbial Genomics Projects○ Genomic Purity○ SNP Method Evaluation○ Genomic Reference Materials

Talk Overview

DisclaimerOpinions expressed in this paper are the authors’ and do not necessarily reflect the policies and views of DHS, NIST, or affiliated venues. Certain commercial equipment, instruments, or materials are identified in this paper in order to specify the experimental procedure adequately. Such identification is not intended to imply recommendations or endorsement by NIST, nor is it intended to imply that the materials or equipment identified are necessarily the best available for the purpose.

Microbial Genomics @ NIST

Page 3: SPIN Workshop Microbial Genomics @NIST

External RNA Control Consortium

Advancing Genomic and Biothreat Detection Metrology

Biothreat Detection

Genome In A Bottle Consortium

Page 4: SPIN Workshop Microbial Genomics @NIST

ExternalRNAControlConsortium

Is the apparent difference between samples biological or an artifact?

Page 5: SPIN Workshop Microbial Genomics @NIST

External RNA Control Consortium

Spike in controls to assess technical performance

Page 6: SPIN Workshop Microbial Genomics @NIST

External RNA Control Consortium

https://github.com/usnistgov/erccdashboard

ERCC Dashboard facilitates RM use

Page 7: SPIN Workshop Microbial Genomics @NIST

Is this genetic mutation real or an artifact?

Genome In A Bottle Consortium

Page 8: SPIN Workshop Microbial Genomics @NIST

http://genomeinabottle.org/

Genome In A Bottle

RM to challenge measurement process

Page 9: SPIN Workshop Microbial Genomics @NIST

http://www.bioplanet.com/gcat

Genome In A Bottle

High confidence variants used for algorithm benchmarking

Page 10: SPIN Workshop Microbial Genomics @NIST

Biothreat Detection

Is this suspicious powder a biothreat agent, or a false alarm?

Page 11: SPIN Workshop Microbial Genomics @NIST

Biothreat Detection

Surrogate material to support first responder training

Page 12: SPIN Workshop Microbial Genomics @NIST

Biothreat Detection

Engineered yeast as surrogate for biothreat agents

Page 13: SPIN Workshop Microbial Genomics @NIST

Address measurement challenges with reference materials and

documentary standards.

Summary

Page 14: SPIN Workshop Microbial Genomics @NIST

Microbial Genomics

Microbial Sample Characterization● Genomic Purity - DHS● Evaluating SNP calling methods - DHS● Microbial Genomic RMs - FDA

Page 15: SPIN Workshop Microbial Genomics @NIST

Microbial Genomics Purity

Challenge: Identify low levels of contaminants without knowing their identity.

Page 16: SPIN Workshop Microbial Genomics @NIST

Genomic Purity

Approach:Taxonomic read classification

paired with NGS

Page 17: SPIN Workshop Microbial Genomics @NIST

Experimental Design

Genomic Purity

● Seven Organisms○ Bacillus anthracis○ Escherichia coli O157:H7○ Francisella tularensis○ Pseudomonas aeruginosa○ Salmonella enterica○ Staphylococcus aureus○ Yersinia pestis

● Simulated Datasets○ Illumina error profile○ 250 paired end reads○ 20 X coverage

● 336 spiked datasets○ Pairwise combinations○ Contaminant concentrations

5% to 2.5 x 10-4 % of cells● Pathoscope used for read

classification (http://sourceforge.net/projects/pathoscope/)○ Database - Genbank bacterial

genomes

In-Silico Experiment

Page 18: SPIN Workshop Microbial Genomics @NIST

Contaminant

Genomic Purity

Only Contaminant and Sample Genus Identified

Page 19: SPIN Workshop Microbial Genomics @NIST

Genomic Purity

Only Contaminant and Sample Genus Identified

Page 20: SPIN Workshop Microbial Genomics @NIST

Detected contaminants down to 5.0 x 10-4 % of cells

● Sensitivity Dependent on○ Relative size of the sample and

contaminant genomes○ Genetic similarity of sample and

contaminant to other organisms in the database

Genomic Purity

Page 21: SPIN Workshop Microbial Genomics @NIST

Able to Detect Contaminants at less than 1% Cell Concentrations for Most Pairwise Comparisons

Genomic Purity

Page 22: SPIN Workshop Microbial Genomics @NIST

Conclusions● Next generation sequencing in conjunction with read classification

algorithms can be used to assess sample purity

● Achieved ○ Genus level classification specificity○ With sensitivity ranging from 5% to 2.5 x 10-4%

● Future work includes further validation of the method using real mixtures

Genomic Purity

Page 23: SPIN Workshop Microbial Genomics @NIST

SNP Method Evaluation

Challenge: Defining confidence in sample identification

Page 24: SPIN Workshop Microbial Genomics @NIST

SNP Method Evaluation

Approach: Whole genome (SNP) sample identification

Page 25: SPIN Workshop Microbial Genomics @NIST

SNP Method Evaluation Requirements

1. Reference with known truth○ Genomic DNA○ Data

■ real vs. simulated2. Performance metrics3. Replicates for assessing

uncertainty○ multiple sequencing runs○ bootstrap replicates ○ multiple reference genomes

SNP Method Evaluation

Page 26: SPIN Workshop Microbial Genomics @NIST

Truth Tables can be static or dynamic

SNP Method Evaluation

Page 27: SPIN Workshop Microbial Genomics @NIST

Truth Table Values Used to Calculate Performance Metrics

SNP Method Evaluation

Quality Score Algorithm

Page 28: SPIN Workshop Microbial Genomics @NIST

Conclusions● Three requirements for evaluation

○ Reference with known truth○ Performance metrics○ Replicates for uncertainty assessment

● Working to develop tools for implementing these requirements

● Application of these requirements will help to establish confidence in SNP based sample identification

SNP Method Evaluation

Page 29: SPIN Workshop Microbial Genomics @NIST

Genomic Reference Materials

Development of Microbial Reference Materials

Page 30: SPIN Workshop Microbial Genomics @NIST

Strains selected based on public health relevance and GC content

Genomic Reference Materials

Page 31: SPIN Workshop Microbial Genomics @NIST

Orthogonal Methods used to Characterize Genome Structure, Sequence, and Purity

Genomic Reference Materials

Page 32: SPIN Workshop Microbial Genomics @NIST

● Microbial genomic reference materials characterized○ Genome Structure○ Sequence○ Purity○ Stability

● Material and data will help validate pathogen detection assays as well as sequencing and bioinformatic workflows.

Genomic Reference Materials

Conclusions

Page 33: SPIN Workshop Microbial Genomics @NIST

Developing a measurement infrastructure to support genome-

based characterization of microbial samples.

Microbial Genomics @ NIST

Summary

Page 34: SPIN Workshop Microbial Genomics @NIST

● Biosystems and Biomaterials Division● Genome Scale Measurements

Genomic Purity● Justin Zook● Nancy Lin

MIcrobial Genomic Reference Materials● Marc Salit● Justin Zook● Steven Lund● Scott Jackson● Marc Allard and others at FDA● Heike Sichtig at the FDA

SNP Calling Method Evaluation● NIST

○ Jayne Morrow○ Justin Zook○ Steven Lund○ Nancy Lin○ Marc Salit

● Northern Arizona University○ Jim Schrup○ Becky Coleman○ Jason Sahl○ Paul Keim

● University of New Hampshire○ Jeff Foster

This work was supported by the Department of Homeland Security (DHS) Science and Technology Directorate under the Interagency Agreement HSHQPM-12-X-00078 with NIST and by two interagency agreements with the FDA.

Acknowledgements