defense of my bsc-thesis

48
Beyond the Command- Line Interface: Transcriptome Analysis based on Next-Generation- Sequencing Data

Upload: bastian-greshake

Post on 15-May-2015

1.842 views

Category:

Education


0 download

DESCRIPTION

The slides i used for the defense of my BSc-Thesis. A blogpost in german about the work can be found here: http://www.wissenslogs.de/wblogs/blog/bierologie/biologie/2010-08-11/meine-bachelor-arbeit-transkriptom-analysen-f-r-jeden

TRANSCRIPT

Page 1: Defense of my BSc-Thesis

Beyond the Command-Line Interface:

Transcriptome Analysis based on Next-Generation-Sequencing Data

Page 2: Defense of my BSc-Thesis

Beyond the Command-Line Interface:

Transcriptome Analysis based on Next-Generation-Sequencing Data

Page 3: Defense of my BSc-Thesis

Transcriptome Analysis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Assembly / Annotation

• cheap to produce

• fast way of producing data

• allows comparative analysis

Page 4: Defense of my BSc-Thesis

Assembly

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Assembly / Annotation

Page 5: Defense of my BSc-Thesis

Assembly

read 1

read III

read 1V read 1I

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Assembly / Annotation

Page 6: Defense of my BSc-Thesis

Assembly

read 1 read III

read 1Vread 1I

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Assembly / Annotation

Page 7: Defense of my BSc-Thesis

Assembly

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Assembly / Annotation

Page 8: Defense of my BSc-Thesis

Assembly

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Assembly / Annotation

Page 9: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Assembly / Annotation

Annotation

Image: http://commons.wikimedia.org/wiki/User:Rama, CC-BY-SA 2.0

Page 10: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Assembly / Annotation

Annotation

Image: http://commons.wikimedia.org/wiki/User:Rama, CC-BY-SA 2.0

Page 11: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Assembly / Annotation

Annotation

Image: http://commons.wikimedia.org/wiki/User:Rama, CC-BY-SA 2.0

Page 12: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Tools / Interfaces / Drawbacks

How is it done?What tools do exist?

Page 13: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Tools / Interfaces / Drawbacks

How is it done?

What tools do exist?

Page 14: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Tools / Interfaces / Drawbacks

How is it done?What tools do exist?Assembly

SNP-Detection

Homology-Detection

ORF-Prediciton

CAP3 QualitySNP BLAST ESTScan

MIRA AutoSNP FASTX/FASTY ORFPredictor

PolyBayes HMMER

MIRA PatternHunter

Page 15: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Tools / Interfaces / Drawbacks

Page 16: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Tools / Interfaces / Drawbacks

Toolword count

of the manual

BLAST2GO 618

BLAST 3469

HMMER 32213

MIRA 71654

Page 17: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Tools / Interfaces / Drawbacks

Chin et al, 1998

Drawbacks

• spoilt for choice by the amount of tools

• time-consuming to find right tools

• lots of parameters to learn

• the command-line interface is unintuitive

Page 18: Defense of my BSc-Thesis

Beyond the CLIGetting around the Drawbacks

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Chin et al, 1998

• minimize choice of tools

• minimize the parameters for each tool

• add a graphical user interface

Page 19: Defense of my BSc-Thesis

PipelineTranscriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 20: Defense of my BSc-Thesis

PipelineTranscriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Task Tool

File-ConversionPerl- & Python-

Scripts

Assembly MIRA

SNP-Detection MIRA

Homology-Detection

BLAST

ORF-Prediction ORFPredictor

Pfam-Annotationpfam_scan /

HMMER

GO-Annotation BLAST2GO

Page 21: Defense of my BSc-Thesis

PipelineTranscriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Task Tool

File-ConversionPerl- & Python-

Scripts

Assembly MIRA

SNP-Detection MIRA

Homology-Detection

BLAST

ORF-Prediction ORFPredictor

Pfam-Annotationpfam_scan /

HMMER

GO-Annotation BLAST2GO

Page 22: Defense of my BSc-Thesis

PipelineTranscriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Task Tool

File-ConversionPerl- & Python-

Scripts

Assembly MIRA

SNP-Detection MIRA

Homology-Detection

BLAST

ORF-Prediction ORFPredictor

Pfam-Annotationpfam_scan /

HMMER

GO-Annotation BLAST2GO

Page 23: Defense of my BSc-Thesis

PipelineTranscriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Task Tool

File-ConversionPerl- & Python-

Scripts

Assembly MIRA

SNP-Detection MIRA

Homology-Detection

BLAST

ORF-Prediction ORFPredictor

Pfam-Annotationpfam_scan /

HMMER

GO-Annotation BLAST2GO

Page 24: Defense of my BSc-Thesis

GUI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 25: Defense of my BSc-Thesis

GUI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 26: Defense of my BSc-Thesis

GUI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 27: Defense of my BSc-Thesis

GUI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 28: Defense of my BSc-Thesis

GUI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 29: Defense of my BSc-Thesis

GUI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 30: Defense of my BSc-Thesis

GUI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 31: Defense of my BSc-Thesis

GUI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 32: Defense of my BSc-Thesis

GUI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 33: Defense of my BSc-Thesis

GUI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 34: Defense of my BSc-Thesis

Distributed Ruby

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 35: Defense of my BSc-Thesis

Distributed Ruby

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Solutions / Pipeline / GUI / Distributed Ruby

Page 36: Defense of my BSc-Thesis

Results

Zostera marina

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Zostera marina / Results

Image: Colin Faulkingham, CC-BY-SA 2.5

Page 37: Defense of my BSc-Thesis

Results

Zostera marina

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Zostera marina / Results

Image: Colin Faulkingham, CC-BY-SA 2.5

866838 454-reads

Page 38: Defense of my BSc-Thesis

Results

Zostera marina

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Zostera marina / Results

Image: Colin Faulkingham, CC-BY-SA 2.5

866838 454-reads

35364 contigs

Page 39: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Zostera marina / Results

Page 40: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Zostera marina / Results

Page 41: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Zostera marina / Results

Page 42: Defense of my BSc-Thesis

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Zostera marina / Results

Page 43: Defense of my BSc-Thesis

Shortcomings

• only basic analysis due to lack of parameters of single tools

• no graphical representation of results

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Shortcomings & Solutions / Summary

Page 44: Defense of my BSc-Thesis

Shortcomings• only basic analysis due to lack of

parameters of single tools

• no graphical representation of results

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Shortcomings & Solutions / Summary

Solutions• add optional “expert” parameters

• implement graphical representations

Page 45: Defense of my BSc-Thesis

Summary

• Transcriptome Analysis needs software tools

• Many different tools for each task exist

• Most of them are CLI-driven

• People don‘t like the CLI

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Shortcomings & Solutions / Summary

Page 46: Defense of my BSc-Thesis

Summary

• Built a pipeline that provides usual tools and a GUI

• The pipeline works for standard tasks of Transcriptome Analysis

• In an upcoming version more tool-parameters & graphical results shall be added

Transcriptome Analysis / How is it done? / Beyond the CLI / Results / Outlook

Shortcomings & Solutions / Summary