Computational Proteomics and Metabolomics · 2014-10-12
This work is licensed under a Creative Commons Attribution 4.0 International License.
COMPUTATIONAL PROTEOMICS AND METABOLOMICS
Oliver Kohlbacher, Sven Nahnsen, Knut Reinert
0. Introduction and Overview
Systems Biology – Definition
“Systems biology is a relatively new biological study field that focuses on the systematic study of complex interactions in biological systems, thus using a new perspective (integration instead of reduction) to study them. Particularly from year 2000 onwards, the term is used widely in the biosciences, and in a variety of contexts. Because the scientific method has been used primarily toward reductionism, one of the goals of systems biology is to discover new emergent properties that may arise from the systemic view used by this discipline in order to understand better the entirety of processes that happen in a biological system.”
http://en.wikipedia.org/wiki/Systems_biology (06/06/2008)
Integration vs. Reductionism
• Systems biology as an integrative approach takes the reductionist approach one step further
• Do not only understand the components, but understand ‘emerging properties’ of a system
• Key to this is the integration of different data, covering different aspects of the system
• Integrated modeling of the whole system can then reveal these emerging and dynamic properties
Example: circadian clock – the temporal (dynamic) behavior is an emerging property of the rather simple interaction of a few key players.
Circadian Clock in Cyanobacteria
• Circadian clocks are internal oscillators implementing a 24-hour rhythm in most organisms
• The model shown on the right is a simple model for cyanobacteria including three genes (KaiA, KaiB, and KaiC – A/B/C)
• Their interaction, phosphorylation, hexamer formation (H6), etc. are simple processes that can be described mathematically
• Together these simple processes give rise to the oscillation shown on the right, which agrees well with experimental data
• Looking at each of the processes in isolation will not reveal the oscillation
I. Axmann, S. Legewie, and H. Herzel (2007). A minimal circadian clock model. Genome Informatics. 18:54-64.
Central Dogma of Molecular Biology
Origin of the “Central Dogma of Molecular Biology” (Francis Crick, 1956)
• First articulation by Francis Crick in 1956
• Published in Nature in 1970
Genome sequencing
February 2001 – Publication of the first draft of the human genome
‘Postgenomics’ – The Age of Omes
http://www.oed.com http://www.nature.com/omics/about/index.html
-ome, comb. form […] 3. Cell Biol. and Molecular Biol. Forming nouns with the sense ‘all of the specified constituents of a cell, considered collectively or in total’, as plastidome n., plastome n., vacuome n.
(Oxford English Dictionary online)
Ever since the rise of genomics, the suffix "-omics" has been added to many fields to denote studies undertaken on a large or genome-wide scale. While not everyone agrees with this change of terms, we felt that the terms are sufficiently widely used to serve as pointers to our published papers in the area.
(Website of ‘Nature’)
OMICS Mania
http://omics.org/index.php/Alphabetically_ordered_list_of_omes_and_omics
The World of Omes
• Genome: DNA
• Transcriptome: mRNA
• Proteome: Protein
• Metabolome: Metabolites
http://www.pdb.org/pdb/images/2nzt_bio_r_500.jpg
Technologies
Next-Generation Sequencing
Genome Epigenome
Transcriptome RNOme
Proteome Interactome
Metabolome Lipidome
Mass Spectrometry
Human Proteome
Nature cover, May 2014
• Two draft versions of the human proteome (for various tissues)
• Claim ~90% coverage of the proteome
Nature, 509 (2014) [cover]
OMICS Data
• High-throughput techniques provide data for one specific type of relationship
  • Genomics: DNA sequence data
  • Transcriptomics: mRNA concentration
  • Proteomics: protein concentrations/sequence
  • Metabolomics: metabolite concentrations
  • Interactomics: protein-protein interaction data
• OMICS data is reductionist, but at a very large scale
• OMICS data is often voluminous, but of low quality/noisy
Classical Data vs. Omics Data
Classical
• Low-throughput
• Low-dimensional, often single facts
• High accuracy, every data point supported by multiple experiments
• Analysis of experiments simple (small data volume!)
Omics
• High-throughput
• High-dimensional, measuring many parameters in parallel
• Often low accuracy, lots of noise
• Often not interpretable without statistics/bioinformatics
Omics is a Matter of Perspective!
Omics is a Matter of Perspective
• Each omics technology/level provides a cross-section of one particular type of biomolecules
• Different levels thus correlate (roughly) to distinct ??????
  • Genomics: what can the cell potentially do?
  • Transcriptomics: what is currently being turned on?
  • Proteomics: what enzymes are currently active? which signals are being transduced?
  • Metabolomics: what is being produced/consumed?
• Different levels thus provide a different functional perspective
Omics Technologies
http://en.wikipedia.org/wiki/File:Metabolomics_schema.png , accessed 2014-03-10, 11:42:00 UTC
Integrative Analysis
• Analyzing an individual data set is trivial
• Simultaneous integrated analysis of data from multiple layers/types of data is currently still the major challenge!
Computational Systems Biology
• The complexity and also the sheer amount of data produced with high-throughput techniques makes manual analysis difficult
• Systems biology thus requires a strong computational component:
Computational Systems Biology
http://www.sys-bio.org/contentimages/WhyWeNeedComputer.png
Challenges in Data Integration
• Semantic integration of data from different sources
  • Different data formats
  • Ambiguities, nomenclature
• Lack of data
  • We do not know everything!
  • High-throughput methods show only a fraction of ‘everything’ (detection limits!)
• Different scales
  • Time scales different, length scales different
  • How to model different resolutions simultaneously?
Protein
• A protein or polypeptide consists of a linear chain of amino acids that build 3-dimensional structures
• Amino acids are connected via peptide bonds
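[Figure: peptide backbone with four residues, H2N-CH(R1)-CO-NH-CH(R2)-CO-NH-CH(R3)-CO-NH-CH(R4)-COOH; the peptide bonds link the residues between the N-terminus (H2N end) and the C-terminus (COOH end)]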
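The linear-chain picture can be made concrete with a small calculation. The following is a minimal sketch, not part of the original slides: forming each peptide bond releases one water, so the mass of a peptide is the sum of its residue masses plus one water for the free termini. The table of standard monoisotopic residue masses and the function name `peptide_mass` are illustrative assumptions.

```python
# Monoisotopic residue masses in Da (amino acid minus one water).
# Standard textbook values; the table and function name are illustrative.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565  # H2O: the H at the N-terminus plus the OH at the C-terminus


def peptide_mass(sequence):
    """Monoisotopic mass of an unmodified linear peptide (one-letter codes)."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER


if __name__ == "__main__":
    print(round(peptide_mass("GG"), 4))  # glycylglycine
```

Note that the mass depends only on the composition, not on the order of the residues, which is one reason why a single mass value is not enough to identify a peptide.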
Proteomics
• Proteomics: study of a proteome
• Proteome: sum of all proteins in a given sample (e.g., tissue, cell, time-point)
• Proteomics typically tries to
  • Catalog the proteins in a sample (qualitative proteomics)
  • Quantify the proteins in a sample, i.e., determine the concentrations of all proteins (quantitative proteomics)
• Concentrations in a sample vary drastically – large dynamic range required (see figure on the right)
Kulak et al., Nat. Methods 11, 319–324 (2014) doi:10.1038/nmeth.2834
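To make 'dynamic range' tangible, here is a minimal sketch that expresses the spread of protein abundances in orders of magnitude. The copy numbers below are invented for illustration only and are not taken from Kulak et al.; the function name `dynamic_range_orders` is likewise an assumption.

```python
import math


def dynamic_range_orders(abundances):
    """Orders of magnitude between the most and least abundant protein."""
    positive = [a for a in abundances if a > 0]  # ignore undetected proteins
    return math.log10(max(positive) / min(positive))


if __name__ == "__main__":
    # Hypothetical copies per cell for four proteins (illustrative values).
    copies_per_cell = [50, 2_000, 150_000, 50_000_000]
    print(dynamic_range_orders(copies_per_cell))
```

An instrument that is to quantify all of these proteins in one run has to cover at least that many orders of magnitude in signal intensity.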
Proteomics – Typical Questions
• Defining a protein raises some problematic issues
  • Protein identity: unique amino acid sequence and single source of origin?
  • There may be different genes encoding the identical amino acid sequence
  • Different organisms may encode identical proteins
  • Splice variants: A gene can give rise to different mRNAs
  • Polymorphisms: many genes occur in allelic variants encoding sequence variations
  • Posttranslational modifications: PTMs are very heterogeneous and significantly alter the function of the protein
Proteomics - Examples
http://www.iamashcash.com/wp-content/uploads/2011/03/caterpillar-to-butterfly1.jpg, accessed: 14/10/2013 6 PM
http://www.ufrgs.br/laprotox/en/what-we-do/research-lines/ureases-non-enzymatic-properties/ureases-induce-platelet-secretion-and-aggregation
Beck et al., Blood (2014), 123(5):e1-e10. doi: 10.1182/blood-2013-07-512384
Understanding phenotypes: Genome remains the same…
…proteome changes
Understanding signaling: Platelets are non-nucleated cells – to understand their behavior (blood clotting) phosphoproteomics is required. It reveals time-resolved activation of kinases.
Activated platelets Time course of selected phosphopeptides (Beck et al., 2014)
Main fields of proteomics
protein expression
protein characterization (identification + PTMs)
protein interaction
protein localization
Applications of proteomics
• Drug target identification • Determine content of a protein mixture
• Understanding regulation of protein activity
• Gene annotation • Therapeutic markers
• Drug target identification
• Functional annotation (compartment and function)
• Drug target identification
protein expression
protein characterization (identification + PTMs)
protein interaction
protein localization
Metabolites
• Metabolites are intermediates and products of metabolic processes – everything that biochemistry can create
• Technically speaking also DNA, RNA and proteins could be considered metabolites
• The term is usually restricted to small molecules
• Spans a variety of substance classes (not complete):
  • Amino acids
  • Alcohols
  • Lipids
  • Sugars
  • …
• Chemically much more diverse than proteome!
Metabolomics – The Big Picture
Nicholson and Lindon. Nature 2008, 455, 1054-1056
Metabolic Networks
http://www.genome.jp/dbget-bin/www_bget?pathway+ecj00020
Technologies
Modern Proteomics and Metabolomics studies are based on
Chromatography coupled to Mass spectrometry (MS)
http://en.wikipedia.org/wiki/High-performance_liquid_chromatography. Access 14/10/2013, 5 PM
www.planetorbitrap.com Access 14/10/2013, 5 PM
Technologies
• Chromatography (GC/LC)
  • Chromatography separates proteins/peptides or metabolites
  • Reduces complexity of samples
• Mass spectrometry (MS)
  • Identifies the biomolecules (the mass spectrum is often used like a ‘fingerprint’ of the molecule)
  • Signal intensity is proportional to concentration of the molecule in the sample
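Because intensity scales with concentration, comparing the intensity of the same molecule between two samples gives a relative quantification. The sketch below illustrates that idea under simplifying assumptions: the peptide names and intensities are invented, `fold_changes` is a hypothetical helper, and the proportionality is assumed to hold for each molecule individually (different molecules ionize with different efficiency, so intensities of different molecules are not directly comparable).

```python
def fold_changes(intensity_a, intensity_b):
    """Ratio B/A of signal intensities for every molecule seen in both samples."""
    return {
        name: intensity_b[name] / intensity_a[name]
        for name in intensity_a
        if name in intensity_b and intensity_a[name] > 0
    }


if __name__ == "__main__":
    # Hypothetical peptide intensities in two samples (illustrative values).
    sample_a = {"PEPTIDEK": 2.0e6, "SAMPLER": 1.0e6, "ONLYINAK": 3.0e5}
    sample_b = {"PEPTIDEK": 4.0e6, "SAMPLER": 5.0e5}
    for name, ratio in fold_changes(sample_a, sample_b).items():
        print(name, ratio)
```

Molecules detected in only one sample are skipped here; real workflows have to treat such missing values explicitly.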
Shotgun proteomics
• Protein extraction
• Trypsin digestion
• Peptide fractionation (e.g., isoelectric focusing)
• High Performance Liquid Chromatography (HPLC)
• Mass Spectrometry (MS): MS spectrum, MS/MS spectrum
• Computational proteomics
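The trypsin digestion step has a direct computational counterpart: an in-silico digest that predicts which peptides a protein will yield. The following is a minimal sketch under a commonly used simplification, namely that trypsin cleaves after lysine (K) or arginine (R) unless the next residue is proline (P). The function name, the `missed_cleavages` parameter and the example sequence are illustrative assumptions.

```python
def tryptic_digest(protein, missed_cleavages=0):
    """Peptides from a simplified trypsin rule: cut after K/R, not before P."""
    # Collect cut positions, including both ends of the protein.
    sites = [0]
    for i, aa in enumerate(protein[:-1]):
        if aa in "KR" and protein[i + 1] != "P":
            sites.append(i + 1)
    sites.append(len(protein))

    peptides = []
    for start in range(len(sites) - 1):
        # Allow up to `missed_cleavages` skipped cut sites per peptide.
        for skipped in range(missed_cleavages + 1):
            end = start + 1 + skipped
            if end < len(sites):
                peptides.append(protein[sites[start]:sites[end]])
    return peptides


if __name__ == "__main__":
    print(tryptic_digest("AKRPGKC"))
    print(tryptic_digest("AKRPGKC", missed_cleavages=1))
```

In the example, the R is not a cut site because it is followed by P, so the fully cleaved peptides are AK, RPGK and C.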
At its core: HPLC-MS
• HPLC (Separation 1): separate peptides by their retention time on column
• ESI (Ionization): electrospray, transfers charge to the peptides
• TOF (Separation 2): MS separates by mass-to-charge ratio (m/z)
[Figure: HPLC-MS run; intensity (I) over retention time (RT), with one spectrum per scan]
Mass Spectrometry
Mass spectrometry measures a peptide's mass-to-charge ratio (m/z)
[Figure: mass spectra; axes: intensity vs. m/z]
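A short sketch of what the instrument reports, assuming positive-mode ionization in which each charge is carried by one added proton: a peptide of neutral mass M with charge z appears at (M + z · m_proton) / z. The function name `mz` and the example mass are illustrative assumptions.

```python
PROTON = 1.007276  # mass of a proton in Da


def mz(neutral_mass, charge):
    """m/z of a molecule carrying `charge` protons (positive ion mode)."""
    return (neutral_mass + charge * PROTON) / charge


if __name__ == "__main__":
    # The same hypothetical 1000 Da peptide at charge states 1 to 3.
    for z in (1, 2, 3):
        print(z, round(mz(1000.0, z), 4))
```

The same peptide therefore shows up at several m/z values, one per charge state, which downstream software has to recognize as a single molecule.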
Proteomics: Database Search
• Identification of mass spectra is easily done through database search
  • Search all peptides of matching mass from a database
  • Construct a theoretical mass spectrum for these peptide candidates
  • Score against the experimental spectrum
• Post-genomics: database search is possible because we have a genome sequence
[Figure: experimental spectra of unknown identity (?) are matched against a sequence DB]
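The three search steps can be sketched in a few lines. This is a deliberately simplified illustration, not the scoring scheme of any particular search engine: candidates are filtered by precursor mass, the theoretical spectrum consists only of singly charged b- and y-ions, and the score is the number of theoretical peaks found in the experimental spectrum. All names, tolerances and the toy database are assumptions.

```python
# Standard monoisotopic residue masses in Da; table and names are illustrative.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565
PROTON = 1.007276


def peptide_mass(peptide):
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER


def fragment_mz(peptide):
    """Theoretical spectrum: singly charged b- and y-ion m/z values."""
    total = sum(RESIDUE_MASS[aa] for aa in peptide)
    ions, prefix = [], 0.0
    for aa in peptide[:-1]:
        prefix += RESIDUE_MASS[aa]
        ions.append(prefix + PROTON)                  # b-ion (N-terminal part)
        ions.append(total - prefix + WATER + PROTON)  # y-ion (C-terminal part)
    return sorted(ions)


def score(peptide, observed_peaks, tol=0.02):
    """Number of theoretical peaks matched by an observed peak within tol."""
    return sum(
        any(abs(t - o) <= tol for o in observed_peaks)
        for t in fragment_mz(peptide)
    )


def search(precursor_mass, observed_peaks, database, precursor_tol=0.01):
    """Rank database peptides of matching mass by their score, best first."""
    candidates = [
        p for p in database if abs(peptide_mass(p) - precursor_mass) <= precursor_tol
    ]
    return sorted(((score(p, observed_peaks), p) for p in candidates), reverse=True)


if __name__ == "__main__":
    database = ["GASK", "GSAK", "PEPTIDE"]  # toy database
    spectrum = fragment_mz("GASK")          # idealized, noise-free spectrum
    print(search(peptide_mass("GASK"), spectrum, database))
```

In the toy example GASK and GSAK have identical precursor mass, so only the fragment spectrum separates them; real spectra additionally contain noise, missing peaks and other ion types, which is why practical scoring functions are considerably more elaborate.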
Growth of Omics Data (EBI Repositories)
Illustration: Christoph Steinbeck, EBI
Multi-Omics/Polyomics
• Systems biology requires an integrative view spanning more than one omics level – this is called ‘multi-omics’ or ‘polyomics’
• Data sets are
  • Huge (often hundreds of GB)
  • Heterogeneous
  • Complex in their structure
• Integrative analysis is complex (usually takes longer than data generation)
• Complex analysis workflows are hard to reproduce
Big Data and Reproducible Science