bigcat bioinformatics hunting strategy of the bigcat
TRANSCRIPT
![Page 1: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/1.jpg)
BiGCaT BioinformaticsHunting strategy of the bigcat
![Page 2: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/2.jpg)
BiGCaT,BiGCaT,bridge between two universitiesbridge between two universities
Universiteit MaastrichtPatients, Experiments,
Arrays and Loads of Data
TU/eIdeas & Experience in Data Handling
BiGCaT
![Page 3: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/3.jpg)
Major Research FieldsMajor Research Fields
CardiovascularResearch
Nutritional &Environmental
Research
BiGCaT
![Page 4: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/4.jpg)
What are we looking for?What are we looking for?
![Page 5: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/5.jpg)
What are we looking for?What are we looking for?
Different conditions show different levels of gene expression for specific genes
![Page 6: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/6.jpg)
Differences in gene expression?Differences in gene expression?Between e.g.:Between e.g.:• healthy and sickhealthy and sick• different stages of disease progressiondifferent stages of disease progression• different stages of healingdifferent stages of healing• failed and successful treatment• more and less vulnerable individuals
Shows:Shows:• important pathways and receptors important pathways and receptors • which then can be influencedwhich then can be influenced
![Page 7: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/7.jpg)
The transfer of informationThe transfer of informationfrom DNA to protein.from DNA to protein.
From: Alberts et al. Molecular Biology of the Cell, 3rd edn.
![Page 8: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/8.jpg)
Eukaryotic genesEukaryotic genesin somewhat more detailin somewhat more detail
![Page 9: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/9.jpg)
Gene Gene expression measurementexpression measurement
Functional genomics/transcriptomics:Changes in mRNA– Gene expression microarrays– Suppression subtraction lybraries–
Proteomics:Changes in protein levels– 2D gel electrophoresis – Antibody arrays–
DNA mRNA protein
![Page 10: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/10.jpg)
Gene expression Gene expression arraysarraysMicroarrays: relative
fluorescense signals. Identification.
Macroarrays: absolute radioactive signal. Validation.
![Page 11: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/11.jpg)
Layout of a microarray experiment
1) Get the cells
2) Isolate RNA
3) Make fluorescent cDNA
4) Hybridize
5) Laser read out
6) Analyze image
![Page 12: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/12.jpg)
The cat and its prey:The cat and its prey:the datathe data
Comprises: Known cDNA sequences (not known genes!)
on the array = reporters Data sets typically contain 20,000 image spot
intensity values in 2 colors One experiment often contains multiple data
points for every reporter (e.g. times or treatments) Each datapoint can (should) consist of multiple
arrays
Bioinformatics should translate this in to useful biological information
![Page 13: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/13.jpg)
HuntingHunting
Comprises: Analyze reporters Data pretreatment Finding patterns in expression Evaluate biological significance of
those patterns
![Page 14: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/14.jpg)
Reporter analysisReporter analysis
Reporter sequence must be known(can be sequenced using digest electrophoresis).
Lookup sequence in genome databases (e.g. Genbank/Embl or Swissprot)
Will often find other RNA experiments (ESTs) or just chromosome location.
![Page 15: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/15.jpg)
Blast reporters against what?Blast reporters against what?
Nucleotide databases (EMBL/Genbank)Disadvantages: many hits, best hit on clone, we actually want function (ie protein)
Nucleotide clusters (Unigene)Disadvantage: still no function
Protein databases (Swissprot+trEMBL)Disadvantages: non coding sequence not found, frameshifts in clones
![Page 16: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/16.jpg)
Two implemented solutionsTwo implemented solutions
Start with Unigene (from Blastn or platform provider), mine using SRS (direct, through PDB, through PIR) -> Swissprot/trEMBL
Use dedicated EMBL-Swissprot X-linked DB (Blast against EMBL subset get Swissprot/trEMBL)
![Page 17: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/17.jpg)
Two implemented solutionsTwo implemented solutions
Start with Unigene (from Blastn or platform provider), mine using SRS (direct, through PDB, through PIR) -> Swissprot/trEMBL
Use dedicated EMBL-Swissprot X-linked DB (Blast against EMBL subset get Swissprot/trEMBL)
![Page 18: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/18.jpg)
Scotland - Holland: 1-0? Scotland - Holland: 1-0?
Check Affymetrix reporter sequences.
- Each reporter 16 25-mer probes.- Blast against ENSEMBL genes
(takes 1 month on UK grid).- Use for cross-species analysis- Adapt RMA statistical analysis in
Bioconductor
![Page 19: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/19.jpg)
Next slide shows data of one Next slide shows data of one single actual microarraysingle actual microarray
Normalized expression shown for both channels.
Each reporter is shown with a single dot. Red dots are controls Note the GEM barcode (QC) Note the slight error in linear
normalization (low expressed genes are higher in Cy5 channel)
![Page 20: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/20.jpg)
![Page 21: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/21.jpg)
Next slide shows same data Next slide shows same data after processingafter processing
Controls removed Bad spots (<40% average area) removed Low signals (<2.5 Signal/Background)
removed All reporters with <1.7 fold change
removed (only changing spots shown)
![Page 22: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/22.jpg)
![Page 23: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/23.jpg)
Final slide shows information Final slide shows information for one single reporterfor one single reporter
This signifies one single spot It is a known gene:
an UDP glucuronyltransferase Raw data and fold change are
shown
![Page 24: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/24.jpg)
![Page 25: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/25.jpg)
Secondary Secondary AnalysesAnalyses
Gene clustering(find genes that behave equally)
Cluster evaluation(what do we see in clusters …)
Physiological evaluation(for arrays, proteomics, clusters)
Understand the regulation
![Page 26: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/26.jpg)
2
time
Exp
r. le
vel
Clustering: find genes with same pattern
T1 signal
T2
sig
na
l
Left hand picture shows expression patterns for 2 genes (these should probably end up in the same cluster).
Right hand picture shows the expression vector for one gene for the first 2 dimensions. Can be normalized by amplitude (circle) or relatively (square).
![Page 27: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/27.jpg)
Cluster evaluationCluster evaluation
Group genes (function, pathway, regulations etc.)
Find groups in patterns using visualization tools and automatic detection.
Should lead to results like:“This experiment shows that a large number of apoptosis genes are up-regulated during the early stage after treatment. Probably meaning that cells are dying”
![Page 28: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/28.jpg)
Example of GenMAPP results:
Manual lookup on a MAPP
![Page 29: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/29.jpg)
Understanding regulationUnderstanding regulation
The main idea: co-regulated genes could have common regulatory pathways.
The basic approach: annotate transcription factor binding sites using Transfac and use for supervised clustering.
The problem: each gene has hundreds of tfb’s.
Solution? Use syntenic regions using rVista (work in progress with Rick Dixon)
![Page 30: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/30.jpg)
Understanding QTL’sUnderstanding QTL’s
Get blood pressure QTLs: from ENSEMBL/cfg Welcome group.
Look up functional pathways and Go annotations using GenMapp: virtual experiment assume all genes in QTL are changing.
Create a new blood pressure Mapp: confront this with real blood pressure/heart failure microarray data.
Work in progress TU/e MDP3 group.
![Page 31: BiGCaT Bioinformatics Hunting strategy of the bigcat](https://reader036.vdocument.in/reader036/viewer/2022062404/551a7e185503466b3a8b4636/html5/thumbnails/31.jpg)
People involvedPeople involvedBigcat Maastricht: Rachel van Haaften (IOP), Edwin ter Voert (BMT), Joris Korbeeck (BMT/UM), Willem Ligtenberg (IOP), Stan Gaj (tUL), Chris Evelo
Tue: Peter Hilbers, Huub ten Eijkelder, Patrick van Brakel, lots of studentsCARIM: Yigal Pinto, Umesh Sharma, Blanche Schroen, Matthijs Blankesteijn, Jos Smits, Jo de Mey, Danielle Curfs, Kitty Cleutjens, Natasja Kisters, Esther Lutgens, Birgit Faber, Petra Eurlings, Ann-Pascalle Bijnens, Mat Daemen, Frank Stassen, Marc van Bilssen, Marten Hoffker. NUTRIM: Wim Saris, Freddy Troost, Johan Renes, Simone van Breda.GROW: Daisy vd Schaft, Chamindie PuyandeeraIOP Nutrigenomics: Milka Sokolovic, Theo Hackvoort, Meike Bunger, Guido Hooiveld, Michael Müller, Lisa Gilhuis-Pedersen, Antoine van Kampen, Edwin Mariman, Wout Lamers, Nicole Franssen, Jaap keijerCfg Welcome group: Neil Hanlon (Glasgow) Gontran Zepeda (Edinburg), Rick Dixon (Leicester), Sheetal Patel (London).Paris leptin group: Soraya Taleb, Rafaelle Cancello,Nathalie Courtin, Carine ClementOrganon: Jan Klomp, Rene van Schaik.BioAsp: Marc Laarhoven.