post-genomics functional genomics (a) identifying genes from the sequence (b) gene expression...

42

Upload: hilary-thomas

Post on 13-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics
Page 2: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Post-genomics

Page 3: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Functional genomics(A) Identifying genes from the sequence (B) Gene expression profiling

(transcriptome)(C) Model systems

Proteomics

Systems biology

Post-genomics

Page 4: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

(A) Hunting genes from the sequence

2 broad approaches

1) Ab initio method (computational)

2) Experimental method

Page 5: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

1) Ab initio method (computational)

Scanning ORFs (open reading frames)

5’- ATGACGCATGATCGAGGAT –3’

3’ – TACTGCGTACTAGCTCCTA –5’

AACTAA

ATG

CCTCTA

TCC

Page 6: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Ab initio method (computational)

initiation or termination codons Codon bias found in specific species

Not all codons used at same frequency e.g.human leucine mainly coded by CTG and rarely by TTA or CTA

Exon-intron boundaries (splice sites)5’-AG GTAAGT-3’ hit and miss affair

Upstream control sequences – e.g conserved motifs in transcription factor

binding regions CpG islands

Page 7: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

2) experimental method

Experimental evaluation based on the use of transcribed RNA to locate exons and entire genes from DNA fragment.

Page 8: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

experimental method Some strategies Hybridisation approaches – Northern

Blots, cDNA capture / cDNA select, Zoo blots

Transcript mapping: RT-PCR, exon trapping etc In this method, known DNA databases are searched to find out whether the test sequence is similar to any other known genes, suggesting an evolutionary relationship.

Page 9: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Northern Blot Zoo Blot

Page 10: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Transcriptomecomplete collection of transcribed

elements of the genome (global mRNA profiling)

transcriptome maps will provide clues on • Regions of transcription• Transcription factor binding sites • Sites of chromatin modification • Sites of DNA methylation • Chromosomal origins of replication

(B) Gene expression profiling

Page 11: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Homology searches (BLAST searches)

- Orthologous genes (homologues in different

organisms with common ancestor) – comparative

genomics

- Paralogous genes (genes in the same organism,

e.g. multigene families)

- orphan genes / families

COMPUTATIONAL APPROACH

Page 12: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

The transcriptome

Analysis can be done by either

Microarray technology

SAGE (serial analysis of gene expression) technology

Page 13: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

(a) Schematic drawing of a DNA chip.

Microarray(chip) Segment of

a chip Spot containing copiesof a single DNAmolecule

Part of oneDNA strand

AG

GACGT

DNAbases

AATTCGC

Examplesof reactions

TT

AAGCG

AAT

TCGC

cCNA fromuntreatedcells

chip DNA

TT

AAGCG

cDNAfromtreatedcells

Pair ofcomplementary

basesA

GGACGT

GG

GACTA

CC

CGGAT

TC

CTGCA

(b) The analysis of the hybridization process identifies genes that respond in specific ways.

Cell samples are stabilizedand fluorescent labelsare added.

Gene that strongly increasedactivity in treated cells

Gene that strongly decreasedactivity in treated cellsGene that was equally activein treated and untreated cells

Gene that was inactivein both groups

(c) Computer analysis of the binding of complementary sequences can identify genes that respond to drug treatment.

Microarrays

Page 14: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

gene inactivation methods (knockouts, RNAi, site-directed mutagenesis, transposon tagging, genetic footprinting etc)

Gene overexpression methods (knock-ins, transgenics, reporter genes)

MODEL SYSTEMS

Page 15: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

RNAi

RNAi mimics loss-of-function mutations

Non-inheritable

Lack of reproducibility

Page 16: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

How does RNAi work?

http://www.nature.com/focus/rnai/animations/index.html

Page 17: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Gene overexpression methods (knock-ins, transgenics, reporter genes etc)

MODEL SYSTEMS

Page 18: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Proteomics

Nature (2003) March 13: Insight articles from pg 194

Analysis of protein expressionProtein structure Protein-protein interactions

Page 19: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Proteomics

Proteome projects - co-ordinated by the HUPO (Human Protein Organisation)

Involve protein biochemistry on a high-throughput scale

Problems limited and variable sample material, sample degradation, abundance, post-translational modifications, huge tissue, developmental and temporal

specificity as well as disease and drug influences.

Nature (2003) March 13: Insight articles from pgs 191-197.

Page 20: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Approaches in proteomics

Nature (2003) March 13: Insight articles from pgs 191-197.

High throughput approach

1)Mass- spectrometry

2) Array based

proteomics

3)Structural proteomics

Page 21: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

High throughput approaches in proteomics

1) Mass spectrometry-based proteomics: relies on the discovery of protein ionisation techniques.

used for protein identification and

quantification, profiling, protein interactions and modifications.

Nature (2003) March 13: Insight articles from pgs 191-197

Page 22: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

two dimensional gels and mass spectrometry

Identification of proteins in complex mixtures

Page 23: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

19_09.jpg

two dimensional gels

Page 24: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Mass spectrometry (MS)

Nature (2003) March 13: Insight articles from pgs 191-197

Page 25: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

ionizer source: converts analyte to gaseous ions mass analyser: measures mass-to-charge ratio

(m/z)detector: registers the number of ions at each m/z

Principle of MS

Page 26: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Types of ionizer sources

Nature (2003) March 13: Insight articles from pgs 191-197.

Electrospray ionisation (ESI)matrix-assisted laser desortion/ionisation (MALDI)

MALDI-MS - simple peptide mixtures whereas ESI-MS - for complex samples.

Page 27: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

2) Array-based proteomics

Nature (2003) March 13: Insight articles from pgs 191-197.

Based on the cloning and amplification of identified ORFs into homologous (ideally used for bacterial and yeast proteins) or sometimes heterologous systems (insect cells which result in post-translational

modifications similar to mammalian cells). A fusion tag (short peptide or protein

domain that is linked to each protein member e.g. GST) is incorporated into the plasmid construct.

Page 28: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Array based proteomics….

Nature (2003) March 13: Insight articles from pgs 191-197.

a.  Protein expression and purification b.  Protein activity: Analysis can be done using

biochemical genomics or functional protein microarrays. c.  Protein interaction analysis two-hybrid analysis (yeast 2-hybrid), FRET (Fluorescence resonance energy transfer), phage display etc d. Protein localisation: immunolocalisation of epitope-tagged products. E.g the use of GFP or luciferase tags

Page 29: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Array based proteomics….

Nature (2003) March 13: Insight articles from pgs 191-197.

Protein chips

Antibody chips – arrayed antibodiesAntigen chips – arrayed antigensFunctional arrays – arrayed proteinsProtein capture chips – arrayed capture agents that interact with proteins e.g. BIAcoreSolution arrays – nanoparticles

Page 30: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

19_14.jpg

3) Structural proteomics

Page 31: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

19_14.jpg

3) Structural proteomics

Page 32: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Identification of protein-protein

interactionsaffinity capture/mass

spectrometry

Fig. 10. 31

Page 33: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Identification of protein-protein interactionsPhage display

Fig. 10.32

Page 34: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Systems Biology – the global study of multiple components of biological systems and their

interactions

• New approach to studying biological systems has made possible– Sequencing genomes– High-throughput platform development– Development of powerful computational

tools– The use of model organisms– Comparative genomics

Page 35: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Six steps in systems approach

• Formulate computer based model for the system

• Discovery science to define as many of the system’s elements as possible

• Perturb the system genetically or environmentally

• Integrating levels of information form perturbations

• Formulate hypothesis to explain disparities between model and experimental data

• Refine the model after integrating data

Page 36: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

19_20.jpg

Page 37: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Nitin S. Baliga et al. Genome Res. 2004; 14: 1025-1035

Systems biology approach to studying how Halobacterium NRC-1 transcriptome responds to uv radiation

Page 38: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

The systems biology approach to studying how yeast turns genes on and off in the

utilization of galactose

Fig. 10.34

Page 39: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Fig. 10.35

Page 40: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Human Genome Project has changed the potential for predictive/preventive

medicine

• Provided access to DNA polymorphisms underlying human variability– Makes possible identification of genes

predisposing to disease– Understanding of defective genes in context of

biological systems– Circumvent limitations of defective genes

• Novel drugs• Environmental controls• Approaches such as stem-cell transplants or

gene therapy

Page 41: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

Challenges for the future – ‘physiome’

Nature Reviews Molecular Cell Biology 4; 237-243 (2003)

Page 42: Post-genomics Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems Proteomics

General Reading– Chapter 19- HMG3 by Strachan and Read

Reference

- Nature (13 March 2003). Proteomics insight articles from Vol. 422, No. 6928 pgs 191-197