Chapter 14 Genomes and Genomics

Upload: percival-mathews

Post on 11-Jan-2016


Page 1: Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA

Chapter 14 Genomes and Genomics

Page 2

Sequencing DNA: dideoxy (Sanger) method

ddGTP ddATP ddTTP ddCTP

5’ TAATGTACG
   TAATGTAC
   TAATGTA
   TAATGT
   TAATG
   TAAT
   TAA
   TA
   T

Fred Sanger, Nobel Prize 1980
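The fragment ladder on this slide can be reproduced with a short Python sketch. It assumes an idealized reaction in which every possible chain-terminated fragment of the newly synthesized strand is present; the function names are illustrative and do not come from the slides.

```python
# Idealized model of dideoxy (Sanger) sequencing: each fragment is a 5' prefix
# of the newly synthesized strand that ends in a chain-terminating ddNTP.


def sanger_ladder(strand):
    """Return all chain-terminated fragments, longest first."""
    return [strand[:n] for n in range(len(strand), 0, -1)]


def split_into_lanes(fragments):
    """Group fragments by the dideoxy base that terminated them."""
    lanes = {"G": [], "A": [], "T": [], "C": []}
    for fragment in fragments:
        lanes[fragment[-1]].append(fragment)
    return lanes


def read_sequence(lanes):
    """Read the gel from the shortest to the longest fragment (5' to 3')."""
    fragments = [f for lane in lanes.values() for f in lane]
    return "".join(f[-1] for f in sorted(fragments, key=len))


ladder = sanger_ladder("TAATGTACG")
lanes = split_into_lanes(ladder)
print(ladder)                # the nine fragments shown on the slide
print(lanes["C"])            # fragments terminated by ddCTP
print(read_sequence(lanes))  # sequence recovered from fragment lengths
```

Sorting by length stands in for electrophoresis: shorter fragments migrate further, so reading the terminal base of each fragment from shortest to longest recovers the strand.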

Page 3

Sequencing DNA: dideoxy (Sanger) method

Leroy Hood, Caltech: fluorescence-based sequencing

Norm Dovichi: capillary electrophoresis

Page 4

Sequencing DNA: dideoxy (Sanger) method

Page 5

Genomics era: High-throughput DNA sequencing

The first high-throughput genomics technology was automated DNA sequencing in the early 1990s.

In September 1999, Celera Genomics completed the sequencing of the Drosophila genome.

Baker’s yeast, Saccharomyces cerevisiae (about 12 million bp), was the first eukaryotic genome to be sequenced.

TIGR (The Institute for Genomic Research), 1995: first whole-genome sequence of a free-living organism, Haemophilus influenzae

Page 6

Genomics: Completed genomes as of 2002

Currently, the genomes of over 600 organisms have been sequenced.

This generates large amounts of information to be handled by individual computers.

http://www.genomesonline.org/

Page 7

Cloning/libraries: BAC, YAC and ESTs

• BAC = bacterial artificial chromosome
  – 150 kb, replicates in E. coli

• YAC = yeast artificial chromosome
  – 150 kb to 1.5 Mb, replicates in yeast

Page 8

Assembling contigs
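As a rough illustration of contig assembly, the Python sketch below repeatedly merges the two reads with the longest suffix-to-prefix overlap. This greedy strategy is a simplification chosen for brevity, not the method used by any particular assembler: it assumes error-free reads from a single strand, and the reads and the `min_len` parameter are made up for the example.

```python
def overlap(a, b, min_len=3):
    """Length of the longest suffix of a that equals a prefix of b."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def assemble(reads, min_len=3):
    """Greedily merge the best-overlapping pair until no overlaps remain."""
    reads = list(reads)
    while len(reads) > 1:
        best_len, best_i, best_j = 0, None, None
        for i, a in enumerate(reads):
            for j, b in enumerate(reads):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_len:
                        best_len, best_i, best_j = n, i, j
        if best_len == 0:
            break  # the remaining reads form separate contigs
        merged = reads[best_i] + reads[best_j][best_len:]
        reads = [r for k, r in enumerate(reads) if k not in (best_i, best_j)]
        reads.append(merged)
    return reads


contigs = assemble(["TAATGT", "TGTACG", "ACGGAT"])
print(contigs)
```

Reads that share no sufficiently long overlap are left as separate contigs, which mirrors the gaps that remain between contigs in a real assembly.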

Page 9

Ordered-clone Sequencing

Clones ordered by restriction enzyme sites

Page 10

Annotation

• ORF – open reading frame

• EST – expressed sequence tag, based on mRNA

• Comparative genomics
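Finding ORFs is one of the simplest annotation steps to sketch in code. The Python example below scans the three forward reading frames for an ATG followed in frame by a stop codon. It is a minimal sketch under several assumptions: only the given strand is searched, introns are ignored, and the `min_codons` cutoff and test sequence are invented for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=2):
    """Return (start, end, orf) for ATG...stop runs in the 3 forward frames.

    start and end are 0-based, end-exclusive; the ORF includes the stop codon.
    min_codons counts the codons before the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, seq[start:i + 3]))
                start = None  # look for the next ORF in this frame
    return orfs


orfs = find_orfs("CCATGGCTTGATAA")
print(orfs)
```

A real annotation pipeline would also scan the reverse complement and support an ORF with evidence such as ESTs or homologs in other genomes.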

Page 11

The trend of data growth

[Figure: nucleotides (billions, 0 to 8) plotted against years (1980 to 2000)]

The 21st century is a century of biotechnology:

Microarray: global expression analysis; RNA levels of every gene in the genome are analyzed in parallel.

Proteomics: global protein analysis generates large mass-spectra libraries.

Metabolomics: global metabolite analysis; 25,000 secondary metabolites characterized.

Genomics: new sequence information is being produced at increasing rates. (The contents of GenBank double every year.)

Glycomics: global sugar metabolism analysis.

Page 12

How to handle the large amount of information?

Drew Sheneman, New Jersey--The Newark Star Ledger

Answer: bioinformatics and the Internet

Page 13

Bioinformatics history

IBM 7090 computer

In the 1960s: the birth of bioinformatics

Margaret Oakley Dayhoff created:
• The first protein database
• The first program for sequence assembly

There is a need for computers and algorithms that allow: Access, processing, storing, sharing, retrieving, visualizing, annotating…

Page 14

DNA (nucleotide sequences) databases

They are big databases and searching either one should produce similar results because they exchange information routinely.

- GenBank (NCBI): www.ncbi.nlm.nih.gov

- Arabidopsis (TAIR): www.arabidopsis.org

Specialized databases: tissues, species…

- ESTs (Expressed Sequence Tags)
  ~ at NCBI
  ~ at TIGR

- ...many more!

Page 15

Comparative genomics

BLAST: Basic Local Alignment Search Tool (http://www.ncbi.nlm.nih.gov/)

Homologs: orthologs and paralogs
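To make "local alignment" concrete, the Python sketch below computes the best local alignment score between two sequences with the Smith-Waterman dynamic-programming recurrence. This is not how BLAST itself works: BLAST uses a faster heuristic search, whereas this exhaustive version is shown only because it is short. The scoring values (match, mismatch, gap) and the example sequences are arbitrary assumptions.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best Smith-Waterman local alignment score between strings a and b."""
    prev = [0] * (len(b) + 1)  # previous row of the scoring matrix
    best = 0
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never go below 0.
            cur[j] = max(0, diag, prev[j] + gap, cur[j - 1] + gap)
            best = max(best, cur[j])
        prev = cur
    return best


score = local_alignment_score("TAATGTACG", "GGGTGTACCC")
print(score)
```

The two example sequences share the stretch TGTAC, so the best local score comes from those five matching positions even though the flanking regions differ.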

Page 16

Question

You are a researcher who has tentatively identified a human homolog of a yeast gene. You determine the DNA sequence of cDNAs of both your yeast gene and the human gene and decide to compare the gene sequences, as well as the predicted protein sequence of each, using alignment software. You would expect the greatest sequence identity from comparisons of the:

a. cDNA sequences

b. Protein sequences

c. Genomic DNA sequences

d. Both (a) and (b) will give you equivalent sequence similarity

e. All will give equivalent sequence similarity

Page 17

What is a microarray?

Page 18

Types of Arrays

• Expression Arrays
  – cDNA
  – Genome

• Affymetrix (GeneChip®)

• Agilent

• Tiling arrays

Page 19

Overview of Microarrays

Page 20

Transcription Profiling of a mutant

WT

mutant

Page 21

A “good” microarray plate

Red = only in treatment

Green = only in normal

Yellow = found in both

Black = found in neither
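The color key above amounts to a two-channel decision rule, sketched here in Python. The intensity scale and the `threshold` that separates "present" from "absent" are hypothetical values chosen for illustration; real scanners and analysis software set them differently.

```python
def spot_color(treatment, normal, threshold=100.0):
    """Classify a spot from its treatment (red) and normal (green) signals.

    threshold is an assumed cutoff above which a transcript counts as present.
    """
    in_treatment = treatment >= threshold
    in_normal = normal >= threshold
    if in_treatment and in_normal:
        return "yellow"  # found in both samples
    if in_treatment:
        return "red"     # only in treatment
    if in_normal:
        return "green"   # only in normal
    return "black"       # found in neither


# Hypothetical (treatment, normal) intensities for four spots.
spots = [(850.0, 20.0), (15.0, 640.0), (500.0, 480.0), (5.0, 8.0)]
colors = [spot_color(t, n) for t, n in spots]
print(colors)
```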

Page 22

Results

Hundreds of genes identified: those turned on and those turned off

Page 23

Expression map

red = up-regulated

green = down-regulated
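One common way to decide which color a gene gets on an expression map is the log2 ratio of its signal in the two samples. The Python sketch below assumes positive, already normalized intensities and a cutoff of a twofold change (`min_log2=1.0`); both the cutoff and the example values are assumptions, not figures from the slides.

```python
import math


def regulation(treatment, control, min_log2=1.0):
    """Label a gene by the log2 ratio of treatment to control signal.

    Both intensities must be positive. min_log2=1.0 means a twofold change.
    """
    log_ratio = math.log2(treatment / control)
    if log_ratio >= min_log2:
        return "up-regulated (red)"
    if log_ratio <= -min_log2:
        return "down-regulated (green)"
    return "unchanged"


print(regulation(400.0, 100.0))
print(regulation(100.0, 400.0))
print(regulation(120.0, 100.0))
```

Using the logarithm makes a fourfold increase and a fourfold decrease symmetric around zero (+2 and -2), which is why expression maps are usually drawn on a log scale.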

Page 24

Question

Microarray technology directly involves:

a. PCR

b. DNA sequencing

c. Hybridization

d. RFLP detection

e. None of the above

Page 25

Protein-protein interactions

• ChIP (chromatin immunoprecipitation)

• Yeast two hybrid

• Bimolecular Fluorescence Complementation (BMFC)

Page 26

ChIP and ChIP-chip

Page 27

Yeast two hybrid

Page 28

Citovsky et al., 2006

Bimolecular Fluorescence Complementation (BMFC)

Page 29

Reverse genetics

• Gene knockouts

• RNAi

• Overexpression

• Altered expression

Page 30

Summary

• DNA Sequencing and the rise of genomics

• Annotation of genome sequence
  – Comparative genomics
  – Functional genomics

• Protein-protein interactions

• ESTs

• Reverse genetics