1 cloning, genomes, and proteomes chromosome : component in the cell that contains genetic...

30
1 Cloning, genomes, and proteomes • Chromosome: Component in the cell that contains genetic information • Plasmid: Circular DNA molecule that replicates separately from the host chromosome • Genome: The complete set of genes for an organism • Proteome: The entire protein complement encoded by an organism's genome • DNA cloning: Cutting out a piece of DNA from the genome and inserting into a plasmid vector. • Recombinant DNA: A DNA molecule comprising covalently linked segments from two or more sources Definitions:

Upload: barry-golden

Post on 22-Dec-2015

218 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

1

Cloning, genomes, and proteomes

• Chromosome: Component in the cell that contains genetic information

• Plasmid: Circular DNA molecule that replicates separately from the host chromosome

• Genome: The complete set of genes for an organism• Proteome: The entire protein complement encoded by an

organism's genome• DNA cloning: Cutting out a piece of DNA from the genome

and inserting into a plasmid vector. • Recombinant DNA: A DNA molecule comprising covalently

linked segments from two or more sources

Definitions:

Page 2: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

2

Genomes and Chromosomes

From a lysed E. coli cell:

Fig. 24-4

• The E. coli genome consists of a single chromosome which is a double-stranded circular DNA molecule with 4,639,221 base pairs.

• E. coli also contain smaller circular DNA molecules that are free in the cytosol (plamids); see white arrows in figure.

Page 3: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

3

Genomes and Chromosomes

From a human:

Fig. 24-5

• The human genome consists of 22 chromosomes (times 2) plus an X and a Y, or two X chromosomes (46 total).

• Eukaryotic chromosomes are complex, consisting of DNA and protein.

• The human genome contains 700 times more DNA than the E. coli genome: 3 x 109 (3 billion) base pairs.

Page 4: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

4

Genomes

Mammalian genomes Sequenced and in progressHuman (3000 Mb; complete)Mouse (2500 Mb; complete)Cow (3000 Mb)Armadillo (3000 Mb)Lesser hedgehogAfrican elephant (3000 Mb)OpossumRabbit (3500 Mb)Chimp (3100 Mb)Dog (2400 Mb)Rat (2800 Mb)

•Complete genome sequencing for many organisms (mostly microorganisms) has been accomplished.

•The resulting field of genomics concerns the study of genes on a cellular scale.

•This has been made possible through technological advances in DNA sequencing…

Page 5: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

5

Genomes

Berg Fig. 6.6

The first genome of a free-living organism sequenced was that of Haemophilus influenzae in 1995. The genome encodes more than 1700 proteins and 70 RNA molecules. The functions of ~ 50% of the proteins were determined by comparison with other species.

Page 6: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

6

Genomes

Page 7: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

7

Human genome project

Fig. 9-18

• The human genome project was initiated in 1989 with the goal of sequencing the 3 billion base-pair human genome in 15 years.

• The National Institutes of Health and the Department of Energy instituted the joint project. 20 centers contributed.

• There was great skepticism that this could be accomplished in a reasonable amount of time.

• In 1998, the company Celera genomics formed to sequence the human genome.

• Celera and the HGP concurrently announced the human genome draft in 2001. The genome was completed in 2004.

Page 8: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

8

Human genome project results

Fig. 9-19

• Estimated 27,894 genes

• ~1.1% in exons.

• 1/1000 bp differ between individual humans: SNPs (single nucleotide polymorphisms

• From SNPs arise human variety.

Page 9: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

9

Human genome project results

Fig. 9-19

• <1% of SNPs are expected to impact protein function.

• Thus, thousands of genetic variations contribute to human diversity (not millions!)

Page 10: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

10

Human genome project - gene function

title:

Venter et al., Science (2001)291, 1304-1351.

Page 11: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

11

Human genome project

Fruit fly: 13,000 genes Human: 28,000 genes.

The surprisingly small number of genes in the human genome (~ 100,000 expected; < 30,000 identified) was a major surprise from the project.

Page 12: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

12

Human genome project

• Gene regulation, modification (i.e. methylation)• Chromosomal modifications• Location, quantity, timing of transcription• Tissue-specific protein expression• Roles (regulatory, other) of intronic DNA• RNA splicing• RNA roles in gene expression• RNA editing (changes made to mRNA)• Translational control (at ribosome)• Alterations in protein-protein interactions

The modest number of genes indicates we must look elsewhere to explain the human complexity.

Venter et al., Science (2001)291, 1304-1351.

Page 13: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

13

Proteomics

• There remain thousands of proteins in each eukaryotic cell about which we know nothing.

• Characterizing the proteome is a much larger task than the genome. This links genes to function:

• Phenotypic function: effects of a protein on an entire organism

• Cellular function: the network of interactions with other proteins in the cell

• Molecular function: the biochemical activity of a protein

Proteomics is the determination and analysis of the complete complement of proteins expressed by a genome.

Page 14: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

14

Proteomics strategies

Fig. 9-20

• Comparative genomics: Compare with genes and proteins of known function. Uses sequence and structural relationships. The increasing availability of genomics data greatly aids this approach.

•Orthologs: Genes of different species but possessing a clear sequence and functional relationship to each other.

•Paralogs: Genes within an organism with a sequence and structural relationship.

Page 15: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

15

Proteomics strategies

Fig. 3-22

For genes with no identifiable relationships to known genes, other approaches need to be applied.

2-D gel electrophoresis and mass spectrometry: Analyze the appearance or particular proteins from different tissues, as a function of development, or from tissues treated in different ways.

pI

mole

cu

lar w

eig

ht

Page 16: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

16

DNA microarray

Fig. 9-22

Page 17: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

17

DNA microarray

Fig. 9-22

cDNA: complementary DNA (prepared from mRNA).

Page 18: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

18

DNA microarray

Fig. 9-22

Page 19: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

19

DNA microarray

Fig. 9-22

Page 20: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

20

Gene Expression Profile

Fig. 9-23

Each spot in this microarray contains DNA from one of the 6,200 genes in the yeast genome. The different colors indicate conditions under which the genes are expressed. Here, green spots represents mRNAs abundant early in development, red RNAs are abundant later in development.

Page 21: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

21

Probing Protein Interactions

Fig. 9-25

Analysis of protein-protein interactions also can reveal important information about a protein's function and its role in the cell.

The yeast two-hybrid system allows for detection of protein-protein interactions by bringing together the DNA binding domain and the activation domain of the yeast Gal4 protein via interaction of two proteins and expression of a reporter gene.

Page 22: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

22

Proteomics strategies

Fig. 9-25

Analysis of protein-protein interactions also can reveal important information about a protein's function and its role in the cell.

The two fusions are created in separate yeast strains which are mated. The mated mixture is grown under conditions on which the yeast cannot survive unless the reporter gene is expressed. Surviving colonies have interacting protein fusion pairs.

Page 23: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

23

-omics …is the study of…:

• Genomics: the full complement of an organism's genes.• Proteomics: the full complement of an organism's proteins:• Transcriptomics: an organism's RNA transcribed from its

DNA• Metabonomics: an organism's metabolite profiles• Structural genomics: the 3-D structures of an organism's

proteins and RNAs• Pharmacogenomics: the interaction between genes and

gene products and medications. How an individual's genetic inheritance affects the body's response to drugs.

• One important strategy in proteomics, structural genomics, and related fields is gene expression. This is most commonly done in a bacterial host such as E. coli.

Page 24: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

24

Gene cloning and expression plasmids

A common E. coli cloning plasmid, pBR322:

Fig. 9-4

• Ori: where plasmid replication is initiated by cellular enzymes. This is required to propagate the plasmid within the cell.

• tetR and ampR: genes that confer resistance of the antibiotics tetracycline and ampicillin.

• EcoRI, BamHI, … unique sequences that are targets for endonucleases. Provide sites for cutting the plasmid.

Page 25: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

25

Use of restriction endonucleases

Fig. 9-3

• Endonucleases are enzymes that cleave DNA at a particular recognition sequence.

• They may cleave to leave "sticky ends" or "blunt ends."

Page 26: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

26

Use of restriction endonucleases

• A 4 bp sequence will occur once every 44 (256) bp.

• A 6 bp sequence will occur once every 46 (4,096) bp.

• An 8 bp sequence will occur once every 65,000 bp.

Page 27: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

27

Cloning

Fig. 9-1

Page 28: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

28

Cloning

Fig. 9-1

• 1• 2• 3

Page 29: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

29

Cloning

Fig. 9-1

Page 30: 1 Cloning, genomes, and proteomes Chromosome : Component in the cell that contains genetic information Plasmid: Circular DNA molecule that replicates separately

30

Cloning

Fig. 9-1