genome composition
DESCRIPTION
Genome Composition. Dan Graur. Genome Composition in Bacteria. Carsonella ruddii has a very low GC content. The selectionist explanation views GC content as an adaptation. G:C pairs are more stable than A:T pairs. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/1.jpg)
1
Genome CompositionGenome Composition
Dan GraurDan Graur
![Page 2: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/2.jpg)
2
Genome Genome Composition in Composition in
BacteriaBacteria
![Page 3: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/3.jpg)
Carsonella ruddii has a very low GC content.
![Page 4: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/4.jpg)
4
![Page 5: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/5.jpg)
5
The selectionist explanation views GC content as an adaptation.
G:C pairs are more stable than A:T pairs.
Preferential usage of amino acids encoded by GC-rich codons (e.g., ala and arg) and avoidance of amino acids encoded by GC-poor codons (e.g., ser and lys).
T-T dimers are sensitive to UV radiation.
NoNoempiricalempiricalevidenceevidence
![Page 6: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/6.jpg)
6
The mutationist explanation
Rate of substitution G/C T/A is Rate of substitution T/A G/C is
Noboru SueokaUniversity of Colorado
![Page 7: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/7.jpg)
7
at equilibrium: PGC =ν
ν +μ
![Page 8: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/8.jpg)
8
GC mutational pressure: μν
=1−PGCPGC
![Page 9: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/9.jpg)
9
μν
=3 ⇒ 25% GC
μν
=1 ⇒ 50% GC
μν
=.33 ⇒ 75% GC
Mycoplasma capricolum
Escherichia coli
Micrococcus luteus
![Page 10: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/10.jpg)
10
![Page 11: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/11.jpg)
11
Differences in the way the leading and lagging strands of DNA are replicated can result in strand-dependent mutation patterns.
The expectation under no-strand-bias conditions is
fA = fT and fC = fG
![Page 12: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/12.jpg)
12
Deviations from equal Deviations from equal
mutation rates between the mutation rates between the
two strands are quantified by two strands are quantified by
the the skewskew..
![Page 13: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/13.jpg)
13
SX=Y =fX −fYfX +fY
The skew is a measure of inequality between the frequencies of nucleotides X and Y on a strand.
![Page 14: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/14.jpg)
14
If there are no violations of the no-strand-bias conditions:
SX=Y
=0
![Page 15: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/15.jpg)
15
Skew values are calculated for sliding windows of predetermined lengths, and are plotted on a skew diagram.
![Page 16: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/16.jpg)
16
Bacillus subtilis
chirochorechirochore chirochorechirochore
![Page 17: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/17.jpg)
17
![Page 18: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/18.jpg)
18
Chlamidia trachomatis
![Page 19: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/19.jpg)
19
Compositional Properties of Eukaryotic Genomes
![Page 20: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/20.jpg)
20
GC content of bacterial genomes ranges from ~24% to ~74%
Intergenomicvariability
GC content of vertebrate genomes ranges from ~40% to ~45%
![Page 21: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/21.jpg)
21
TTGACCGATGACCCCGGTTCAGGCTTCACCACAGTGTGGAACGCGGTCGTCTCCGAACTTAACGGCGACCCTAAGGTTGACGACGGACCCAGCAGTGATGCTAATCTCAGCGCTCCGCTGACCCCTCAGCAAAGGGCTTGGCTCAATCTCGTCCAGCCATTGACCATCGTCGAGGGGTTTGCTCTGTTATCCGTGCCGAGCAGCTTTGTCCAAAACGAAATCGAGCGCCATCTGCGGGCCCCGATTACCGACGCTCTCAGCCGCCGACTCGGACATCAGATCCAACTCGGGGTCCGCATCGCTCCGCCGGCGACCGACGAAGCCGACGACACTACCGTGCCGCCTTCCGAGAGATTGATGACAGCGCTGCGGCACGGGGCGATAACCAGCACAGTTGGCCAAGTTACTTCACCGAGCGCCCGCACAATACCGATTCCGCTACCGCTGGCGTAACCAGCCTTAACCGTCGCTACACCTTTGATACGTTCGTTATCGGCGCCTCCAACCGGTTCGCGCACGCCGCCGCCTTGGCGATCGCAGAAGCACCCGCCCGCGCTTACAACCCCCTGTTCATCTGGGGCGAGTCCGGTCTCGGCAAGACACACCTGCTACACGCGGCAGGCAACTATGCCCAACGGTTGTTCCCGGGAATGCGGGTCAAATATGTCTCCACCGAGGAATTCACCAACGACTTCATTAACTCGCTCCGCGATGACCGCAAGGTCGCATTCAAACGCAGCTACCGCGACGTAGACGTGCTGTTGGTCGACGACATCCAATTCATTGAAGGCAAAGAGGGTATTCAAGAGGAGTTCTTCCACACCTTCAACACCTTGCACAATGCCAACAAGCAAATCGTCATCTCATCTGACCGCCCACCCAAGCAGCTCGCCACCCTCGAGGACCGGCTGAGAACCCGCTTTGAGTGGGGGCTGATCACTGACGTACAACCACCCGAGCTGGAGACCCGCATCGCCATCTTGCGCAAGAAAGCACAGATGGAACGGCTCGCGGTCCCCGACGATGTCCTCGAACTCATCGCCAGCAGTATCGAACGCAATATCCGTGAACTCGAGGCCGAGGAATTCACCAACGACTTCATTAACTCGCTCCGCGATGACCGCAAGGTCGCATTCAAACGCAGCTACCGCGACGTAGACGTGCTGTTGGTCGACGACATCCAATTCATTGAAGGCAAAG
Interspecific variation among vertebrate genomes is low. However, vertebrates seem to have a much more complex intragenomic compositional organization (internal structure) than prokaryotic genomes.
![Page 22: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/22.jpg)
22
TTGACCGATGACCCCGGTTCAGGCTTCACCACAGTGTGGAACGCGGTCGTCTCCGAACTTAACGGCGACCCTAAGGTTGACGACGGACCCAGCAGTGATGCTAATCTCAGCGCTCCGCTGACCCCTCAGCAAAGGGCTTGGCTCAATCTCGTCCAGCCATTGACCATCGTCGAGGGGTTTGCTCTGTTATCCGTGCCGAGCAGCTTTGTCCAAAACGAAATCGAGCGCCATCTGCGGGCCCCGATTACCGACGCTCTCAGCCGCCGACTCGGACATCAGATCCAACTCGGGGTCCGCATCGCTCCGCCGGCGACCGACGAAGCCGACGACACTACCGTGCCGCCTTCCGAGAGATTGATGACAGCGCTGCGGCACGGGGCGATAACCAGCACAGTTGGCCAAGTTACTTCACCGAGCGCCCGCACAATACCGATTCCGCTACCGCTGGCGTAACCAGCCTTAACCGTCGCTACACCTTTGATACGTTCGTTATCGGCGCCTCCAACCGGTTCGCGCACGCCGCCGCCTTGGCGATCGCAGAAGCACCCGCCCGCGCTTACAACCCCCTGTTCATCTGGGGCGAGTCCGGTCTCGGCAAGACACACCTGCTACACGCGGCAGGCAACTATGCCCAACGGTTGTTCCCGGGAATGCGGGTCAAATATGTCTCCACCGAGGAATTCACCAACGACTTCATTAACTCGCTCCGCGATGACCGCAAGGTCGCATTCAAACGCAGCTACCGCGACGTAGACGTGCTGTTGGTCGACGACATCCAATTCATTGAAGGCAAAGAGGGTATTCAAGAGGAGTTCTTCCACACCTTCAACACCTTGCACAATGCCAACAAGCAAATCGTCATCTCATCTGACCGCCCACCCAAGCAGCTCGCCACCCTCGAGGACCGGCTGAGAACCCGCTTTGAGTGGGGGCTGATCACTGACGTACAACCACCCGAGCTGGAGACCCGCATCGCCATCTTGCGCAAGAAAGCACAGATGGAACGGCTCGCGGTCCCCGACGATGTCCTCGAACTCATCGCCAGCAGTATCGAACGCAATATCCGTGAACTCGAGGCCGAGGAATTCACCAACGACTTCATTAACTCGCTCCGCGATGACCGCAAGGTCGCATTCAAACGCAGCTACCGCGACGTAGACGTGCTGTTGGTCGACGACATCCAATTCATTGAAGGCAAAG
How are nucleotides distributed along the genome?Uniform? Patchy? Clines?
![Page 23: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/23.jpg)
23
“When vertebrate genomic DNA is randomly sheared into fragments 30-100 kb in size and the fragments are separated by base composition, the fragments cluster into a small number of classes distinguished from each other by their GC content. Each class is characterized by bands of similar, but not identical, base compositions.”
(Macaya et al. 1976; Thiery et al. 1976; Bernardi et al. 1985)
Equilibrium centrifugation in Cs2SO4 density gradient
![Page 24: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/24.jpg)
24
carp
![Page 25: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/25.jpg)
25
The Isochore Theory - Giorgio Bernardi
carp
![Page 26: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/26.jpg)
26
![Page 27: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/27.jpg)
27
Isochores do not merit the prefix “iso.”
Lander et al. (2001)
![Page 28: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/28.jpg)
28
Post genomic era (2001)
Objections against the isochore theory:Objections against the isochore theory:
““We can rule out a strict notion of isochores as We can rule out a strict notion of isochores as compositionally homogeneous.”compositionally homogeneous.” Lander et al. (2001) Lander et al. (2001)
““There are no isochores in chromosomes 21 and 22.”There are no isochores in chromosomes 21 and 22.” Häring and Kyper (2001)Häring and Kyper (2001)
Defense of the isochore theory:Defense of the isochore theory:
““The conclusion of the authors that ‘isochores’ are The conclusion of the authors that ‘isochores’ are not ‘strict isochores’ is correct, however isochore are not ‘strict isochores’ is correct, however isochore are fairlyfairly homogeneous regions.” homogeneous regions.” Bernardi (2001) Bernardi (2001)
![Page 29: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/29.jpg)
29
![Page 30: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/30.jpg)
30
In search of isochores…
Questions: Do isochores exist? Is the isochore theory a useful (or practical)
concept?
![Page 31: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/31.jpg)
31
Segmentation Models
• Assumption: Sequences can be partitioned into a number of segments each with a characteristic GC content.
• Each segment has a certain degree of internal homogeneity (or similarity).
![Page 32: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/32.jpg)
32
In search of isochores…
Methodology: Define rigorously 6 attributes of isochores and
of the isochore theory as applied to humans Test attributes against the human genome
data
![Page 33: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/33.jpg)
33
Attributes of isochores
A1. Distinguishability: An isochore is a DNA segment that has a characteristic GC content that differs significantly from the GC content of adjacent isochores.
A2. Homogeneity: An isochore is more homogeneous in its composition than the chromosome on which it resides.
A3. Minimum length: The length of an isochore exceeds a certain cutoff value. In the literature, the most commonly mentioned value is 300 Kb.
![Page 34: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/34.jpg)
34
Attributes of the isochore theory in humans
A4. Genome coverage: The overwhelming majority of the human genome consists of segments abiding by A1-A3. Non-isochoric DNA takes up only a small fraction of the genome.
![Page 35: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/35.jpg)
35
A5. Isochore families: The human genome comprises of five isochore families, each described by a particular Gaussian distribution of GC content.
Attributes of the isochore theory in humans
![Page 36: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/36.jpg)
36
A6. Isochore assignment into families: It is possible to classify each isochore into its isochore family based solely on its compositional properties.
Practicality of the isochore theory
![Page 37: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/37.jpg)
37
Segment length distribution
The fitted regression line
(solid line) indicates that
the tail of the distribution
exhibits power-law decay
with an exponent of –2.38.
P L–2.38
![Page 38: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/38.jpg)
38
Power laws everywhere!
![Page 39: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/39.jpg)
39
Isochore families
1
2
3
4
Most parsimonious Gaussian fit to putative isochores
![Page 40: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/40.jpg)
40
Homogeneous “isochores” in vertebrates
![Page 41: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/41.jpg)
41
Assignment into families
Classification errors reach values of 70%. Only a minute fraction of segments can be classified with an expected error under 5%.
![Page 42: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/42.jpg)
42
Summary
(A1) Distinguishability
(A2) Homogeneity
(A3) Minimum length X
(A1) Genome coverage
(A2) Isochore families families
(A3) Isochore assignment into families X
![Page 43: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/43.jpg)
43
Conclusion:
The isochore theory may have reached the limits of its usefulness as a description of genomic compositional structures.
![Page 44: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/44.jpg)
44
![Page 45: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/45.jpg)
45
![Page 46: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/46.jpg)
46
As of December 2004
17 genetic codes
11 mitochondrial
5 nuclear
1 nuclear + mitochondrial
![Page 47: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/47.jpg)
47
Lock & Key Hypothesis
![Page 48: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/48.jpg)
48
Frozen accidents
EvolutionaryDead Ends
![Page 49: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/49.jpg)
49
![Page 50: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/50.jpg)
50
The codon-The codon-capture capture hypothesishypothesis
Thomas Jukes
![Page 51: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/51.jpg)
51
AAA = lysine
Universalgeneticcode
![Page 52: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/52.jpg)
52AAA = asparagine
Echinodermata
![Page 53: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/53.jpg)
53
Hemichordata
AAA = unassigned
![Page 54: Genome Composition](https://reader034.vdocument.in/reader034/viewer/2022052701/56814086550346895dac0e6f/html5/thumbnails/54.jpg)
54