mutations and epimutations

34
Mutations and Epimutations A story of two cultivars and their children. Matteo Pellegrini

Upload: audra-rodriguez

Post on 03-Jan-2016

25 views

Category:

Documents


0 download

DESCRIPTION

Mutations and Epimutations. A story of two cultivars and their children. Matteo Pellegrini. Nipponbare and 93-11. Nipponbare : Oryza sativa japonica Primarily Japan, China, Indonesia Agronomic differences: Days to heading. 93-11 Oryza sativa indica India, Bangladesh, Nepal, China - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Mutations and  Epimutations

Mutations and Epimutations

A story of two cultivars and their children.Matteo Pellegrini

Page 2: Mutations and  Epimutations

Nipponbare and 93-11

• Nipponbare:– Oryza sativa japonica

• Primarily Japan, China, Indonesia

• Agronomic differences:• Days to heading

• 93-11– Oryza sativa indica

• India, Bangladesh, Nepal, China

• Submerged growth

• Agronomic differences:• Seed fertility• Long grain• Taller (83 cm)

Page 3: Mutations and  Epimutations

Why Study Crosses?

• Crosses of Indica and Japonica are often sterile

• Show hybrid vigor in agronomic traits

Page 4: Mutations and  Epimutations

Overview

• Identify SNPs between ecotypes.– SNP generation

• Identify epiMutations between ecotypes.– Identify methyl-inheritance

• Identify allele-specific expression• Identify RNA editing

P

F1

NPB 9311

• 2 rice ecotypes: Nipponbare and 93-11• Generated BS-seq data for NPB, 93-11, and 2 reciprocal crosses

Page 5: Mutations and  Epimutations

Detecting Cytosine MethylationA, Cunmethylated, Cmethylated, G, T ?

… m mm …… ACCCGTACCCGATTAG …

… ATCTGTATCCGATTAG …

Apply sodium bisulfite and amplify: Unmethylated C → T, methylated C (and A/G/T) unchanged Try to align new sequence to known reference; compare

Page 6: Mutations and  Epimutations

Mapping Approach: BS Seeker

Chen et al (2010) BMC Bioinformatics

BS reads are C/T converted, so normal aligners are not applicable

Three letter alignment:

AATCGTA

CTAATCGCAGG

BS read:

Ref. genome:

TTAATTGTAGG

AATTGTA

Convert C to T

AATTGTATTAATTGTAGG

Bowtie mapping

CTAATCGCAGGAATCGTA

Restore to 4 letters

m u

Compare alignments

Page 7: Mutations and  Epimutations

7

Methylation levels at single-base resolution

Calculate methylation level at each covered cytosine Methylation level= #C/(#C+#T)

5’--attgagacatcctagcgcgtggtgacaataata—-3’ttttagcgcgtggtg

cattttagtgcgtgg

tagtgcgtggtg

3/(3+0)=100%

1/(1+2)=33.3%

Ref. genome:

Page 8: Mutations and  Epimutations

Workflow

• Alignments– BS-Seeker mapping of NPB and 9311 samples to NPB reference

genome.– Maps 9311 genome to NPB coordinates

• Parent genomes– Each read generates a small implied sequence fragment.– Use this to generate a parent genome.

• F1 read matching• Map reads to NPB reference genome to get location.• Compare each read to NPB and 9311 parent genomes and

determine better match.

Page 9: Mutations and  Epimutations

SNP

parent1

parent2

Methylation level at CG sites

Methylation level at CG sites

BS-seq

parent1/parent2

Detecting Alelle-Specific methylation

Page 10: Mutations and  Epimutations

Library statisticsMethyl-Seq Reads Mapped % Mapped Coverage

NPB 298M 134M 45% 17.58

93-11 157M 74M 47% 10.14

NPB x 93-11 594M 279M 47% 20.04

-NPB 6.51

-93-11 6.08

93-11 x NPB 543M 236M 43% 25.77 -NPB 7.45

-93-11 6.59

RNA-Seq

NPB 42M 17M 42%

93-11 42M 13M 31%

NPB x 93-11 48M 12M 26%

-NPB

-93-11

93-11 x NPB 43M 11M 25%

-NPB

-93-11

Page 11: Mutations and  Epimutations

Identifying SNPs

• If sites: – > 3 reads/strand– > 90% agreement within ecotype– Strands agree with each other (compensate for Cs).– (obviously) disagree with each other.

• Will miss indels, dups, inversions, other chr rearrangements.

• Will miss long runs of SNPs ( > 3 within ~55 bp) (BS-seeker limit)

Page 12: Mutations and  Epimutations

SNPs - NPB vs 93-11• 1,209,456 mutations /

306,106,830 sites with mutual base calls

• ~ 1/253 bases

• Mostly (73%) C->T (or G->A if C->T on opposite strand) or T->C & A->G if in other 93-11

A C G T

A 86,677,300

42,553

216,135

42,513

C 43,336

65,771,387

34,146

226,045

G 226,045

34,146

65,771,387

43,336

T 42,513

216,135

42,553

86,677,300

Page 13: Mutations and  Epimutations

SNPs - NPB vs F1 (9N-NPB)• 12 mutations

• Are these real or false?

• Similar numbers amongst all F1 comparisons

A C G T

A 3,188,414

-

3

-

C -

2,695,005

-

3

G 2

-

2,548,205

-

T -

4

-

3,253,196

Page 14: Mutations and  Epimutations

Identifying epimutations

• Use the binomial dist. to build min, max, and mean pct methylation at each C.

• Confidence intervals at 5% are min, max

As # of reads ^, interval size v

Reads

Min/m

ax

Page 15: Mutations and  Epimutations

Identifying epimutations (cont)

• Called different if:– mean(sample1) <

min(sample2) & mean(sample2) > max(sample1)

Page 16: Mutations and  Epimutations

1 in 300 CG sites spontaneously mutate across one generation

Epimutation rate

Page 17: Mutations and  Epimutations

Epimutation clusters

9311 parent

NPB parent

NPB cross

NPB cross

9311 cross

9311 cross

Page 18: Mutations and  Epimutations

Epimutation clusters II

9311 parent

NPB parent

NPB cross

NPB cross

9311 cross

9311 cross

Page 19: Mutations and  Epimutations

Epimutations are enriched in regions where parents differ

Half of the epimutations between parents and crosses occur at sites where parents differ

Page 20: Mutations and  Epimutations

Epimutations (continued)

• Epimutations within genes– 498 genes were significantly enriched for

epimutations– GO Term x-ecotypes indicates: ATP synthesizing

related activity (ATP synthesis coupled proton transport, hydrogen transport, ion transmembrane transport, etc).

Page 21: Mutations and  Epimutations

Expression

• Many genes (~7800/25640) are differentially expressed between ecotypes.

• GO term: choroplast related terms, response to cadmiumion.

Page 22: Mutations and  Epimutations

Expression cont.

• Across generations, only 78 genes differentially expressed

• Of these only 2 were differentially expressed in the parents

Page 23: Mutations and  Epimutations

Allele Specific Expression

• 681 examples of allele specific expression

• Partially explain hybrid vigor?

NPB parent

NPB cross

9311 parent

9311 cross

NPB cross

9311 cross

Page 24: Mutations and  Epimutations

Allele-Specific Genes Accumulate Mutations

SNP Density

All genes Allele-specific genes

And are also enriched for differentially methylated sites

Page 25: Mutations and  Epimutations

Allele-specific Expression

cont.

And are also enriched for differentially methylated sites

Page 26: Mutations and  Epimutations

RNA Editing

• Cytidine deamination : C to U

• Adenosine deaminase: A to I (G)

Page 27: Mutations and  Epimutations

How Widespread• Recent studies indicate that

RNA editing may be more widespread than originally thought

• Others have disputed this claim (Schrider et al, PlosOne)

• In plants RNA editing is thought to take place in the mitochondria and plastids

• Is there editing in nuclear genes?

Science. 2011 Jul 1;333(6038):53-8.

Page 28: Mutations and  Epimutations

RNA Editing in Rice

NPB - RNA

A C G T

NPB - DNA

A 5535334 6907 3063 2219

C 4758 4436282 4279 7054

G 3777 2437 4382636 4213

T 2210 3227 6949 5577323

Initially we found lots of examples….

Page 29: Mutations and  Epimutations

On Closer Inspection…

Alignments are often off by one or more bases at splice sites

Page 30: Mutations and  Epimutations

But a Few Real Ones Remain?

Page 31: Mutations and  Epimutations

But more Filtering Should be done…

Position of edit site along read

Page 32: Mutations and  Epimutations

Current Numbers

Page 33: Mutations and  Epimutations

Conclusions

• Epimutation rates are one in 300 cytosines across one generation– Clusters of epimutations are present– Are enriched in sites where parental epigenomes differ

• Allele-specific expression is widespread and associated with– Increased SNP densities– Higher differential methylation

• Find some evidence for RNA editing but…

Page 34: Mutations and  Epimutations

Acknowledgements

–Krishna Chodavarapu (Pellegrini Lab)–Suhua Feng (Steve Jacobsen Lab)–Blake Myers, Guo-liang Wang, Yulin Jia