why this paper causal genetic variants at loci contributing to complex phenotypes unknown rat/mice...

32

Upload: mckenna-bateson

Post on 14-Dec-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant
Page 2: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Why this paper

• Causal genetic variants at loci contributing to complex phenotypes unknown

• Rat/mice model organisms in physiology and diseases

• Relevant to our work– Integration of GWAS of different traits– Interpretation of human GWAS

Page 3: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Advantages of genetic mapping using heterogeneous stocks

• Accuracy of QTL mapping to Mb resolution

• WGS imputation from progenitor genomes

• Haplotypes well defined– Single SNP vs haplotype (spatial) association

– Difficult in humans, large #of rare/unknown haplotypes

Page 4: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Design

Sequencing

AJ AKR Balb C3H C57 DBA IS RIII

HS

Random Breeding

HS Generation > 60

Reconstruction of rat genomes as mosaic of founder haplotypesbased on 265,551 SNPs (“sequence imputation”)

Page 5: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Genotypes

• 1,407 phenotyped NIH-HS animals• 198 parents (~14.2 litter size)• RATDIV genotyping array (13 inbred strains)

– 803,485 SNPs– 560,000 segregating in NIG-HS– 265,551 used for haplotype reconstruction

• Sequencing of founder samples– Number ?– 22x coverage

Page 6: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Phenotypes• 160 measurements

Page 7: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Sequencing

• 7.2M SNP• 633,000 indels• 44,000 structural variants

Page 8: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Sequencing• False Positives

• 2.7% SNP• 2.2% indels• 16.7% structural variants

• False Negatives• 17.2% SNPs• 41.4% indels• 65% structural variants

Page 9: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Nucleotide diversity in NIH-HS progenitors

• Similar diversity between strains

Page 10: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Nucleotide diversity in NIH-HS progenitors

• Similar diversity between strains• 29% SNP private to particular strain

– Unique haplotypes relatively common• Regions of low diversity are small (~400 kb)

Page 11: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Genotyping

Page 12: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

QTL mapping

• Reconstruction of rat genomes as mosaics of founder haplotypes– R HAPPY

Page 13: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Svenson K L et al. Genetics 2012;190:437-447

Page 14: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

QTL mapping

• Reconstruction of rat genomes as mosaics of founder haplotypes– R HAPPY. – Mixed Linear Model (EMMA, normal phenotypes)

– Resample model averaging (BAGPHENOTYPE,non-normal)• Non-parametric bootstrap aggregation (bagging)

Haplotype from strain s at locus l

random effectExpected number of haplotypes

Page 15: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Haplotype

Strain A B C------------------------------y1 = 2 0

0y2 = 0 2

0y3 = 0 1

1

QTL mapping

Page 16: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

QTL results

• 355 QTLs for 122 phenotypes (avg. 2.9)

Page 17: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

QTL results

Page 18: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

QTL results

Page 19: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Haplotype (1)

Strain A B C------------------------------y1 = 2 0

0y2 = 0 2

0y3 = 0 1

1 Sequence variants

A BC

Strain CC CC TT------------------------------SDP 0 0

1

Merge analysesStrain distribution pattern (SDP)

ABC

ABC

= 0 0 1

= 1 0 0

Page 20: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Haplotype (1)

Strain A B C------------------------------y1 = 2 0

0y2 = 0 2

0y3 = 0 1

1

Sequence variants

Strain CC CC TT------------------------------y1 = 2 0

0y2 = 0 2

0y3 = 0 1

1Merge model (2)

Strain C T------------------------------y1 = 2 0y2 = 2 0y3 = 1 1

• (2) Sub model (1)• if QTL == single variant

• R2(2)~R2(1)• [logPmerge – logPhaplotype] > 0

Merge analyses

Page 21: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Merge analyses

• 343 QTLs– 131 (38%) at least 1 candidate variant

• Increased resolution– 90% of variants ruled out, d <0– Candidates in coding regions affecting protein

structure more likely to be causal – Eliminates candidate genes that are distant from

candidate variant

Page 22: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Merge analyses (examples)

• 3 QTL for patelet aggregation

Page 23: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Merge analyses (examples)

• Candidate variant in single gene

Page 24: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Merge analyses (examples)

• Candidate variant in coding region

Page 25: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Merge analysis

• Single variants rarely account for QTL effects– 212 (68%) QTL had no candidate variant

• Possible reasons– Causative variants missed in sequencing– QTL mapping biased towards QTL without

candidate variants – Merge underestimates statistical significance– Multiple causal variants

Page 26: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Merge analysis

– Causative variants missed in sequencing• Simulation of all possible SDPs for di-tri-allelic SNPs and

merge analysis• 168 (49%) would still have no causative variant

– Simulation different QTL architectures• Single variants• Multiple variants within gene, multiple variants linked

loci• Haplotype effects/ no individual variants

Page 27: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Merge analysis– Simulation of causal variants

Page 28: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Merge analysis

• Haplotype mapping overestimates QTL without causative variant (?)

• Merge analysis underestimates number of QTL without causative variant (?)– Multiple causative variants

Page 29: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant
Page 30: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Concordance between species• 38 measures common between NIG-HS and mice HS• Orthologous rarely contribute to the same

phenotype

Page 31: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Concordance between species• 38 measures common between NIG-HS and mice HS

• Orthologous rarely contribute to the same phenotype

• KEGG pathways for QTL associated genes in rat in mice only significantly enriched for “proportion of B cells”)

Page 32: Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant

Discussion• Combining sequence with mapping data can identify candidate

loci• 50% of QTL can not be attributed to single causal variant

– Multiple causal variants, more complex models required– Rat QTL similar to Trans eQTL

• Not possible to accurately asses overlap between species– limited power of pathway analysis– limited power from comparing phenotypes (within species?)– Variants in orthologous genes rarely contribute to same phenotype