quantitative trait loci, qtl an introduction to quantitative genetics and common methods for mapping...

57
Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Upload: randolf-melton

Post on 28-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Quantitative Trait Loci, QTL

An introduction to quantitative geneticsand common methods for mapping of

loci underlying continuous traits:

Page 2: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Why study quantitative traits?

• Many (most) human traits/disorders are complex in the sense that they are governed by several genetic loci as well as being influenced by environmental agents;

• Many of these traits are intrinsically continuously varying and need specialized statistical models/methods for the localization and estimation of genetic contributions;

• In addition, in several cases there are potential benefits from studying continuously varying quantities as opposed to a binary affected/unaffected response:

Page 3: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

For example:

• in a study of risk factors the underlying quantitative phenotypes that predispose disease may be more etiologically homogenous than the disease phenotype itself;

• some qualitative phenotypes occur once a threshold for susceptibility has been exceeded, e.g. type 2 diabetes, obesity, etc.;

• in such a case the binary phenotype (affected/unaffected) is not as informative as the actual phenotypic measurements;

Page 4: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

A pedigree representation

Page 5: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Variance and variability

• methods for linkage analysis of QTL in humans rely on a partitioning of the total variability of trait values;

• in statistical theory, the variance is the expected squared deviation round the mean value,

• it can be estimated from data as:

• the square root of the variance is called the standard deviation;

n

i i

Y

Y

yyn

s

YEYV

YE

1

22

2

;)(1

];)[()(

:)(

Page 6: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

A simple model for the phenotype

Y = X + ewhere• Y is the phenotypic value, i.e. the trait value;• X is the genotypic value, i.e. the mean or

expected phenotypic value given the genotype;• e is the environmental deviation with mean 0.• We assume that the total phenotypic variance is

the sum of the genotypic variance and the environmental variance, V (Y ) = V (X ) + V (e), i.e. the environmental contribution is assumed independent of the genotype of the individual;

Page 7: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Distribution of Y : a single biallelic locus

Page 8: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

A single biallelic locus: genetic effects

Genotype

Genotypic value

• a is the homozygous effect,• k is the dominance coeffcient• k = 0 means complete additivity,• k = 1 means complete dominance (of A2),• k > 1 if A2 is overdominant.

Page 9: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Example: The pygmy gene, pg

• From data we have the following mean values of weight:

X++ = 14g, X+pg = 12g, Xpgpg = 6g,

• 2a = 14 -6 = 8 implies a =4,

• (1 + k)a = 12 - 6 = 6 implies k = 0.5.

Data suggest recessivity (although not complete) of the pygmy gene.

Page 10: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Decomposition of the genotypic value, X

• Xij is the mean of Y for AiAj-individuals;• when k = 0 the two alleles of a biallelic locus be

haves in a completely additive fashion: X is a linear function of the number of A2-alleles;

• we can then think of each allele contributing a purely additive effect to X ;

• this can be generalized to k ≠ 0 by decomposition of X into additive contributions of alleles together with deviations resulting from dominance;

• the generalization is accomplished using least-squares regression of X on the gene content;

Page 11: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Least-squares linear regression

iondecomposit variance),() X̂()(

residuals; squared of sum theminimize

deviation; residual valuefitted i.e. ,+ X̂ = X

VVXV

Page 12: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Model 1

.for 2

,for

,for 2ˆ

genotype; in the alleles- of

number the with ,ˆ

dominance; toduedeviation residual theis

, allele ofeffect additive theis

phenotype,mean population theis

ˆ

222

2121

111

2211

AA

AA

AA

X

A

NNNX

A

XX

ij

k

kij

ij

ii

ijji

ijijji

Page 13: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

12

21

12

2211 0

p

p

pp

Page 14: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Interpretations

• in the linear regression

is the heritable component of the genotype,

δis the non-heritable part;

• the sum of an individuals additive allelic effects, αi+αj is called the breeding value and is denoted Λij

• under random mating αican be interpreted as the average excess of allele Ai

• this is defined as the difference between the expected phenotypic value when one allele (e.g. the paternally transmitted) is fixed at Ai and the population average, μ;

X

XX

ˆ

ˆ

Page 15: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Linear Regression

.

,2~

where

~

,2 locus biallelic afor since

e varianc

population ingcorrespond theimplieswhich

;0

i.e. 0, is allele

drawnrandomly a ofeffect additive expected the

;populationin alleles - of proportion

12

1

2

21

2

2

21

2

1

2211

ijij

kk

NX

-NN

pp

pp

Ap

Page 16: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Graphically

Page 17: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Linear Regression Model solving

ijij NX 2~

X prob.

0 0

a(1+k) 1

2a 2

2N21p

212 pp22p

)var(

),cov(

2

2

N

NX

Page 18: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

))1(2(2

2221)1()(

2)(

2)(

)1(442)1()(

)1(222)1()(

122

22212

2122

22

21

22

222

221

22

122221

kppap

pappkaXNE

ppNVar

pNE

kppapappkaXV

kpapapppkaXE

Page 19: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

)](1[

)](1[2

]21[2

)]1(2)1(2[2),(

21

2121

221

121222

ppka

ppkpap

kpkpap

kppkppapNXCOV

Page 20: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

average excesses

X

Xi

isoneanotherpX

isoneanotherpX

iisalleleoneXE

)1|1(

)1|2(

)|(

11

12*1

*

X

matingrandom

pXpX 111212

111221 )2()( pp

Page 21: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Interpretations under random mating

• α= a [1+ k (p1-p2)] ;

α= - p2 α;

α= p1 α, Population parameters for k≠0• α is called the average effect of allelic substitution: substitute A1 A2for a randomly chosen A1 –

allele• then the expected change in X is,

(X12 -X11) p1 + (X22 -X12) p2 ;

• which equals α. (simple calculations).

Page 22: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

: Average effect of allelic substitution

))(1(

)1()1(

)()(

21

21

1222211121

ppka

kapkap

XXpXXp

1A

1A

1A

2A2A 2A

Page 23: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

α is a function of p2 and k :

Page 24: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Partitioning the genetic variance

• the variance, V (X ), of the genotypic values in a population is called the genetic variance:

• is the additive genetic variance, i.e. variance associated with

additive allelic effects;• dominance genetic

variance, i.e. due to dominance deviations;

221 )2( akppVD

DA VV

VXV

XVXV

)()ˆ(

)ˆ()(

)(22 222

211

221 ppppVA

Page 25: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

VA

221

221

221

22221

222

221

2211

222

211

)](1[2

2

)22(

)(42

0

)(2

ppkapp

pp

ppp

regressionLinearpppV

pp

ppV

A

A

Page 26: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

V (X); VA; VD are functions of p2 and k:

;)2(]dotted[

;))](1([2]dashed[2

21

22121

akppV

ppkappV

D

A

Page 27: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Example: The Booroola gene, (Lynch and Walsh, 1998)

Page 28: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

In summary• The homozygous effect a, and the dominance co

efficient k are intrinsic properties of allelic products.

• The additive effect αi, and the average excess αi* are properties of alleles in a particular population.

• The breeding value is a property of a particular individual in reference to a particular population. It is the sum of the additive effects of an individual's alleles.

• The additive genetic variance, VA, , is a property of a particular population. It is the variance of the breeding values of individuals in the population.

Page 29: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Multilocus traits

• Do the separate locus effects combine in an additive way, or do there exist non-linear interaction between different loci: epistasis?

• Do the genes at different loci segregate independently?

• Do the gene expression vary with the environmental context: gene by environment interaction?

• Are specic genotypes associated with particular environments: covariation of genotypic values and environmental effects?

Page 30: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Example: epistasis

Average length of vegetative internodes in the lateral branch(in mm) of teosinte. Table from Lynch and Walsh (1998).

Page 31: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Two independently segregating loci

• Extending the least-squares decomposition of X :

• Λk is the breeding value of the k'th locus,

δk is the dominance deviation of the k'th locus,

ε is a residual term due to epistasis;• if the loci are independently segregating

2211X

)(

)(

)()()()()()(

2121

2121

VVV

VVVVV

VVVVVXV

DA

D,D,A,A,

Page 32: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Neglecting V (ε)

• the epistatic variance components contributing to V (ε) are often small compared to VA and VD;

• in linkage analysis it is this often assumed that V (ε) = 0;

• note however: the relative magnitude of the variance components provide only limited insight into the physiological mode of gene action;

• epistatic interactions, can greatly inflate the additive and/or dominance components of variance;

Page 33: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Resemblance between relatives

A model for the trait values of two relatives:

Yk = Xk + ek, k = 1 , 2,

where for the k’th relative• Yk is the phenotypic value,• Yk is the genotypic value,• ek is the mean zero environmental deviation.• the ek’s are assumed to be mutually independent

and also independent of k. Hence, the covariance of the trait values of two relatives is given by the genetic covariance, C(X1; X2), i.e.

C(Y1; Y2) = C(X1; X2)

Page 34: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

A (preliminary) formula for C(X1 ,X 2)

For a single locus trait

C(X1; X2) = c1VA + c2VD

• c1 and c2 are constants determined by the type of relationship between the two relatives.

• same formula applies for multilocus traits if no epistatic variance components are included in the model, i.e. V (ε) = 0.

• in this latter case and are given by summation of the corresponding locus-specific contributions.

Page 35: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Joint distribution of sibling trait values

Single biallelic, dominant (k =1 ) model. Correlation 0.46.

Page 36: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Measures of relatedness

• N = the number of alleles shared IBD by two relatives at a given locus;

• the kinship coefficient, θ , is given by2 θ = E(N) / 2;

i.e. twice the kinship coefficient equals the expected proportion of alleles shared IBD at the locus.

• The coefficient of fraternity, Δ, is defined as

Δ = P(N = 2).

Page 37: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Some examples

• Siblings

(z0; z1; z2) = (1/4; 1/2; 1/4) implying E(N) = 1.Thus θ= 1/4 and Δ = 1/4:

• Parent-offspring

(z0; z1; z2) = (0; 1; 0) implying E(N) = 1.Thus θ = 1/4 and Δ = 0:

• Grandparent - grandchild

(z0; z1; z2) = (1/2; 1/2; 0) implying E(N) = 1=2.Thus θ = 1/8 and Δ = 0:

Page 38: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Covariance formula for a single locus

)2(2

)(

2

),( ),( 2121

DA

DA

VNPVNE

VθV

XXCYYC

Under the assumed model

),(

),(),(21

221121

2222

1111

ijij

jiji

ijji

ijji

Cov

CovXXCov

X

X

Page 39: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

A single locus; perfect marker data

2if

1if 2/

0if 0

(

i.e.

1or 0 if 0

2 if 1}2{

with

22

21

21

NVV

NV

N

|N),YYC

NN

NNI

VNIVN

|N),YC(Y

DA

A

DA

Page 40: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Covariance formula for multiple loci

n independently segregating loci assuming noepistatic interaction, i.e. putting V (ε) = 0

ly.respective ons,contributi

variancedominace and -additive specific-locus are ,

; locusat IBD shared alleles ofmumber theis

;)2(2

)(

2

2

),(),(

,,

,,

,,

2121

lDlA

l

l lDllAl

l lDl lA

DA

VV

lN

VNPVNE

VV

VV

XXCYYC

Page 41: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Covariance formula for multiple loci

n independently segregating loci assuming noepistatic interaction, i.e. putting V (ε) = 0

ly.respective ons,contributi

variancedominace and -additive specific-locus are ,

; locusat IBD shared alleles ofmumber theis

;)2(2

)(

2

2

),(),(

,,

,,

,,

2121

lDlA

l

l lDllAl

l lDl lA

DA

VV

lN

VNPVNE

VV

VV

XXCYYC

Page 42: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Covariance... continued

);MD|2(

and

;2/]MD|[

2 xx(x)

xx(x)

NP

NE

Define for every pair of relatives

For two related individuals we then have,

RDxAxDxA

lDxllAxl

l

x

VVVV

VNPVNE

YYC

xx

,,,2,

,,

21

2

;))MD|2(2

]MD|[(

)MD|,(

)()(

Page 43: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Haseman-Elston method

• Uses pairs of relatives of the same type: most often sib pairs;

• for each relative pair calculate the squared phenotypic difference: Z = (Y1 –Y2)2;

• given MDx regress the Z's on the expected proportion of alleles IBD, π(x) = E [Nx |MDx]/2, at the test locus;

• a slope coefficient β< 0, if statistically significant, is considered as evidence for linkage;

Page 44: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

HE: an example

0.5Proportion of marker alleles identical by decent

Solid line is the tted regression line;

Dotted line indicates true underlying relationship

Page 45: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

HE: motivation

)(2)(2

)(2)()(

][])[(

21

2121

212

21

YYCYV

YYCYVYV

YYVYYE

Assume strictly additive gene action at each locus,i.e.VD = 0. Then, for a putative QTL at x,

!in function linear a is This :NOTE

]2[2)(2

)MD|(2)(2 ]MD|)[(

)(

,,)(

212

21

x

RAxAx

xx

VVYV

YYCYVYYE

Page 46: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

HE: linkage test

0 : H1

vs

)0(,0 : H0

is test linkage The

2

]2)([2

where

]MD,[

,

,

,

)(21

xA

xA

RA

xx

V

V

VYV

|YYE

Page 47: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

HE: examples with simulated data

simulated data from n = 200 sib-pairs;top to bottom: h2 = 0:50; 0:33; 0:25.

Page 48: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Heritability and power

• for a given locus we may define the locus-specific heritability as the proportion of the total variance 'explained' by that particular site, e.g. (in the narrow-sense),

• the locus-specific heritability is the single most important parameter for the power of QTL linkage methods;

• heritabilities below 10% leads, in general, to unrealistically large sample sizes.

)(2

YV

Vh A

Page 49: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

HE: two-point analysis)(2

21

~~genotypes]marker |)[( mYYE

~

)(mwhere is the expected proportion of marker alleles shared IBD.

• depends on the type of relatives considered;• for sib pairs• recombination fraction (θ) and effect size (VA;l ) are confounded and cannot be separately estimated;

;)21(2~

,2

lAV

Page 50: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

HE: in summary

Simple, transparent and comparatively robust but:

• poor statistical power in many settings;• different types of relatives cannot be mixed;• parents and their offspring cannot be used in HE;• assumptions of the statistical model not generally

satisfied;

• Remedy:• use one of several suggested extensions of HE;• alternatively, use VCA instead

Page 51: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Mathematically:

Yi=+Tai+gi+qi+ei

where is the population mean, a are the “environmental” predictor variables, q is the major trait locus, g is the polygenic effect, and e is the residual error.

Polygenes Independent environment

QTL

Trait value

VCA

Page 52: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

VCA: an additive model

RDRAxDx

xAx

x

RDRAxDxA

DA

p

i ii

n

ll

p

iii

VVVVYYC

eVVVVV

eVVVYV

zYE

eXzY

,,,)(

2,)(

21

,,,,

1

11

2)MD|,(

)(

)()(

;)(

Page 53: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

VCA: major assumptionThe joint distribution of the phenotypic values in a pedigree is assumed to be multivariate normal with the given mean values, variances and covariances;

• the multivariate normal distribution is completely specified by the mean values, variances and covariances;• the likelihood, L, of data can be calculated and we can estimate the variance components VA;x; VD;x ; VA;R; VD;R;

Page 54: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

VCA: linkage test

The linkage test of

H0 : VA;x = VD;x = 0

uses the LOD score statistic

When the position of the test locus, x, is varied over a chromosomal region the result can be summarized in a LOD score curve.

)0(L

model) L(fulllogLOD

,,10

xDxA

x VV

Page 55: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

VCA vs HE: LOD score proles

From Pratt et al.; Am. J. Hum. Genet. 66:1153-1157, (2000)

Page 56: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Linkage methods for QTL

• Fully parametric linkage approach is difficult;• Model-free tests comprise the alternative choice; • We will discuss

Haseman-Elston Regression (HE);Variance Components Analysis (VCA);

Both can be viewed as two-step procedures:1. use polymorphic molecular markers to extract information on inheritance patterns;2. evaluate evidence for a trait-influencing locus at specified locations;

Page 57: Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:

Similarities and differences

• HE and VCA are based on estimated IBD-sharing given marker data;

• both methods require specification of a statistical model!('model-free' means 'does not require specification of genetic model')

• similarity in IBD-sharing is used to evaluate trait similarity using eitherlinear regression (HE) orvariance components analysis (VCA);