box 1 | quan ;g9) ) ) reviews s si quantat n t r t d on th

2
-/' @B(G< 27 -) -) -/' ;72C 9A P 9A P 6E+ IHAC SB?F 72C B"+!(,-.A )#'&* 8 )1 "B#R:*( ! =K"B4S 0 D> years ago years ago +(267;AB "'#P7) ! 9-"'2S . <: 2S4-85&*1 ! /SI0 (P 2\¡5 6&$0 n cv '# 2!09 (P 23eJ7SQbl[5 6m". (P 2vQBE63:K|)¥oyx\¡)x:4N>Zr 7vQn)\¡V`4pi (P]2QrA5IM;<1;G9q?N@3 ) |-)x1a0 7\¡V`4pi (P 2 |C3FKged (P 2 htvjkO^ z^ (P 24NB&D8=@N}(,/\¡)O¢u f_o£¤B |-{Q(Xx;G9W1.s+ 4NB);G9y* D8=@NB*TT~w'LFK0 ;G9) )) ( + + + + ( ) ( HO COOH O CO .2ae%' Jd2>WAN(9T#AaeQ ?58L0&iM;G,!G E1cBJd2K )11 1 2. )( )+-3S=l 3 1 .11 ; Jd2<I"@C F 1 a b c F 8 Parent 1 Parent 2 × × F 1 . . . )6QA,L K"'B=eEC)Q >L)o7(' PG0G:)8V AASA3)5<' eE@?%lM#;I (( .. (. .32 -10 B=! K"'aANJL)TF' eE-+*.N)2&' 1 4 3 2 6 5 8 7 9 a 1 2 3 4 5 6 7 8 9 –z –A –b –c –D –e –F –G –z –A –B –C –d –e –f –G –z –a –B –C –D –E –F –G –z –A/a –B –c –d –e –F –G –Z –A/a –b –c –D –E –f –g –z –A –B –C –D –E –f –g –D –E –F –G –z –A/a –b –C –z –a –B –C –d –E –F –g –Z –a –b –c –d –e –f –G b 1 2 3 4 5 6 7 8 9 AA AA aa Aa Aa AA Aa aa aa A a AA Aa Genotype aa Z E Map position G Test statistic A B C D F c ?;F!AGE/"78!aB,*3%3 7-0"N! 2P@ GE/V+=<"6C!aB,*3%3 )aB@C$('#)"RI >Je.2P@ >J"4:!aB=< (120- B (QTL) (Q 3O! 3O!3D) #%013"51 1 A B C D A (QTL) C D L,&+.-46$2) 3*216$) AA CC DD bb aa cc dd BB Aa Cc Dd Bb 1 F1 2 F2 AA CC DD bb aa cc dd BB Aa Cc Dd Bb 1B:2 =?7:#<SAB7 C7:#<SAQ7 =?9!<SAQ7 19>+.+@5 =?79>AC:6( DA8"';8T*-5* ,F*0"PLB"19>@5 . %% . %% . (%% . ))( =?79>AC:6( 20

Upload: others

Post on 22-Oct-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Box 1 | Quan ;G9) ) ) REVIEWS S SI quantat n t r t d on th

2 7- )

- )

C

AP

AP

6

I

SB

C

B AB R ( K B S

7000 years ago

500 years ago

2 67 AB

P S

S��������� - ������� ���������! �������������� SI

( 2 5 6 09

( 2 e 5 6

( 2v B 3 r7v 4

( 2 1 30

7 4

( 2 3

( 2

( 2o B

B 0

( ��������

+ + �� �+ ��+ ���

() (

HOCOOH

O

CO

ae

d W T AaeQL i G G

c d) 1 1 1 2 . ) (

S l3 1 .1 1 ;

d

NATURE REVIEWS | GENETICS VOLUME 2 | MAY 2001 | 3 7 1

R E V I EW S

In practice, several crossing schemes are used togenerate this mapping population12. In all of these,the parents are mated to generate an F1 population.In one approach, recombinant inbred lines can becreated by selfing (self-fertilizing) each of the F1progeny for several (usually eight) generations (FIG.

1)13. In an ‘F2 design’, the mapping population is gen-erated by mating the F1 progeny to each other (FIG. 2).In a ‘backcross design’, the mapping population isgenerated by crossing the F1 progeny to either, orboth, of the parents (FIG. 3). Several variations onthese crossing schemes have been designed to maxi-mize the shuffling of parental alleles12,14. Once allindividuals in the mapping population are scored forphenotype and multilocus genotype, the actual QTLmapping can begin. The statistical tools at the foun-dations of QTL mapping have been used for manyyears (BOX 2). In fact, in 1923, Karl Sax mapped a QTLfor seed size in the bean, Phaseolus vulgaris, by statis-tically associating it with a Mendelian locus for seed pigmentation15.

Today, we generally have much more detailed geneticmaps available. For example, Arabidopsis thaliana has1,262 genetic markers, which consist of restriction frag-ment length polymorphisms (RFLPs) and singlenucleotide polymorphisms (SNPs) that vary betweenthe two parental lines of the most commonly used set ofrecombinant inbred lines (FIG. 1)16,17. Cereon Genomicshas identified and made available a collection of 28,117SNPs, which include 15,674 insertion/deletion poly-morphisms that are polymorphic between the two par-ents of those same recombinant inbred lines.

Ultimately,QTL analysis yields a statistical descrip-tion of the genes that underlie the phenotypes ofinterest. The ‘statistical fog’ is not completely lifted,but we can see the shadows of those genes.

Box 1 | Quantitative genetics reborn

“… evolution is essentially a statistical problem,” W. F. R. Weldon (1893)

The early history of evolutionary genetics focused on understanding complex traits, particularly those relatingto humans, such as intelligence, temper and ‘artistic faculty’99. Without the benefit of Mendel’s ideas, FrancisGalton and the mathematician, Karl Pearson, established that useful predictions of the evolutionary trajectory of complex traits could be made without recourse to an explicit understanding of inheritance99. This area of‘quantitative genetics’ rested on the statistical properties of the MULTIVARIATE NORMAL DISTRIBUTION8,18.With the rediscovery of Mendel’s theory of heredity in 1900, a conflict arose between this ‘biometrical’ school

of quantitative genetics and the discrete genetics of the Mendelian school99. By 1910, it had been shown thatcontinuous phenotypic variation could result from the action of the environment on the segregation of manyMendelian loci99,100. By 1918, Ronald Fisher convincingly reconciled the discrete inheritance of Mendelism withthe biometrical approach101. Modern evolutionary quantitative genetics is largely based on the same statisticalfoundations that were laid by Pearson and Fisher102–108.

For most of the twentieth century, quantitative genetics had a crucial role in both agriculture and evolutionarybiology, but never seemed fully embraced by modern molecular genetics. There was (and perhaps still is) awidespread perception that quantitative genetics essentially ignored genetics, blanketing actual genes in what has been called a “statistical fog”109.

Mapping quantitative trait loci (QTL) — the single or many genes that underlie quantitative phenotypes —might represent an important part of the final synthesis of molecular and quantitative genetics. By working from the phenotype to the genotype, QTL mapping uses statistical techniques to localize chromosomal regions that might contain genes contributing to phenotypic variation in a complex trait of interest. Workingfrom the gene to the phenotype, molecular geneticists might be able to meet quantitative geneticists at somegenetic Promontory Point.

F1

a

b

cF8

Parent 1 Parent 2

×

× F1...

Figure 1 | Principles of mapping quantitative trait loci.The basic strategy behind mapping quantitative trait loci(QTL) is illustrated here for a | the density of hairs(trichomes) that occur on a plant leaf. Inbred parents thatdiffer in the density of trichomes are crossed to form an F

1

population with intermediate trichome density. b | An F1

individual is selfed to form a population of F2

individuals. c | Each F

2is selfed for six additional generations,

ultimately forming several recombinant inbred lines (RILs).Each RIL is homozygous for a section of a parentalchromosome. The RILs are scored for several geneticmarkers, as well as for the trichome density phenotype. In c, the arrow marks a section of chromosome thatderives from the parent with low trichome density. Theleaves of all individuals that have inherited that section ofchromosome from the parent with low trichome densityalso have low trichome density, indicating that thischromosomal region probably contains a QTL for this trait.

MULTIVARIATE NORMAL

DISTRIBUTION

The central limit theoremassures a normal (bell-shaped)distribution for a variable thatis the summation of manyindependent, random inputs.This applies to single ormultiple variables.

© 2001 Macmillan Magazines Ltd

) A

e QL o7

V

A Ae l

(( ..

(.

.3 2 -10

aA L Te N

374 | MAY 2001 | VOLUME 2 www.nature.com/reviews/genetics

R E V I EW S

Box 2 | Quantitative trait loci mapping methods

Several techniques exist for mapping quantitative trait loci (QTL) in the population; I have used the example of trichomedensity in leaves described in the figure to illustrate these methods. a | In the regression technique, the phenotype iscorrelated with each marker genotype110 (the middle panel represents the differential migration of DNA on a gel). In thiscase, a single marker ‘A’ is scored. Individuals that are homozygous for the A allele have high trichome density, individualsthat are homozygous for the ‘a’allele have low trichome density, and heterozygotes have intermediate trichome density. Alinear regression of trichome density on the number of A alleles shows a significant relationship between the marker andthe phenotype, which indicates that a QTL for trichome density is probably linked to that marker. The simple regressionmethod described is of limited use in localizing the chromosomal segment that contains a QTL. The methodunderestimates the effect of the QTL. The further the QTL is from the marker, the weaker the effect.

The interval mapping method uses a pair or two pairs of flanking markers at a time83,111. b | In this approach, the QTL is located within a chromosomal interval,defined by the flanking markers. The technique involves scoring a large numberof markers, as illustrated in the top panel, and then assessing the probability that an interval between two markers isassociated with a QTL that affects the trait of interest. The results of the analysis are plotted as a LIKELIHOOD-RATIO TEST

STATISTIC against the chromosomal map position,measured in recombination units (Morgans). The dotted line representsa significance threshold above which a likelihood-ratio test provides a statistically significant fit to a model of the data.The best estimate of the location of the QTL is given by the chromosomal location that corresponds to the highestsignificant likelihood ratio. Although the interval mapping method was an important advance, it too is statisticallybiased. In particular,QTL outside the interval under consideration can affect the ability to find a QTL within it112. Inaddition, false identification of a QTL can arise if other QTL are linked to the interval of interest (the false ‘ghost peak’onthe right)112. c | A third method, known as composite interval mapping, combines the interval mapping technique withmultiple regression analysis81,85,112. Composite interval mapping assesses the probability that an interval between twomarkers is associated with a QTL that affects the trait of interest, as well as controlling for the effects of other markers onthe trait (thus providing more accurate results). The results of the analysis are plotted as in b. In this method, the teststatistic is independent of QTL in other regions of the chromosomes (although it is still biased if the QTL is in the intervalimmediately adjacent to the interval of interest)85.

Zhao-Bang Zeng and his colleagues have extended this method to a multiple interval mapping technique. Multipleinterval mapping combines multiple QTL mapping analysis with the analysis of genetic architecture by using analgorithm to search for number,positions, effects and interactions of significant QTL113,114.

Several computer programs that carry out these mapping methods are freely available12,115. Unfortunately, mostof these programs lack an easy-to-use interface and are not of commercial quality, something desperately neededfor most users.

1

4

3

2 6

5

8

7 9

a

1 2 3 4 5 6 7 8 9

AA AA aa Aa Aa AA Aa aa aa

A

a

AA Aa

Genotype

aa

Tric

ho

me d

ensi

ty

Z

1 2 3 4 5 6 7 8 9

E G

Test

sta

tistic

A B C D F

–z–A–b–c

–D–e–F–G

–z–A–B–C

–d–e–f–G

–z–a–B–C

–D–E–F–G

–z–A/a–B–c

–d–e–F–G

–Z–A/a–b–c

–D–E–f–g

–z–A–B–C

–D–E–f–g

–D–E–F–G

–z–A/a–b–C

–z–a–B–C

–d–E–F–g

–Z–a–b–c

–d–e–f–G

Z E

Map position

G

Test

sta

tistic

A B C D F

b

c

Ghost peak

LIKELIHOOD-RATIO TEST

STATISTIC

A maximum-likelihoodmethod of hypothesis testing.The likelihood-ratio teststatistic is twice the naturallogarithm of the ratio of themaximum likelihood that thedata fit the alternativehypothesis to the maximumlikelihood that the data fit thenull hypothesis.

© 2001 Macmillan Magazines Ltd

374 | MAY 2001 | VOLUME 2 www.nature.com/reviews/genetics

R E V I EW S

Box 2 | Quantitative trait loci mapping methods

Several techniques exist for mapping quantitative trait loci (QTL) in the population; I have used the example of trichomedensity in leaves described in the figure to illustrate these methods. a | In the regression technique, the phenotype iscorrelated with each marker genotype110 (the middle panel represents the differential migration of DNA on a gel). In thiscase, a single marker ‘A’ is scored. Individuals that are homozygous for the A allele have high trichome density, individualsthat are homozygous for the ‘a’allele have low trichome density, and heterozygotes have intermediate trichome density. Alinear regression of trichome density on the number of A alleles shows a significant relationship between the marker andthe phenotype, which indicates that a QTL for trichome density is probably linked to that marker. The simple regressionmethod described is of limited use in localizing the chromosomal segment that contains a QTL. The methodunderestimates the effect of the QTL. The further the QTL is from the marker, the weaker the effect.

The interval mapping method uses a pair or two pairs of flanking markers at a time83,111. b | In this approach, the QTL is located within a chromosomal interval,defined by the flanking markers. The technique involves scoring a large numberof markers, as illustrated in the top panel, and then assessing the probability that an interval between two markers isassociated with a QTL that affects the trait of interest. The results of the analysis are plotted as a LIKELIHOOD-RATIO TEST

STATISTIC against the chromosomal map position,measured in recombination units (Morgans). The dotted line representsa significance threshold above which a likelihood-ratio test provides a statistically significant fit to a model of the data.The best estimate of the location of the QTL is given by the chromosomal location that corresponds to the highestsignificant likelihood ratio. Although the interval mapping method was an important advance, it too is statisticallybiased. In particular,QTL outside the interval under consideration can affect the ability to find a QTL within it112. Inaddition, false identification of a QTL can arise if other QTL are linked to the interval of interest (the false ‘ghost peak’onthe right)112. c | A third method, known as composite interval mapping, combines the interval mapping technique withmultiple regression analysis81,85,112. Composite interval mapping assesses the probability that an interval between twomarkers is associated with a QTL that affects the trait of interest, as well as controlling for the effects of other markers onthe trait (thus providing more accurate results). The results of the analysis are plotted as in b. In this method, the teststatistic is independent of QTL in other regions of the chromosomes (although it is still biased if the QTL is in the intervalimmediately adjacent to the interval of interest)85.

Zhao-Bang Zeng and his colleagues have extended this method to a multiple interval mapping technique. Multipleinterval mapping combines multiple QTL mapping analysis with the analysis of genetic architecture by using analgorithm to search for number,positions, effects and interactions of significant QTL113,114.

Several computer programs that carry out these mapping methods are freely available12,115. Unfortunately, mostof these programs lack an easy-to-use interface and are not of commercial quality, something desperately neededfor most users.

1

4

3

2 6

5

8

7 9

a

1 2 3 4 5 6 7 8 9

AA AA aa Aa Aa AA Aa aa aa

A

a

AA Aa

Genotype

aa

Tric

ho

me d

ensi

ty

Z

1 2 3 4 5 6 7 8 9

E G

Test

sta

tistic

A B C D F

–z–A–b–c

–D–e–F–G

–z–A–B–C

–d–e–f–G

–z–a–B–C

–D–E–F–G

–z–A/a–B–c

–d–e–F–G

–Z–A/a–b–c

–D–E–f–g

–z–A–B–C

–D–E–f–g

–D–E–F–G

–z–A/a–b–C

–z–a–B–C

–d–E–F–g

–Z–a–b–c

–d–e–f–G

Z E

Map position

G

Test

sta

tistic

A B C D F

b

c

Ghost peak

LIKELIHOOD-RATIO TEST

STATISTIC

A maximum-likelihoodmethod of hypothesis testing.The likelihood-ratio teststatistic is twice the naturallogarithm of the ratio of themaximum likelihood that thedata fit the alternativehypothesis to the maximumlikelihood that the data fit thenull hypothesis.

© 2001 Macmillan Magazines Ltd

374 | MAY 2001 | VOLUME 2 www.nature.com/reviews/genetics

R E V I EW S

Box 2 | Quantitative trait loci mapping methods

Several techniques exist for mapping quantitative trait loci (QTL) in the population; I have used the example of trichomedensity in leaves described in the figure to illustrate these methods. a | In the regression technique, the phenotype iscorrelated with each marker genotype110 (the middle panel represents the differential migration of DNA on a gel). In thiscase, a single marker ‘A’ is scored. Individuals that are homozygous for the A allele have high trichome density, individualsthat are homozygous for the ‘a’allele have low trichome density, and heterozygotes have intermediate trichome density. Alinear regression of trichome density on the number of A alleles shows a significant relationship between the marker andthe phenotype, which indicates that a QTL for trichome density is probably linked to that marker. The simple regressionmethod described is of limited use in localizing the chromosomal segment that contains a QTL. The methodunderestimates the effect of the QTL. The further the QTL is from the marker, the weaker the effect.

The interval mapping method uses a pair or two pairs of flanking markers at a time83,111. b | In this approach, the QTL is located within a chromosomal interval,defined by the flanking markers. The technique involves scoring a large numberof markers, as illustrated in the top panel, and then assessing the probability that an interval between two markers isassociated with a QTL that affects the trait of interest. The results of the analysis are plotted as a LIKELIHOOD-RATIO TEST

STATISTIC against the chromosomal map position,measured in recombination units (Morgans). The dotted line representsa significance threshold above which a likelihood-ratio test provides a statistically significant fit to a model of the data.The best estimate of the location of the QTL is given by the chromosomal location that corresponds to the highestsignificant likelihood ratio. Although the interval mapping method was an important advance, it too is statisticallybiased. In particular,QTL outside the interval under consideration can affect the ability to find a QTL within it112. Inaddition, false identification of a QTL can arise if other QTL are linked to the interval of interest (the false ‘ghost peak’onthe right)112. c | A third method, known as composite interval mapping, combines the interval mapping technique withmultiple regression analysis81,85,112. Composite interval mapping assesses the probability that an interval between twomarkers is associated with a QTL that affects the trait of interest, as well as controlling for the effects of other markers onthe trait (thus providing more accurate results). The results of the analysis are plotted as in b. In this method, the teststatistic is independent of QTL in other regions of the chromosomes (although it is still biased if the QTL is in the intervalimmediately adjacent to the interval of interest)85.

Zhao-Bang Zeng and his colleagues have extended this method to a multiple interval mapping technique. Multipleinterval mapping combines multiple QTL mapping analysis with the analysis of genetic architecture by using analgorithm to search for number,positions, effects and interactions of significant QTL113,114.

Several computer programs that carry out these mapping methods are freely available12,115. Unfortunately, mostof these programs lack an easy-to-use interface and are not of commercial quality, something desperately neededfor most users.

1

4

3

2 6

5

8

7 9

a

1 2 3 4 5 6 7 8 9

AA AA aa Aa Aa AA Aa aa aa

A

a

AA Aa

Genotype

aa

Tric

ho

me d

ensi

ty

Z

1 2 3 4 5 6 7 8 9

E G

Test

sta

tistic

A B C D F

–z–A–b–c

–D–e–F–G

–z–A–B–C

–d–e–f–G

–z–a–B–C

–D–E–F–G

–z–A/a–B–c

–d–e–F–G

–Z–A/a–b–c

–D–E–f–g

–z–A–B–C

–D–E–f–g

–D–E–F–G

–z–A/a–b–C

–z–a–B–C

–d–E–F–g

–Z–a–b–c

–d–e–f–G

Z E

Map position

G

Test

sta

tistic

A B C D F

b

c

Ghost peak

LIKELIHOOD-RATIO TEST

STATISTIC

A maximum-likelihoodmethod of hypothesis testing.The likelihood-ratio teststatistic is twice the naturallogarithm of the ratio of themaximum likelihood that thedata fit the alternativehypothesis to the maximumlikelihood that the data fit thenull hypothesis.

© 2001 Macmillan Magazines Ltd

374 | MAY 2001 | VOLUME 2 www.nature.com/reviews/genetics

R E V I EW S

Box 2 | Quantitative trait loci mapping methods

Several techniques exist for mapping quantitative trait loci (QTL) in the population; I have used the example of trichomedensity in leaves described in the figure to illustrate these methods. a | In the regression technique, the phenotype iscorrelated with each marker genotype110 (the middle panel represents the differential migration of DNA on a gel). In thiscase, a single marker ‘A’ is scored. Individuals that are homozygous for the A allele have high trichome density, individualsthat are homozygous for the ‘a’allele have low trichome density, and heterozygotes have intermediate trichome density. Alinear regression of trichome density on the number of A alleles shows a significant relationship between the marker andthe phenotype, which indicates that a QTL for trichome density is probably linked to that marker. The simple regressionmethod described is of limited use in localizing the chromosomal segment that contains a QTL. The methodunderestimates the effect of the QTL. The further the QTL is from the marker, the weaker the effect.

The interval mapping method uses a pair or two pairs of flanking markers at a time83,111. b | In this approach, the QTL is located within a chromosomal interval,defined by the flanking markers. The technique involves scoring a large numberof markers, as illustrated in the top panel, and then assessing the probability that an interval between two markers isassociated with a QTL that affects the trait of interest. The results of the analysis are plotted as a LIKELIHOOD-RATIO TEST

STATISTIC against the chromosomal map position,measured in recombination units (Morgans). The dotted line representsa significance threshold above which a likelihood-ratio test provides a statistically significant fit to a model of the data.The best estimate of the location of the QTL is given by the chromosomal location that corresponds to the highestsignificant likelihood ratio. Although the interval mapping method was an important advance, it too is statisticallybiased. In particular,QTL outside the interval under consideration can affect the ability to find a QTL within it112. Inaddition, false identification of a QTL can arise if other QTL are linked to the interval of interest (the false ‘ghost peak’onthe right)112. c | A third method, known as composite interval mapping, combines the interval mapping technique withmultiple regression analysis81,85,112. Composite interval mapping assesses the probability that an interval between twomarkers is associated with a QTL that affects the trait of interest, as well as controlling for the effects of other markers onthe trait (thus providing more accurate results). The results of the analysis are plotted as in b. In this method, the teststatistic is independent of QTL in other regions of the chromosomes (although it is still biased if the QTL is in the intervalimmediately adjacent to the interval of interest)85.

Zhao-Bang Zeng and his colleagues have extended this method to a multiple interval mapping technique. Multipleinterval mapping combines multiple QTL mapping analysis with the analysis of genetic architecture by using analgorithm to search for number,positions, effects and interactions of significant QTL113,114.

Several computer programs that carry out these mapping methods are freely available12,115. Unfortunately, mostof these programs lack an easy-to-use interface and are not of commercial quality, something desperately neededfor most users.

1

4

3

2 6

5

8

7 9

a

1 2 3 4 5 6 7 8 9

AA AA aa Aa Aa AA Aa aa aa

A

a

AA Aa

Genotype

aa

Tric

ho

me d

ensi

ty

Z

1 2 3 4 5 6 7 8 9

E G

Test

sta

tistic

A B C D F

–z–A–b–c

–D–e–F–G

–z–A–B–C

–d–e–f–G

–z–a–B–C

–D–E–F–G

–z–A/a–B–c

–d–e–F–G

–Z–A/a–b–c

–D–E–f–g

–z–A–B–C

–D–E–f–g

–D–E–F–G

–z–A/a–b–C

–z–a–B–C

–d–E–F–g

–Z–a–b–c

–d–e–f–G

Z E

Map position

G

Test

sta

tistic

A B C D F

b

c

Ghost peak

LIKELIHOOD-RATIO TEST

STATISTIC

A maximum-likelihoodmethod of hypothesis testing.The likelihood-ratio teststatistic is twice the naturallogarithm of the ratio of themaximum likelihood that thedata fit the alternativehypothesis to the maximumlikelihood that the data fit thenull hypothesis.

© 2001 Macmillan Magazines Ltd374 | MAY 2001 | VOLUME 2 www.nature.com/reviews/genetics

R E V I EW S

Box 2 | Quantitative trait loci mapping methods

Several techniques exist for mapping quantitative trait loci (QTL) in the population; I have used the example of trichomedensity in leaves described in the figure to illustrate these methods. a | In the regression technique, the phenotype iscorrelated with each marker genotype110 (the middle panel represents the differential migration of DNA on a gel). In thiscase, a single marker ‘A’ is scored. Individuals that are homozygous for the A allele have high trichome density, individualsthat are homozygous for the ‘a’allele have low trichome density, and heterozygotes have intermediate trichome density. Alinear regression of trichome density on the number of A alleles shows a significant relationship between the marker andthe phenotype, which indicates that a QTL for trichome density is probably linked to that marker. The simple regressionmethod described is of limited use in localizing the chromosomal segment that contains a QTL. The methodunderestimates the effect of the QTL. The further the QTL is from the marker, the weaker the effect.

The interval mapping method uses a pair or two pairs of flanking markers at a time83,111. b | In this approach, the QTL is located within a chromosomal interval,defined by the flanking markers. The technique involves scoring a large numberof markers, as illustrated in the top panel, and then assessing the probability that an interval between two markers isassociated with a QTL that affects the trait of interest. The results of the analysis are plotted as a LIKELIHOOD-RATIO TEST

STATISTIC against the chromosomal map position,measured in recombination units (Morgans). The dotted line representsa significance threshold above which a likelihood-ratio test provides a statistically significant fit to a model of the data.The best estimate of the location of the QTL is given by the chromosomal location that corresponds to the highestsignificant likelihood ratio. Although the interval mapping method was an important advance, it too is statisticallybiased. In particular,QTL outside the interval under consideration can affect the ability to find a QTL within it112. Inaddition, false identification of a QTL can arise if other QTL are linked to the interval of interest (the false ‘ghost peak’onthe right)112. c | A third method, known as composite interval mapping, combines the interval mapping technique withmultiple regression analysis81,85,112. Composite interval mapping assesses the probability that an interval between twomarkers is associated with a QTL that affects the trait of interest, as well as controlling for the effects of other markers onthe trait (thus providing more accurate results). The results of the analysis are plotted as in b. In this method, the teststatistic is independent of QTL in other regions of the chromosomes (although it is still biased if the QTL is in the intervalimmediately adjacent to the interval of interest)85.

Zhao-Bang Zeng and his colleagues have extended this method to a multiple interval mapping technique. Multipleinterval mapping combines multiple QTL mapping analysis with the analysis of genetic architecture by using analgorithm to search for number,positions, effects and interactions of significant QTL113,114.

Several computer programs that carry out these mapping methods are freely available12,115. Unfortunately, mostof these programs lack an easy-to-use interface and are not of commercial quality, something desperately neededfor most users.

1

4

3

2 6

5

8

7 9

a

1 2 3 4 5 6 7 8 9

AA AA aa Aa Aa AA Aa aa aa

A

a

AA Aa

Genotype

aa

Tric

ho

me d

ensi

ty

Z

1 2 3 4 5 6 7 8 9

E G

Test

sta

tistic

A B C D F

–z–A–b–c

–D–e–F–G

–z–A–B–C

–d–e–f–G

–z–a–B–C

–D–E–F–G

–z–A/a–B–c

–d–e–F–G

–Z–A/a–b–c

–D–E–f–g

–z–A–B–C

–D–E–f–g

–D–E–F–G

–z–A/a–b–C

–z–a–B–C

–d–E–F–g

–Z–a–b–c

–d–e–f–G

Z E

Map position

G

Test

sta

tistic

A B C D F

b

c

Ghost peak

LIKELIHOOD-RATIO TEST

STATISTIC

A maximum-likelihoodmethod of hypothesis testing.The likelihood-ratio teststatistic is twice the naturallogarithm of the ratio of themaximum likelihood that thedata fit the alternativehypothesis to the maximumlikelihood that the data fit thenull hypothesis.

© 2001 Macmillan Magazines Ltd

a 3 37 N

PV a 3 3

) a C R

eP

a

( 120-

B (����QTL)

Q

O O 3D

13

A

B

C

D

A (���QTL)

CD

L 4 2 )3 2 1 )

��� ���

AA CC DDbb aa cc ddBB

Aa Cc DdBb

��1�F1

��2�F2

AA CC DDbb

aa cc ddBB

�����

�����

�����

�������

Aa Cc DdBb� ��

1 B

7: SA 7

C 7: SAQ7

9 SAQ7

7 A :6

8 T 5F P L

.% %

.% %

.( % % .

)) (

7 A :6

20

Page 2: Box 1 | Quan ;G9) ) ) REVIEWS S SI quantat n t r t d on th

5 12 LL 4 9 4

9 10:7

ds ds ds222 ds222a

xui

ui

S ( 1

ds

on y A

tn A W G

uic

uic

nx

n

n G m e G

01 ) 1 10 ( ) (

a G P d

e cWS c N ;;GN A(

( )

O S ) ) G2 2 1

2 )

. 2 )

G( 13

11

2 13( 1

A B A BM

P ;; BM

(

B

���������� ���

����

trGeA n

tr AB M

g lo icG a

N M n hA M

-.8 70 3 80 25 4 :

3 05 294

1

S

5 1

,.- ,- ) ( )

DRI t Cp K KuaA B n

DRI t

OHO COOH

8 4

ClO

NH

OO

O

N S P( ,

00 251

()(

00

Cre

( ,

251

PM

re

re

758:46 I

0 0 1

O (

3 2

70

4 A 8( 0 A )K K A

4 (

80 88 )