mapping course-lecture 1

Upload: vahidvashar

Post on 30-May-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/14/2019 Mapping Course-Lecture 1

    1/21

    Mapping Course 2007

    Project funded by USDA-CSREESProject funded by USDA-CSREESJorge DubcovskyJorge Dubcovsky

  • 8/14/2019 Mapping Course-Lecture 1

    2/21

    Before Mapping

    6B 7A

    7B6A5D5B5A4D4A3D3B3A2D2B1D1B1A

    1BL1BS

    Nulli-tetrasomic lines: missing 1 pair of chromosomes that is compensated with a double dose of

    one homoeologous chromosomes. The chromosome location is identified by

    lack of amplification. Example: microsatellite located on chromosome 1B

    Ditelosomic lines: missing 1 arm. The name indicates the arm PRESENT.Example: microsatellitelocated on chromosome arm 1BL

    1BL2 1BL1 1BL6

    C-1BL31BL3

    1BL3-0.85-100

    1BL2-0.69-0.85

    1BL1-0.47-0.69

    1BL6-0.32-0.47

    C-1BL6-0.32

    Deletion bin map of chromosome 1B

    Deletion stocks: missing 1 segment. The name indicates the segmentABSENT.

    Example: microsatellite located on bin 1BL1.

    Previous mapping: check chromosome location in GrainGeneshttp://wheat.pw.usda.gov/GG2/index.shtml

    http://wheat.pw.usda.gov/GG2/index.shtmlhttp://wheat.pw.usda.gov/GG2/index.shtml
  • 8/14/2019 Mapping Course-Lecture 1

    3/21

    Mapping populations

    Temporary populations Immortal populations

  • 8/14/2019 Mapping Course-Lecture 1

    4/21

    Crossovers

    Meiotic chromosomes Meiotic products

    Meiosis with

    no crossover

    between loci

    1 and 2.

    Meiosis with

    1 crossover

    between loci

    1 and 2.

    Parental A

    Parental A

    Parental B

    Parental B

    Parental ARecombinant

    Parental B

    Recombinant

    A1 A2

    A1 A2

    A1 A2

    A1 A2

    A1 A2A1 A2

    A1 A2 A1 B2

    B1 A2

    B1 B2

    B1 B2

    B1 B2

    B1 B2 B1 B2

    B1 B2

    B1 B2

    Linked loci can be separated

    by a physical exchange of

    chromosome parts during

    meiosis.

    This process, called crossover

    generates recombinant

    haploid genotypes that differfrom the haploid parental

    genotypes.

    There is an average of 1crossover per short arm and

    2 crossovers per long arm.

    Therefore the average

    chromosome length is 150

    cM

  • 8/14/2019 Mapping Course-Lecture 1

    5/21

    Recombination frequency as estimate of distance

    L1 L2 L3 L4

    Parental A1 A2= 7

    Parental B1 B2= 7

    Recombinant A1 B2= 5

    Recombinant B1 A2= 5

    Rf= 10/24= 42%

    Parental A3 A4= 10

    Parental B3 B4= 10

    Recombinant A3 B4= 2

    Recombinant B3 A4= 2

    Rf= 4/24= 16%

    There is a rough proportionality

    between the distance betweenlinked loci and the chance of a

    crossover between non-sister

    chromatids between loci.

    By determining the frequency

    of recombinants, it is possible to

    obtain a measure of the mapdistance between the genes.

    A recombinant frequency of

    0.01 is the distance between loci

    for which one product of

    meiosis out of 100 is

    recombinant

  • 8/14/2019 Mapping Course-Lecture 1

    6/21

    Chiasma interference

    Recombination frequency per length unit in

    wheat (Lukaszewski and Curtis 1993)

    In wheat there is strong chiasma interference(one chiasma reduces the pb. of 2nd chiasma in

    its proximity).

    As a result of chiasma interference and the

    initiation of chromosome pairing from

    telomeres, chiasma in wheat are favored at the

    end of the chromosomes.

    Small genetic distances in the centromeric

    region correspond to large genetic distances

    in the wheat physical maps.

  • 8/14/2019 Mapping Course-Lecture 1

    7/21

    Recombination fraction and CentiMorgans

    Meiotic chromosome

    Parental AA1 A2

    A1 A2

    B1 B2

    B1 B2

    A1 A2

    A1 A2

    B1 B2

    B1 B2

    Meiotic product

    Parental A

    Parental B

    Parental B

    When two crossovers are produced between two markers, no recombination is detected unlessan additional markers is included in the middle.

    Mapping functions (Haldane and Kosambi) correct the recombination fraction r for this

    possibility.

    Haldane: does not consider interference and is not appropriate for wheat

    Kosambi: considers chiasma interference: cM=100*[ log [(1+2r)/(1-2r)]

    Recombination fraction Kosambi cM

    Rf= 0.01 1.0 cMRf= 0.02 2.0 cMRf= 0.10 10.1 cMRf= 0.20 21.2 cM

    Rf= 0.49 114.9 cM

  • 8/14/2019 Mapping Course-Lecture 1

    8/21

    Three point analysis

    If the distance between three linked markers is known

    A--------5cM---------B

    A-----3cM-----C

    C------------8cM---------------B

    We can determine the likelihood of each order:

    C-----3cM-----A--------5cM---------B Likely

    A-----3cM-----C---2cM---B Unlikely

    A--------5cM---------B----------8cM-------------C Very unlikely

    The Mapmaker command THREE POINT calculates the likelihood of the

    three alternative orders for each three points and discards those that are very

    unlikely

  • 8/14/2019 Mapping Course-Lecture 1

    9/21

    Expected Genotypic Classes

    1:1:1:11:1:1:1DH

    1:1:1:11:1:1:1RIL or RSL

    1:2:1:2:4:2:1:2:19:3:3:1F2

    1:1:1:11:1:1:1BC1F

    2

    Codominant

    markers

    Dominant markersPopulation

    A1 A2

    A1 A2

    B1

    B2

    B1 B2Expected Genotypic Classes

  • 8/14/2019 Mapping Course-Lecture 1

    10/21

    Significance of linkage

    Example of observed and expected segregation values.

    38.2Example of F2, two locus L1 L2

    Genotype: AA : AH : AB : HA : HH : HB : BA : BH : BB

    Observed: 5.0 : 2.0 : 0:0 : 0:0 :11.0 : 0:0 : 0:0 : 0:0 : 6.0

    Expected: 1.5 : 3.0 : 1.5 : 3.0 : 6.0 : 3.0 : 1:5 : 3.0 : 1.5

    0.44Genotype: AA : AB : BA : BB

    Observed: 6.00 : 5.00 : 7.00 : 7.00

    Expected : 6.25 : 6.25 : 6.25 : 6.25

    Example of RSL, two locus L1

    L2 2

    The df of the 2 is the number of classes minus 1.

    Look for the critical value in a 2 table: critical value =0.05, 23df

    = 7.81

    Calculate 2=[(O-E)2/E]=(0.252+1.252+0.752+0.752)/6.25= 0.44

    Since 0.44

  • 8/14/2019 Mapping Course-Lecture 1

    11/21

    Calculating recombination frequencies in DH populations

    In a DH, backcross, or RSL population the calculation of the

    recombination fractions is very simple

    Count the number of recombinant chromosomes and divide by the total

    The total is the number of chromosomes for which we have information

    for the two loci

    RSLs

    L1 A A B B B A B B - A A A A A A B A B B B A B A B B B B

    L2 A A B B B B B A B B - B B B A A A B A A A B A A A A B

    #CO 0 0 0 0 0 1 0 1 1 1 1 1 0 1 0 0 1 1 0 0 0 1 1 1 0

    There are 12 recombiantion events in 25 scored pairs

    RF= 12/25= 0.48 Mapmaker calculates it as 0.48

  • 8/14/2019 Mapping Course-Lecture 1

    12/21

    Calculating recombination frequency in F2 populations

    In an F2, it is not possible to determine whether the H1H2 are parental or

    recombinant. The following examples show two different pairs of chromosomes in an

    F2 individual, one parental and one recombinant. The crossovers are indicated by x.

    Example 1: H1H2 genotype from parental chromosomes

    A A1

    A2

    A x B B

    A x B1

    B2

    B B B

    Score: A H1 H2 H B B

    Example 2: H1H2 genotype derived from recombinant chromosomesA A

    1x B

    2B B B

    B B1

    xA2

    A A A

    Score: H H1

    H2

    H H H

    Therefore, in an F2counting the recombinants is just an approximation.

    Mapmaker uses Maximum Likelihood to estimate the distances more precisely:

    Example 3 estimating RF in an F2:

    L3 A H A H B H H B A H B H A A H H B B A A H H H B

    L4 H H A H B H H B A H B H A A H H B B A B H H H B#CO 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0

    RF 3/48 0.0625 Mapmaker calculates it more precisely as 0.065

  • 8/14/2019 Mapping Course-Lecture 1

    13/21

    LOD scores

    The maximum likelihood method of estimation is a general statistical method to estimate

    the value of a parameter: the value chosen for the parameter is that which is most probable

    (ormost likely) given the occurrence of a certain set of observations.

    LOD score (Log10 of the odds ratio) is calculated as:

    Pb. of obtaining the observed data under linkage

    LOD= log -------------------------------------------------------------------------

    Pb. of obtaining the observed data under random assortment

    The ratio between the two probabilities is transformed to log10.

    The higher the value the more likely is the linkage.

    For example, a LOD=3 indicates that is 1000 times (103) more likely to obtain the observed

    data from linked markers than from unlinked ones.

    LOD scores are used to determine the most likely order of markers in a map.

    Mapmaker assigns 0 to the better order and negative LODs to alternative orders.

    For example, if order abc has LOD=0 and order La Lc Lb has a LOD score of 2, that

    indicates that it is 100 (102) less likely to obtain the observed data from the 2nd order than

    from the 1st

    order.

  • 8/14/2019 Mapping Course-Lecture 1

    14/21

    Recombinant inbred lines

    Since RILs have more generations for recombination, recombination

    frequencies need to be adjusted.

    Recombination levels observed in RILs can be related to recombination in apopulation derived from a single meiosis using Haldane and Waddington (1931)formulas.

    2r R R= ------- or r = -------

    1+2r 2-2R

    Some examples

    R r R/r

    0.01 0.005 1.98

    0.1 0.056 1.80

    0.2 0.125 1.60

    0.3 0.214 1.40

    0.4 0.333 1.20

    0.5 0.500 1.00

    When linkages are tight, the

    recombination frequency

    among RILs is twice the

    conventional F2 rate.

  • 8/14/2019 Mapping Course-Lecture 1

    15/21

    Mapping advice

    Take extreme care entering genotypic data. Merge linked markers into single group

    Review all double crossovers (if possible re-extract DNA and re run marker)

    Too many double crossovers for a marker indicate problems. Repeat PCR!

    Too many double crossovers along an individual suggest DNA problems. Recheck

    conflicting markers with the same tube of DNA.

    Try to have 1 DNA for the complete project, if not possible keep track of DNA replacements

    using a version number/date. Record mapping date of each markers If in doubt regarding the genotype of an individual, it is better to enter it as a missing value

    than to guess. Each error expands your map 2 crossovers!

  • 8/14/2019 Mapping Course-Lecture 1

    16/21

    Map comparisons

    Compare order with previous maps.http://maswheat.ucdavis.edu/

    Use our site in collaboratorsarea: user: cap05, password:Genomic$

    Map visualization tooluser: wheatcap, password:

    jagger

    We can enter your map there!

    Japanese site for theWheat Gene Catalogue:http://map.lab.nig.ac.jp:8080/cmap/

    http://maswheat.ucdavis.edu/http://map.lab.nig.ac.jp:8080/cmap/http://map.lab.nig.ac.jp:8080/cmap/http://maswheat.ucdavis.edu/
  • 8/14/2019 Mapping Course-Lecture 1

    17/21

    Answers to exercises

    Exercise 1: You have assigned four loci to chromosome 1A of wheat using nullitetrasomics. Using fifty doublehaploids from a hybrid between two varieties A and B you recover the following number of recombinant and parentalchromosomes:

    Locus 1- Locus 2: AA:25 AB:0 BA:1 BB:24 -> 1/50=0.02 = 2%Locus 1- Locus 3: AA:23 AB:3 BA:4 BB:20 -> 7/50=0.14 = 14%Locus 1- Locus 4: AA:25 AB:1 BA:1 BB:23 -> 2/50=0.04 = 4%Locus 2- Locus 3: AA:21 AB:3 BA:5 BB:21 -> 8/50=0.16 = 16%Locus 2- Locus 4: AA:24 AB:2 BA:1 BB:23 -> 3/50=0.06 = 6%Locus 3- Locus 4: AA:23 AB:2 BA:3 BB:22 -> 5/50=0.10 = 10%

    Construct a map of these four loci.AnswerL2- (2%) - L1- (4%) - L4 (10%) - L3

    Exercise 2: Chi Square for Double haploids, RSLs or BackcrossesIn similar population of 100 double haploids you mapped two loci: 1 and 5. You checked parental and recombinantchromosomesL1 L5 Observed: AA:28 AB:22 BA:24 BB:26L1 L5 Expected: AA:25 AB:25 BA:25 BB:25|O-E| 3 3 1 1

    2=[(O-E)2/E]=(32+32+12+12)/25= 20/25=0.8 Critical 2 = 7.81 No significant linkage

  • 8/14/2019 Mapping Course-Lecture 1

    18/21

    Answers to exercises

    Exercise 3: 2 for the F2 population

    5+2+11+6=24 individuals and 16 classesExpected per class: 24 individuals / 16 classes = 1.5 per class

    1 : 2 : 1 : 2 : 4 : 2 : 1 : 2 :1

    1.5 : 3.0 : 1.5 : 3.0 : 6.0 : 3.0 : 1.5 : 3.0 : 1.5

    Genotype: A1A

    2: A

    1H2: A

    1B2: H

    1A

    2: H

    1H2

    : H1B2

    : B1A

    2: B

    1H2

    : B1B2

    Observed: 5.0 : 2.0 : 0.0 : 0.0 : 11.0 : 0.0 : 0.0 : 0.0 : 6.0

    Expected: 1.5 : 3.0 : 1.5 : 3.0 : 6.0 : 3.0 : 1:5 : 3.0 : 1.5 .

    |Difference| 3.5 1.0 0.5 3.0 5.0 3.0 1.5 3.0 4.5

    2 =[(O-E)2/E]=38.2 Critical valueDf=8 2 =15.51

    Exercise 4: Recombination fractions in RI or SSD

    The following is the data for MapMaker for a Recombinant Inbred Population of 50 lines and 4 loci (RI or SSD). Calculate the

    corrected RF r between loci 1-2, 2-3 and 3-4. Remember that in a SSD r= R/(2-2R) where R is the proportion of recombinants in

    the RI.

    data type ri self50 4 0*locus1 aaaaa aaaaa bbbbb bbbbb aaaaa bbbbb aaaaa bbbbb aaaaa bbbbb

    *locus2 aaaaa aaaaa bbbbb bbbbb baaaa bbbbb aaaaa bbbbb aaaaa bbbbb

    *locus3 aaaaa aaaaa bbbbb bbbbb bbbbbbbbbb aaaaa bbbbb aaaaa bbbbb

    *locus4 aaaaa aaaaa bbbbb bbbbb bbbbb aaaaa bbbbb bbbbb aaaaa bbbbb

    L1-L2= 1/50 R= 0.02 r= 0.010

    L2-L3= 4/50 R= 0.08 r= 0.043

    L3-L4= 10/50 R= 0.20 r= 0.125

  • 8/14/2019 Mapping Course-Lecture 1

    19/21

    Answers to exercises

  • 8/14/2019 Mapping Course-Lecture 1

    20/21

    Answers to exercises

  • 8/14/2019 Mapping Course-Lecture 1

    21/21

    Answers to puzzle

    Xabg601 AAAHAHAAAAAHAAAAAAHHHA AAHAAHHAAHHHAAHAAA AAHAAAAHAHHHAAAA

    X X

    Xabg602 AAAHAHAAAAAHAAHAAAHHHA AAHAAHHAAHHHAAHAAA AAHAAAAHHHHHAAAA

    X X

    XBamy AAAHAHAAAAAHAAHAAAHHHA AAHAAHHHAHHHAAHAHA AAHAAAAHHHHHAAAA

    X

    XksuH11 AAAHAHAAAAAHAAHAAAHHHA AAHAAHHHAHHHAAHAHA AAHAHAAHHHHHHAAA

    X X X X

    Xabc305 AAAHAHAAAAHHAAHAAAHHHH AAHAAHHHHHHHAAHAHA AAHHHAAHHHHHHHAA

    X

    Xmwg2112 AAAHAHAAAAHHAAHAAAHHHH AHHAAHHHHHHHAAHAHA AAHHHAAHHHHHHHAA

    X X X X X X

    Xbcd1302 AHAHAHAAAHHHAAHAAHHHHH AHHAAHHHHHHHHAHAHA AAHHHAHHHAHHHHAH

    X X X X X

    Xpsr164 AHAHAHAAHHHHAAHHAHHHHH AHHAHHHHHHHHHAHAHA AHHHHAHHHAHAHHAH

    X X

    XksuC2 AHAHAHAAHHHHHAHHAHHHHH AHHAHHHHHHHHHAHAHA AHHHHAHAHAHAHHAH

    X

    Xpsr1051 AHAHAHAAHHHHHAHHAHHHHH AHHAHHHHHHHHHAHAHH AHHHHAHAHAHAHHAH

    X X

    Xbcd1262 AHAHAHHAHHHHHAHHHHHHHH AHHAHHHHHHHHHAHAHH AHHHHAHAHAHAHHAH

    X XXDhn6 AHAHAHHAHHHHHAHHHHHHAH AHHAHHHHHHHHHAHHHH AHHHHAHAHAHAHHAH

    X X

    Xpsr922 AHAAAHHAHHHHAAHHHHHHAH AHHAHHHHHHHHHAHHHH AHHHHAHAHAHAHHAH

    XX

    Xcdo669 AHAAAHHAHHHHAHAHHHHHAH AHHAHHHHHHHHHAHHHH AHHHHAHAHAHAHHAH

    X X XX X X

    Xwg622 AHAAA-HAHAHHAHAHHHHAAH AH-AHHHHAAHHHAHHHH AHHHHAHAHAHHHAAH

    X X

    Xpsr921 AHAAAAHAHAHHAHAHHHAAAH AHHAHHHHAAHHHAHHHH AHHHHAHAHAHHHAAH