genome-wide association studies in 50 agronomic traits in peanut using the ‘reference set’...

1
Inclusive Market-Oriented Development (IMOD) – our approach to bringing prosperity in the drylands. ICRISAT is a member of the CGIAR Consortium. Manish K Pandey 1 , Hari D Upadhyaya 1 , Abhishek Rathore 1 , Vincent Vadez 1 , MS Sheshashayee 2 , Manda Sriswathi 1 , Mansee Govil 1 , Ashish Kumar 1,3 , MVC Gowda 4 , Shivali Sharma 1 , Falalou Hamidou 1 , V Anil Kumar 1 , Pawan Khera 1,5 , Ramesh S Bhat 4 , Aamir W Khan 1 , Sube Singh 1 , Hongjie Li 6 , Emmanuel Monyo 7 , H L Nadaf 4 , Ganapati Mukari 4 , Scott Jackson 8 , Baozhu Guo 5 , Xuanqiang Liang 9 , Rajeev K Varshney 1, * 1 International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India; 2 University of Agricultural Sciences, GKVK, Bangalore, India; 3 Jawahar Lal Nehru Krishi Vishwa Vidyalaya, Jabalpur, India; 4 University of Agricultural Sciences, Dharwad, India; 5 Crop Protection and Management Research Unit, USDA-ARS, Tifton, USA; 6 ShangDong Shofine Seed Technology Co Ltd, Jining, China; 6 International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Nairobi, Kenya; 8 The University of Georgia, Athens, USA; 9 Crop Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, China; *Address for correspondence: [email protected] Genome-wide association studies in 50 agronomic traits in peanut using the ‘reference set’ comprising 300 genotypes from 48 countries of the semi-arid tropics of the world Abstract Acknowledgements Summary S. No. Traits MTAs P value range PVE % range Disease resistance 1 Aspergillus (ASP) 1 9.68E-07 24.69 2 Groundnut rosette disease (GRD) 31 5.25E-20 − 7.66E-07 10.25−39.29 3 Late leaf spot (LLS) 1 1.54E-06 18.10 4 Early leaf spot (ELS) 6 4.21E-07 − 7.59E-08 9.18−10.99 Quality and nutrition 5 Oil content (OC) 25 3.18E-10 − 1.70E-06 5.84−40.37 6 Oleic acid (OLE) 2 2.47E-06 16.42 7 Oleic/linoleic acid ratio (OLR) 22 1.52E-12 − 1.95E-06 13.67−47.45 8 Zinc content (ZC) 1 8.62E-07 15.63 9 Protein content (PC) 11 8.62E-08 − 2.14E-06 11.63−36.09 10 Sound mature kernel % (SMK%) 3 8.13E-08 − 2.11E-06 22.21−29.02 Physiological and yield component 11 Leaf length (LLN) 30 6.67E-08 − 1.26E-06 12.48−21.61 12 Harvest index (HI) 41 7.35E-17 − 2.12E-06 5.81−31.80 13 Root volume (RTVOL) 1 6.16E-07 39.59 14 Root water loss (RWL) 2 8.28E-07 − 6.11E-08 11.60 −13.66 15 Haulm weight (HLMWT) 6 3.59E-07 − 1.42E-06 10.16−12.44 16 SPAD chlorophyll meter reading (SCMR) 127 1.27E-10 − 2.12E-06 7.78−18.27 17 Seed length (SDL) 9 1.04E-06 − 2.78E-07 11.81−13.29 18 Shelling percentage (ShP) 2 8.15E-08 − 1.99E-06 34.43−36.45 19 Shoot weight (ShWT) 2 5.03E-07 − 1.82E-06 10.38−12.13 20 Specific leaf area (SLA) 3 2.48E-07 − 2.09E-06 8.72−19.54 21 Seed width (SDWD) 3 1.82E-07 − 1.39E-06 14.91−30.09 22 Seed weight (SDWT) 5 7.89E-08 − 2.16E-06 12.73−26.08 23 Total leaf area (TLA) 3 1.17E-07 − 2.03E-06 13.07−15.52 24 Total leaf weight (TLWT) 2 1.17E-06 − 1.20E-06 10.03−10.25 25 Pod yield 33 4.55E-08 − 2.16E-06 9.74−37.36 Traits under well watered and drought stress 26 Harvest index_drought stress 36 2.97E-10 − 1.95E-06 8.95−85.40 27 Harvest index_well watered 11 2.35E-10 − 1.40E-06 8.83−39.29 28 Haulm weight_drought stress 10 3.56E-12 − 1.77E-06 9.34−32.26 29 Haulm weight_well watered 10 1.01E-10 − 2.09E-06 8.93−21.25 30 Leaf area_well watered 1 1.77E-06 9.89 31 Leaf area_drought stress 1 8.16E-07 19.84 32 Leaf dry weight_drought stress 4 8.43E-07 − 1.24E-06 9.73−19.09 33 SCMR_drought stress 10 1.05E-07 − 1.91E-06 8.24−12.42 34 SCMR_well watered 16 8.05E-10 − 2.12E-06 9.23−14.31 35 Seed weight_drought stress 7 3.20E-22 − 9.03E-08 12.63−90.09 36 Seed weight_well watered 46 7.33E-33 − 1.93E-06 10.32−88.90 Total 524 7.33E-33 − 1.32E-06 5.81−90.09 Global / base collection Composite collection Core collection Multiseason and multilocation evaluation 518 component traits for 50 traits Reference set (304 genotypes) Wild, landraces and breeding lines Association analysis Marker-trait associations Molecular ID based on SSRs for all the genotypes of peanut reference set Low PIC for DArTs (0.16) than SSRs (0.71) Population structure identified three subgroups (K = 3) based on SSRs LD decay of 15 cM in peanut genome based on SSRs 524 MTAs identified for 36 traits with PVE % ranging from 5.81 to 90.09 Financial support from the Peanut Foundation (USA) and the CGIAR Generation Challenge Programme (GCP) is gratefully acknowledged. This work has been undertaken as part of the CGIAR Research Program on Grain Legumes. A high-resolution genotyping (4,957 polymorphic DArT features and 154 SSRs) and multiple season phenotyping data for 50 important agronomic, disease and quality traits were generated on the ‘reference set’ of peanut. These data were subjected to comprehensive analyses for allelic diversity, population structure, linkage disequilibrium (LD) decay and marker-trait association (MTA) in the ‘reference set’. The DArT markers (2.0 alleles/locus, 0.125 PIC) showed lower allele frequency and polymorphic information content (PIC) than SSR markers (22.21 alleles /locus, 0.715 PIC). Multi-allelic SSRs identified three sub-groups while the LD simulation trend line based on squared-allele frequency correlations (r 2 ) predicted LD decay of 15-20 cM. Association analysis resulted in identification of 524 highly significant MTAs (pvalue >2.1x10-6) with wide phenotypic variance range (5.81-90.09%) for 36 traits. These MTAs upon validation may be deployed in genomics-assisted breeding for developing improved peanut cultivars with enhanced resistance/tolerance to different stress and higher pod yield with improved oil/ seed/ nutritional quality. These results have been published recently in detail in the PLOS ONE journal (PLoS ONE, 9(8):e105228, DOI:10.1371/journal.pone.0105228). Phenotyping Disease resistance Quality and nutritional Physiological and yield component Traits under well watered and drought stress LD decay based on SSR markers DArT : 4,597 SSR : 154 Population structure and association analysis Genome-wide distribution of marker-trait associations (a) Grouping based on SSR and DArT markers (b) Principle co-ordinate analysis based on SSR and DArT markers (c) Population structure at different K values (d) Three subgroups based on mean Fst values Global distribution of genotypes containing linked-marker allele(s) for different traits Population structure

Upload: icrisat

Post on 19-Aug-2015

3 views

Category:

Government & Nonprofit


0 download

TRANSCRIPT

Page 1: Genome-wide association studies in 50 agronomic traits in peanut using the ‘reference set’ comprising 300 genotypes from 48 countries of the semi-arid tropics of the world

Inclusive Market-Oriented Development (IMOD) –our approach to bringing prosperity in the drylands.

ICRISAT is a member of the CGIAR Consortium.

Manish K Pandey1, Hari D Upadhyaya1, Abhishek Rathore1, Vincent Vadez1, MS Sheshashayee2, Manda

Sriswathi1, Mansee Govil1, Ashish Kumar1,3, MVC Gowda4, Shivali Sharma1, Falalou Hamidou1, V Anil Kumar1,

Pawan Khera1,5, Ramesh S Bhat4, Aamir W Khan1, Sube Singh1, Hongjie Li6, Emmanuel Monyo7, H L Nadaf4,

Ganapati Mukari4, Scott Jackson8, Baozhu Guo5, Xuanqiang Liang9, Rajeev K Varshney1,*

1International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India; 2University of Agricultural Sciences, GKVK,

Bangalore, India; 3Jawahar Lal Nehru Krishi Vishwa Vidyalaya, Jabalpur, India; 4University of Agricultural Sciences, Dharwad, India; 5Crop

Protection and Management Research Unit, USDA-ARS, Tifton, USA; 6ShangDong Shofine Seed Technology Co Ltd, Jining, China; 6International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Nairobi, Kenya; 8The University of Georgia, Athens, USA; 9Crop

Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, China; *Address for correspondence: [email protected]

Genome-wide association studies in 50 agronomic traits in peanut using the ‘reference set’ comprising 300 genotypes from 48 countries of the semi-arid tropics of the world

Abstract

Acknowledgements

Summary

S. No. Traits MTAs P value range PVE % range

Disease resistance

1 Aspergillus (ASP) 1 9.68E-07 24.69

2 Groundnut rosette disease (GRD) 31 5.25E-20 − 7.66E-07 10.25−39.29

3 Late leaf spot (LLS) 1 1.54E-06 18.10

4 Early leaf spot (ELS) 6 4.21E-07 − 7.59E-08 9.18−10.99

Quality and nutrition

5 Oil content (OC) 25 3.18E-10 − 1.70E-06 5.84−40.37

6 Oleic acid (OLE) 2 2.47E-06 16.42

7 Oleic/linoleic acid ratio (OLR) 22 1.52E-12 − 1.95E-06 13.67−47.45

8 Zinc content (ZC) 1 8.62E-07 15.63

9 Protein content (PC) 11 8.62E-08 − 2.14E-06 11.63−36.09

10 Sound mature kernel % (SMK%) 3 8.13E-08 − 2.11E-06 22.21−29.02

Physiological and yield component

11 Leaf length (LLN) 30 6.67E-08 − 1.26E-06 12.48−21.61

12 Harvest index (HI) 41 7.35E-17 − 2.12E-06 5.81−31.80

13 Root volume (RTVOL) 1 6.16E-07 39.59

14 Root water loss (RWL) 2 8.28E-07 − 6.11E-08 11.60 −13.66

15 Haulm weight (HLMWT) 6 3.59E-07 − 1.42E-06 10.16−12.44

16 SPAD chlorophyll meter reading (SCMR) 127 1.27E-10 − 2.12E-06 7.78−18.27

17 Seed length (SDL) 9 1.04E-06 − 2.78E-07 11.81−13.29

18 Shelling percentage (ShP) 2 8.15E-08 − 1.99E-06 34.43−36.45

19 Shoot weight (ShWT) 2 5.03E-07 − 1.82E-06 10.38−12.13

20 Specific leaf area (SLA) 3 2.48E-07 − 2.09E-06 8.72−19.54

21 Seed width (SDWD) 3 1.82E-07 − 1.39E-06 14.91−30.09

22 Seed weight (SDWT) 5 7.89E-08 − 2.16E-06 12.73−26.08

23 Total leaf area (TLA) 3 1.17E-07 − 2.03E-06 13.07−15.52

24 Total leaf weight (TLWT) 2 1.17E-06 − 1.20E-06 10.03−10.25

25 Pod yield 33 4.55E-08 − 2.16E-06 9.74−37.36

Traits under well watered and drought stress

26 Harvest index_drought stress 36 2.97E-10 − 1.95E-06 8.95−85.40

27 Harvest index_well watered 11 2.35E-10 − 1.40E-06 8.83−39.29

28 Haulm weight_drought stress 10 3.56E-12 − 1.77E-06 9.34−32.26

29 Haulm weight_well watered 10 1.01E-10 − 2.09E-06 8.93−21.25

30 Leaf area_well watered 1 1.77E-06 9.89

31 Leaf area_drought stress 1 8.16E-07 19.84

32 Leaf dry weight_drought stress 4 8.43E-07 − 1.24E-06 9.73−19.09

33 SCMR_drought stress 10 1.05E-07 − 1.91E-06 8.24−12.42

34 SCMR_well watered 16 8.05E-10 − 2.12E-06 9.23−14.31

35 Seed weight_drought stress 7 3.20E-22 − 9.03E-08 12.63−90.09

36 Seed weight_well watered 46 7.33E-33 − 1.93E-06 10.32−88.90

Total 524 7.33E-33 − 1.32E-06 5.81−90.09

Global / base collection

Composite

collection

Core

collection

Multiseason and multilocation evaluation

518 component traits for 50 traits

Reference set (304 genotypes)Wild, landraces and breeding lines

Association analysis Marker-trait associations

Molecular ID based on SSRs for all the genotypes of peanut reference set

Low PIC for DArTs (0.16) than SSRs (0.71)

Population structure identified three subgroups (K = 3) based on SSRs

LD decay of 15 cM in peanut genome based on SSRs

524 MTAs identified for 36 traits with PVE % ranging from 5.81 to 90.09

Financial support from the Peanut Foundation (USA) and the CGIAR Generation

Challenge Programme (GCP) is gratefully acknowledged. This work has been

undertaken as part of the CGIAR Research Program on Grain Legumes.

A high-resolution genotyping (4,957 polymorphic DArT features and 154 SSRs) and multiple season phenotyping data for

50 important agronomic, disease and quality traits were generated on the ‘reference set’ of peanut. These data were subjected to

comprehensive analyses for allelic diversity, population structure, linkage disequilibrium (LD) decay and marker-trait association (MTA)

in the ‘reference set’. The DArT markers (2.0 alleles/locus, 0.125 PIC) showed lower allele frequency and polymorphic information

content (PIC) than SSR markers (22.21 alleles /locus, 0.715 PIC). Multi-allelic SSRs identified three sub-groups while the LD simulation

trend line based on squared-allele frequency correlations (r2) predicted LD decay of 15-20 cM. Association analysis resulted in

identification of 524 highly significant MTAs (pvalue >2.1x10-6) with wide phenotypic variance range (5.81-90.09%) for 36 traits. These

MTAs upon validation may be deployed in genomics-assisted breeding for developing improved peanut cultivars with enhanced

resistance/tolerance to different stress and higher pod yield with improved oil/ seed/ nutritional quality. These results have been

published recently in detail in the PLOS ONE journal (PLoS ONE, 9(8):e105228, DOI:10.1371/journal.pone.0105228).

Phenotyping Disease resistance

Quality and nutritional

Physiological and yield

component

Traits under well watered and

drought stress

LD decay based on SSR markers

DArT : 4,597

SSR : 154

Population structure and association analysis

Genome-wide distribution of marker-trait

associations

(a) Grouping based on SSR and DArT markers

(b) Principle co-ordinate analysis based on SSR

and DArT markers

(c) Population structure at different K values

(d) Three subgroups based on mean Fst values

Global distribution of genotypes containing

linked-marker allele(s) for different traits

Population structure