inference of population structure and local...

49

Upload: others

Post on 03-Jul-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Inference of population structure and local

adaptation

using population genomic data

A tutorial using the R package LEA

[email protected]

Computational and Mathematical Biology groupUniversité Grenoble-Alpes, France

March, 2015

Page 2: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Outline: Part 1

I Theory (40min): Inference of population structure andindividual ancestry coe�cients

I Estimates based on sparse matrix factorization methods

I Detection of outlier loci

I Practice (40min): Data analysis using R

Page 3: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Outline: Part 2

I Theory (40min): Ecological association tests.

I Estimates based on latent factor models

I Detection of genetic variation linked to ecological gradients

I Practice (40min): Data analysis using R

Page 4: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Data: Genotypic matrices

I Population genetic data for a diploid species are recorded in agenotypic matrix

X =

0 0 1 . . .1 2 0 . . .0 2 0 . . .. . . . . .

with n rows (individual genotypes) and L columns (genomicposition or locus).

I Each locus is called an SNP (Single NucleotidePolymorphism), and each value (0,1,2) encodes the number ofmutant (or derived) nucleotides at locus `.

Page 5: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Inference of population structure

I Sampled genotypes in X share ancestry from K ancestralpopulations, where K is unknow.

I The ancestry coe�cients are stored in the Q-matrix, anddisplayed using barplot representations.

I qik is the fraction of individual i 's genome that originates from

ancestral population k .

I A F-matrix records allele frequencies at each locus and in all Kancestral populations.

Page 6: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Nonnegative Matrix Factorization: Rationale

I The observed frequencies are a combination of ancestralfrequencies, with weights given by Q

p(xi` = j) =K∑

k=1

qik fk`(j) , j = 0, 1, 2.

I which writes asP = QF .

I The matrices Q and F are factors of the P matrix that can beestimated using matrix factorization methods.

Page 7: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Visualization of ancestry coe�cients.

I The individual ancestry coe�cients, stored in the matrix Q,can be displayed using a barplot representation as follows

FIN GBR IBS CEU TSI CHS CHB JPT YRI LWK ASW PUR CLM MXL

0

0.5

1

0

0.5

1

Anc

estryco

efficients

sNMF

sNMF

Q-matrix computed from the 1000 Genomes project data

(n = 1, 092 individuals and L = 2.2M SNPs)

Page 8: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

NMF: Principles

I NMF (Nonnegative Matrix Factorization) computesleast-square estimates of admixture proportions

I For the normalized data matrix X, it �nds matrices Q and Fthat minimize

LS(Q,F) = ‖X−QF‖2F , Q,F ≥ 0 .

I We used additional constraints

K∑k=1

qik = 1 ,2∑

j=0

fk`(j) = 1 , j = 0, 1, 2.

Page 9: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Sparse NMF (sNMF)

I sNMF computes regularized least-square estimates ofadmixture proportions

LS(Q,F) = ‖X−QF‖2F +√α

n∑i=1

‖qi .‖21 , Q,F ≥ 0 ,

where α is a non-negative regularization parameter thatpenalizes intermediate values of ancestry coe�cients.

I We solved the LS problem by using the AlternatingNon-negativity-constrained Least Squares algorithm withthe Active Set method (ANLS-AS, Kim and Park 2011).

Page 10: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Cross-entropy criterion

I We employ a cross-validation technique to evaluate theprediction error of ancestry estimation algorithms

I Our criterion provides an estimate of cross-entropy,measuring the capability of an algorithm to correctly predictmissing (or masked) genotypes.

Page 11: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Choosing K and α: Cross-entropy criterion

I The cross-entropy criterion provides an estimate of thefollowing quantity

H(psample, ppred) = −2∑

j=0

psample(j) log ppredi` (j) , j = 0, 1, 2.

Page 12: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Cross-entropy for sNMF

I Running sNMF example for K = 1− 12 (α = 100)

project=snmf(genotype,K=1:12,alpha=100,entropy=T)

entropy=sapply(1:12,FUN=function(i)

cross.entropy(project,K = i))

Page 13: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Cross-entropy for sNMF

● ●

● ●

2 4 6 8 10 12

0.52

0.54

0.56

0.58

0.60

Cross−entropy

Number of ancestral populations (K)

entr

opy

Cross-entropy as a function of K. Lower values indicate better

models

Page 14: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Test assay used by Frichot et al. (2014)

Data set Sample size Number of SNPs

HGDP00778 934 78KHGDP00542 934 48.5KHGDP00927 934 124KHGDP00998 934 2.6KHGDP01224 934 10.6KHGDP-CEPH 1,043 660K1000 Genomes 1,092 2.2MA. thaliana 168 216K

Ref: Patterson et al. 2012; Li et al. 2008;The 1000 Genomes Project

Consortium 2012; Atwell et al. 2010

Page 15: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

HGDP-CEPH (660K SNPs): sNMF estimates are very close tothose of likelihood-based approaches

Ba

ntu

Ke

nya

Ba

ntu

So

uth

Afr

ica

Ma

nd

en

ka

Yo

ru

ba

Sa

nM

bu

tiP

yg

my

Bia

ka

Pyg

my

Orca

dia

n

Ad

yg

ei

Ru

ssia

n

Ba

sq

ue

Fre

nch

Ita

lia

n

Sa

rd

inia

n

Tu

sca

n

Mo

za

bite

Be

do

uin

Dru

ze

Pa

lestin

ian

Ba

loch

i

Bra

hu

i

Ma

kra

ni

Sin

dh

i

Pa

tha

n

Bu

ru

sh

o

Ha

za

ra

Uyg

ur

Ka

lash

Ha

n

Ha

n-N

Ch

ina

Da

iD

au

rH

ezh

en

La

hu

Mia

o

Oro

qe

n

Sh

eT

ujia

Tu

Xib

o Yi

Mo

ng

ola

Na

xi

Ca

mb

od

ian

Ja

pa

ne

se

Ya

ku

t

Me

lan

esia

n

Pa

pu

an

Ka

ritia

na

Su

ru

iC

olo

mb

ian

Ma

ya

Pim

a

Africa Europe Middle East Central/South Asia East Asia Oceania Americasa

0

0.5

1

0

0.5

1

An

ce

str

y c

oe

ffic

ien

ts

ADMIXTURE

sNMF

Ancestry estimates using ADMIXTURE (top) and sNMF (down)

Page 16: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

sNMF runs faster than likelihood-based approaches(HGDP-CEPH data, 660K SNPs)

5 6 7 8 9 10

10

6.6

3.3

0

5 10 15 20

42

25

8

Run

-tim

e (h

ours

) ADMIXTUREsNMF

ADMIXTUREsNMF

A B

Number of clusters Number of clusters

Page 17: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Detecting outlier loci: genome scan for selection

I Rationale: Locally adaptive loci exhibit larger di�erences inallele frequency than selectively neutral loci.

I Common approaches to separate selective from neutralprocesses focus on the evaluation of population di�erentiationstatistics across loci (Lewontin and Krakauer 1975). The mostclassical measure of population di�erentiation is

FST = 1−σ2Sσ2T

,

where σ2S is the variance of allele frequencies withinsubpopulations (or ancestral populations) and σ2T is thevariance in the total population.

Page 18: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Statistical testing problem

I Null Hypothesis: For each locus, we want to test thenull-hypothesis

FST = F neutralST

where F neutralST is the (unknown) population di�erentiation

value for non-adaptive alleles.

Histogram of −log(Fst)

−log(Fst)

Fre

qu

en

cy

0 2 4 6 8

02

04

06

08

01

00

12

0

Page 19: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Multiple testing and test calibration issues

I False Discovery Rate (FDR) is de�ned as

FDR = prob(False Discovery | Positive test) = q.

I The Benjamini-Hochberg algorithm provides a list of candidateloci with expected FDR = q (see R scripts).

I The algorithm requires that the test are correctly calibrated,ie, the distribution of p-values is uniform when we assume thatthe null hypothesis, H0, is correct.

Page 20: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Converting FST's into signi�cance values

I Theory: FST corresponds to the squared correlation coe�cientin a regression model where allele frequencies are regressed onpopulation labels. We de�ned

t − scores =

√(n − 2)FST1− FST

∼ t(n − 2)

where t(n − 2) is the Student distribution with n − 2 degreesof freedom.

I Using R, signi�cance values can be computed from t-scores asfollowsp.values=2-2*pt(t.scores, df=n-2, lower=T)

Page 21: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Test calibration

I In�ation/De�ation factor: Tests can be calibrated by rescalingt-scores as followsadj.p.values=2-2*pt(t.scores/lambda,df=n-2,lower=T)

Histogram of adjusted p−values

adj.p.values

Fre

quen

cy

0.0 0.2 0.4 0.6 0.8 1.0

020

4060

8010

012

014

0

Histogram of adjusted p−values

adj.p.values

Fre

quen

cy

0.0 0.2 0.4 0.6 0.8 1.0

050

100

150

200

Histogram of adjusted p−values

adj.p.values

Fre

quen

cy

0.0 0.2 0.4 0.6 0.8 1.0

010

020

030

040

0

Conservative, well-calibrated, and liberal test histograms

Page 22: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Genome scan result (simulated data with �known outliers'')

●●●

●●

●●

●●

●●

●●

●●●●

●●

●●●●

●●

●●

●●

●●●●

●●●●●●

●●

●●●●●

●●●

●●●●

●●

●●●●●

●●

●●

●●●●●●

●●●●

●●●●●●●

●●●●●●

●●●●●●

●●●●●●

●●●●●

●●●●●●●●●

●●●●●

●●●●●

●●●●

●●

●●●●

●●●●

●●

●●●●●●●●

●●●

●●●●●

●●

●●

●●●●●

●●

●●

●●

●●

●●●●●

●●

●●●●●

●●●●●●●●

●●●●

●●●

●●

●●●

●●

●●●●●

●●●●

●●

●●●●●●

●●●●●

●●

●●●●●

●●●●●●

●●

●●●●●●●●●

●●●

●●●●●●●●

●●

●●●●

●●●●●●●●

●●●●●●●

●●

●●

●●

●●●

●●●●●

●●●●

●●

●●●●●

●●●●●●

●●●

●●

●●

●●●

●●

●●

●●●

●●●●●

●●●●●●●●●

●●●

0 100 200 300 400 500

02

46

81

0

Index

−lo

g1

0(a

dj.p.v

alu

es)

●●

●●

● ● ●●●

● ●●

●●

●●

●●●

●●

●●

● ●

●●

●●

●●●●

●●

●●

● ●●●

● ●●

●●

●●

●●

●●

●●

● ●

●● ●

●●●

●●

●●●

●●●

●●

● ●

●●

●●

●● ●

●●

●●

●●

●●

Signi�cance values for outlier tests (log scale, 500 loci). Red points

correspond to loci targeted by natural selection. Larger circles

correspond to loci in the candidate list.

Expected FDR = .15 � Observed FDR = .10

Page 23: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Summary

I Sparse non-negative matrix factorization/ANLS-AS is a fastand accurate method for estimating individual ancestrycoe�cients.

I Without loss of accuracy, sNMF computed estimates of ancestrycoe�cients within run-times approximately 10 to 100 timesfaster than those of the likelihood-based approach ADMIXTURE.

I Estimation of ancestral allele frequencies enables genome scanfor adaptive allele based on population di�erentiation statistics.

I Using R standard statistical tools can be applied to the controlof the FDR when detecting outlier loci.

Page 24: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Practice (40 minutes)

I Instructions, scripts and data sets are available from OlivierFrancois' webpage. Add∼/LEA/Tutorial-LEA.zipto the webpage URL.

I Read install instructions from the snmf.r script

I Install the LEA from R Bioconductor (Example running Linux)install.packages("LEA0.2.tar.gz",repos=NULL,

type="source")

Page 25: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Part 2

I Theory (40min): Ecological association tests.

I Estimates based on latent factor models

I Detection of genetic variation linked to ecological gradients

I Practice (40min): Analysis of data using R

Page 26: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Introduction

I Tests to identify associations between loci and environmentalor ecological gradients

I Context: ecological genomics and health applications

I Correction for population structure, demography and otherconfounding factors

I Applications to human genomics data sets

Page 27: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Human ecological genomics

I Objective: Evaluating the e�ects of interactions betweenhumans and their environments (climate, diet, pathogens) onhealth and well-being.

I Understanding the genetic origins of chronic diseases(diabetes, asthma, etc).

I Examples:I Excess of genes associated with autoimmune diseases such as

celiac disease, type 1 diabetes, and multiples sclerosis inresponse to pathogenic densities.

I EGLN1 and PPARA confer tolerance to hypoxia andadaptation to high altitude in tibetans (Simonsen et al. 2011).

Page 28: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Local adaptation

I Local adaptation through natural selection plays a central rolein shaping human genetic variation.

I A way to investigate signatures of local adaptation in genomesis to identify allele frequencies that exhibit high correlationwith environmental variables (Novembre and Di Rienzo 2009).

Page 29: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Signatures of local adaptation

A

G

Adaptive mutation Neutral mutation

Page 30: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Genome scans

I Loci with high correlations or (z-scores) are potentially underselection

Page 31: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Caveat

I In�ated number of false positives caused by populationstructure and isolation by distance patterns.

Page 32: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Model for testing associations between loci and ecological gradients

I We proposed to combine linear regression and factor models(Frichot et al. 2013).

I Latent Factor Mixed model (LFMM):

Yi` = µ` + BT` Xi + UT

i V` + εi` (1)

I B` is a d -dimensional vector of regression coe�cients.

Page 33: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Rationale

I The matrix UTV estimates the part of genetic variation thatcannot be explained by adaptation to the environment.

Yi` = µ` + BT` Xi + UT

i V` + εi`

I U = (Ui )i=1,...,K contains K unknown factors (U is ofdimension n × K .

I ε is the residual error from low-rank approximation (K ≤ n).

I The number of factors K can be chosen by evaluating thenumber of ancestral populations in ancestry estimationprograms.

Page 34: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Background literature on LFMMs

I They can be viewed as Structural Equation Models (Sanchezet al. JASA 2005)

I EM algorithms (Sammel and Ryan, Biometrics 1996; An et al.Stat. Med. 2013)

I Bayesian factor regression models (West, Bayesian Stat. 2003;Woodward et al. Biometrics 2014)

I Use of control gene lists (Listgarden et al. PNAS 2010)

I Population genetics (Frichot et al MBE 2013)

Page 35: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Model proposed

I Bayesian Hierarchical model with (weakly) informative priordistributions

I Prior distribution on regression coe�cients B ∼ N(0,Λ)

I Prior distribution on factor Ui ∼ N(0, σ2U IK ) and V` ∼ N(0, IK )

I Hyperprior distributions on Λ and σ2U are Inv-Gammadistributions (sparsity).

I Fixed number of latent factors K .

Page 36: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Estimation algorithm

I Stochastic algorithm: Gibbs sampler based on alternatingregression models.

I Simple Monte-Carlo estimate for the standard deviation ofregression coe�cients

I Computation of locus-speci�c z-scores and p-values.

Page 37: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Human Genome Diversity Project (SNP arrays)

I Worldwide sample of DNA from 1,043 individuals in 52populations

I The genotypes were generated on Illumina 650K arrays

I Climatic data for each of the 52 population samples from theWorldClim database at 30 arcsecond (1km2) resolution

I These data included 11 bioclimatic variables interpolated fromglobal weather station data collected during a 50 year period(1950-2000), and were summarized with their PC1

Page 38: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Results

I A total of 2,624 (0.4%) SNPs obtained z-scores > 5.

Page 39: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Results

I A total of 2,624 (0.4%) SNPs obtained z-scores > 5.

I A total of 508 (0.08%) SNPs obtained z-scores > 6.

Page 40: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Results

I A total of 2,624 (0.4%) SNPs obtained z-scores > 5.

I A total of 508 (0.08%) SNPs obtained z-scores > 6.

I A total of 65 (0.007%) SNPs obtained z-scores > 7.

Page 41: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Results

I Among loci with z-scores greater than 5, 28 were GWAS-SNPswith known disease or trait association.

I Among the 65 SNPs with z-scores greater than 7, 31 wereintra-genic SNPs.

Page 42: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

GWAS-SNPs associated with environmental predictors.

Gene Trait association − log10P-value

OCA2/HERC2 Eye and hair color, pigmentation 9.15DHCR7 Vitamin D levels 7.78SLC45A2 Hair color 6.90

Intergenic MUC7 Alcoholism 8.91ZMIZ1 Crohn's disease 8.77KLK3 Prostate Cancer 8.61ICOSLG Celiac disease 7.02HLA-DRA Systemic sclerosis 6.97

NCAPG-LCORL Height 9.43BOK Brain structure and development 9.43

Page 43: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Genic SNPs associated with environmental predictors.

Gene Annotation (dbSNPs) − log10p-value

EPHB4 Heart morphogenesis and angiogenesis 16.54NRG1 Nervous system development, cell proliferation 16.21RBM19 Regulation of embryonic development 15.98EYA2 Eye development and DNA repair 15.9POLA1 Mitotic cell cycle and cell proliferation 15.87

Page 44: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Summary

I Fast algorithms based on low rank approximations (ML andGibbs Sampler algorithms)

I Separate neutral from adaptive variation

I Many new adaptive SNPs with functions associated tomulticellular organ development

Page 45: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Using LFMM in practice

I Narrow the range of the number of factors, K , by investigatingplausible values using sNMF

I Multiple runs of LFMM increase the power to detectsigni�cant associations.project=lfmm("example.lfmm","gradient.env",K=8,

it=10000,b=5000,rep=5)

I Get the z-scores and combine them using the median valuezs=z.scores(project,K=8)

zs.combined=apply(zs,MARGIN=1,median)

Page 46: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Test calibration using in�ation factor correction

I For each locus `, p-values are calibrated using the followingformula

p` = prob(χ21 > z2` /λ) ` = 1, . . . , L.

I Suggested value for λ close to the genomic in�ation factor

λ = median(z2)/0.456

(λ ≈ 1 indicates that the histogram of p-values is �at).

I Visual inspection of histograms or QQ-plots may help as well.

Page 47: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Multiple testing issue and candidate list

I Candidate list of genes associated with ecological gradients areprovided by the Benjamini-Hochberg FDR control algorithm.

Page 48: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Practice (40 minutes)

I Instructions, scripts and data sets are available from OlivierFrancois' webpage. Add∼/LEA/Tutorial-LEA.zipto the webpage URL.

I Read instructions from the lfmm.r script

Page 49: Inference of population structure and local …membres-timc.imag.fr/.../LEA/files/SlidesTutorial.pdfInference of population structure and local adaptation using population genomic

Acknowledgments

I Eric Frichot

I Sean Schoville, François Mathieu, Théo Trouillon, GuillaumeBouchard

I This work received support from the �Région Rhône-Alpes�,Xerox Research, National Science Foundation USA, andEnsimag-Grenoble INP.