nonparametric bayesian modelling of individual co ... · nonparametric bayesian modelling of...

22
Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails Am´ elie Cr´ epet and Jessica Tressou ANSES, French agency for food, environmental and occupational health safety INRA UR-1204, M´ et@risk, Paris, France Journeys MAS, Bordeaux, 31 August - 3 September, 2010

Upload: truongquynh

Post on 21-Jun-2018

219 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Nonparametric bayesian modelling of individualco-exposures to various pesticides to determine

cocktails

Amelie Crepet and Jessica Tressou

ANSES, French agency for food, environmental and occupational health safety

INRA UR-1204, Met@risk, Paris, France

Journeys MAS, Bordeaux, 31 August - 3 September, 2010

Page 2: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Pericles project (ANR 2009-2011)

For statisticiansWhat are the cocktails of pesticides simultaneously present in thediet ?

Co-exposure assessment

Co-exposure clustering

For toxicologistsWhat are the possible combined effects of multiple residues ofpesticides ?

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 2

/ 22

Page 3: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Data

Contamination levels : (DGCCRF, DGAL, SISE-EAUX)

p = 1, . . . , P = 79 pesticidesa = 1, . . . , 121 commoditiesH : 0 if the value < LOD

Consumption : the National French survey (INCA 2, 2006)

consumed quantities cia of n = 3, 337 individualsn = 1, 439 children (3-17 years) and n = 1, 898 adults7 days detailed

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 3

/ 22

Page 4: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Empirical exposure distribution ximp

ximp = log10(Ap

∑a=1

cia × qpam/wi)

M=100 values computed byrandomly sampling for each individual i and pesticide p

consumptions of food a observed on a day : ciapesticide residue levels qpam of the different commodities Ap

dividing by the associated individual body weight wi

normalized the logarithm of exposures

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 4

/ 22

Page 5: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Exposure Structure

Co-exposures, ximp, i :individuals ; m :residues levels ; p :pesticides

M = 1, 95th percentile of exposure to each pesticide PM = 100, account for contamination variability

Pesticide 1 Pesticide p Pesticide P

Individual 1

Individual i

Individual n

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 5

/ 22

Page 6: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Infinite Mixture ModelsNon-parametric Bayesian Models

A way of getting very flexible modelsNo assumption on the number of mixture componentsNo prior parametric assumptionInfer an adequate model (size/complexity) without doingBayesian model comparison (AIC, DIC...)Derived from finite parametric model with number of parametersgoing to infinity

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 6

/ 22

Page 7: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Mixture Models

Let x = (x1, . . . , xn), with xi a P dimensional vector xi = (xi1, . . . , xiP)distributed with a density of probability :

f (x) =∫

Θk(x|θ)G(dθ)

wherek(.|θ) the known density of the mixture components,θ ∈ Θ is a latent variable,G the unknown mixture distribution, infinite dim. parameter

How to place an appropriate prior on G ?

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 7

/ 22

Page 8: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Dirichlet Process - DP

Define a prior on a unknown probability distribution

G|γ, H ∼ D(dG|γ, H) is a Dirichlet process with parameters

a real γ > 0a probability measure H

if and only if for any partition B1, . . . , BK of Ω,

(G(B1), . . . , G(BK)) ∼ Dir(γH(B1), . . . , γH(BK))

E[G(B)] = H(B) V[G(B)] = H(B)(1−H(B))/(1 + γ)

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 8

/ 22

Page 9: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Dirichlet Process Mixture Models - DPM

G ∼ DP(γ, H)

θi|G ∼ G

xi|θi ∼ k(.|θi)

G

xi

H

γ

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 9

/ 22

Page 10: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Stick-Breaking Representation - SB

Sample G ∼ DP(γ, H)⇔ G =∞∑

k=1πkδφk

φk ∼ H

Infinite mixing proportions,∞∑

k=1πk = 1 : πk = βk

k−1∏l=1

(1− βl)

Infinite sequence of Beta r.v. : βk ∼ Beta(1, γ)

Good quality of approximation for reasonable K < ∞

G =K∑

k=1πkδφk (Ishwaran and James 2001)

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 10

/ 22

Page 11: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Stick-Breaking Representation - SB

G

xi

H

γ

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 11

/ 22

Page 12: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Modeling co-exposure to pesticides M = 1

x = (x1, . . . , xn), with xi a P dimensional vector xi = (xi1, . . . , xiP)

xi|θi ∼ k(.|θi) (1)θi|G ∼ G (2)

G ∼ DP(γ, H) (3)

kernel k : a MultivariateNormal N (µ, τ), φ = (µ, τ)base probability measure H : the conjugate Normal-Wishart

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 12

/ 22

Page 13: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Model for cocktails of pesticides M = 100

One level of hierarchy

xim = (xpim, p = 1, . . . , P) with i = 1, . . . , n and m = 1, . . . , M

xim|θim ∼ k(.|θim) (4)θim|Gi ∼ Gi (5)

Gi ∼ DP(αi, G0) (6)G0 ∼ DP(γ, H) (7)

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 13

/ 22

Page 14: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Algorithm and Random Block

Gibbs sampler based on the Stick-Breaking priors

Random Block to reduce the heavy computational burdenat each Gibbs cycle select d < P pesticidesa subset of random observations xi = (xil1 , . . . , xild) is used insteadof xi = (xi1, . . . , xiP)

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 14

/ 22

Page 15: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Validation data

Comparison between Stick-breaking (SB) and “Random-BlockStick-Breaking” (RB-SB) algorithms (N = 30 atomes, 30000 iterations)

0 200 400 600 800 1000

−11

000

−90

00−

8000

−70

00−

6000

−50

00−

4000

Iterations

Q−

crite

rion

SBSB−RB

Q-criterion of the first 1, 000 iterationsperformed with the SB and the SB-RBalgorithms

1 2 3 4 5

−10

−5

05

10

Optimal partion obtained with the SBalgorithm : mixture of 3 gaussiandistributions

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 15

/ 22

Page 16: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Children co-exposure to pesticides

Three main clusters of children with similar co-exposuresBox-plot of the cluster 1 : 699 children

−2 −1 0 1 2 3

AcrinathrinAzinphos−methyl

AzoxystrobinBenalaxylBifenthrinBitertanol

BromopropylateBupirimate

CaptanCarbendazim

CarbofuranChlordane

ChlorfenvinphosChlorothalonilChlorpropham

ChlorpyrifosChlorpyrifos−methylChlorthal−dimethyl

CypermethrinCyprodinil

DDTDeltamethrin

DiazinonDichlofluanid

DichlorvosDicofol

DieldrinDifenoconazole

DimethoateDiphenylamine

DithiocarbamateEndosulfan

EsfenvalerateEthion

FenhexamidFenitrothionFludioxonil

FolpetHeptachlor

HexachlorobenzeneHexaconazole

ImazalilImidacloprid

IprodioneKresoxim−methyl

Lambda−CyhalothrinLindaneLinuron

MalathionMaleic hydrazide

Metalaxyl−MMethamidophos

MethidathionMethiocarb

MethomylMyclobutanil

OmethoateOxadixyl

PenconazolePhosalone

PhosmetPirimicarb

Pirimiphos−methylProchloraz

ProcymidonePropargite

PropyzamidePyrimethanilPyriproxyfenQuinoxyfen

SulphurTebuconazoleTebufenozideTebufenpyrad

ThiabendazoleThiophanate−methyl

TolylfluanidTriadimenolVinclozolin

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 16

/ 22

Page 17: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Box-plot of the cluster 2 : 238 children−2 −1 0 1 2 3

AcrinathrinAzinphos−methyl

AzoxystrobinBenalaxylBifenthrinBitertanol

BromopropylateBupirimate

CaptanCarbendazim

CarbofuranChlordane

ChlorfenvinphosChlorothalonilChlorpropham

ChlorpyrifosChlorpyrifos−methylChlorthal−dimethyl

CypermethrinCyprodinil

DDTDeltamethrin

DiazinonDichlofluanid

DichlorvosDicofol

DieldrinDifenoconazole

DimethoateDiphenylamine

DithiocarbamateEndosulfan

EsfenvalerateEthion

FenhexamidFenitrothionFludioxonil

FolpetHeptachlor

HexachlorobenzeneHexaconazole

ImazalilImidacloprid

IprodioneKresoxim−methyl

Lambda−CyhalothrinLindaneLinuron

MalathionMaleic hydrazide

Metalaxyl−MMethamidophos

MethidathionMethiocarb

MethomylMyclobutanil

OmethoateOxadixyl

PenconazolePhosalone

PhosmetPirimicarb

Pirimiphos−methylProchloraz

ProcymidonePropargite

PropyzamidePyrimethanilPyriproxyfenQuinoxyfen

SulphurTebuconazoleTebufenozideTebufenpyrad

ThiabendazoleThiophanate−methyl

TolylfluanidTriadimenolVinclozolin

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 17

/ 22

Page 18: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Box-plot of the cluster 3 : 239 children−2 −1 0 1 2 3

AcrinathrinAzinphos−methyl

AzoxystrobinBenalaxylBifenthrinBitertanol

BromopropylateBupirimate

CaptanCarbendazim

CarbofuranChlordane

ChlorfenvinphosChlorothalonilChlorpropham

ChlorpyrifosChlorpyrifos−methylChlorthal−dimethyl

CypermethrinCyprodinil

DDTDeltamethrin

DiazinonDichlofluanid

DichlorvosDicofol

DieldrinDifenoconazole

DimethoateDiphenylamine

DithiocarbamateEndosulfan

EsfenvalerateEthion

FenhexamidFenitrothionFludioxonil

FolpetHeptachlor

HexachlorobenzeneHexaconazole

ImazalilImidacloprid

IprodioneKresoxim−methyl

Lambda−CyhalothrinLindaneLinuron

MalathionMaleic hydrazide

Metalaxyl−MMethamidophos

MethidathionMethiocarb

MethomylMyclobutanil

OmethoateOxadixyl

PenconazolePhosalone

PhosmetPirimicarb

Pirimiphos−methylProchloraz

ProcymidonePropargite

PropyzamidePyrimethanilPyriproxyfenQuinoxyfen

SulphurTebuconazoleTebufenozideTebufenpyrad

ThiabendazoleThiophanate−methyl

TolylfluanidTriadimenolVinclozolin

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 18

/ 22

Page 19: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Exposures correlations of cluster 1 : 699 children

Cocktail 7

Cocktail 4

Cocktail 5

Cocktail 1

Cocktail 6

Cocktail 2

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 19

/ 22

Page 20: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Exposures correlations of cluster 2 : 238 children

Cocktail 6

Cocktail 2

Cocktail 4

Cocktail 3

Cocktail 1

Cocktail 7

Cocktail 5

Cocktail 3incomplete

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 20

/ 22

Page 21: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Extensions

Prior distribution on γ : Gamma(aγ, bγ)Use Poisson-Dirichlet process instead of DP : slow convergence

Pareto kernel for kDP for censored data

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 21

/ 22

Page 22: Nonparametric bayesian modelling of individual co ... · Nonparametric bayesian modelling of individual co-exposures to various pesticides to determine cocktails ... 2006) consumed

Thanks

F. Heraud, JC Leblanc and JL Volatier, AFSSA

ORP (Pesticide Residues Observatory) for financial supportANR (National Research Agency) for financial support

Thank you for your attention

Amelie Crepet (ANSES) DP for co-exposure clusteringJourneys MAS, Bordeaux, 31 August - 3 September, 2010 22

/ 22