some challenges to make current data- driven (‘statistical’) models even more relevant to public...

35
Some challenges to make current data-driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics, Penn State University

Upload: mariela-ellett

Post on 16-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Some challenges to make current data-driven (‘statistical’) models even more

relevant to public health

Ottar Bjornstad Center for Infectious Disease Dynamics,

Penn State University

Page 2: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Focus on time series analysis of incidence analysis

0

50k

100k

150k

200k

250k

300k

350k

400k

‘44 ‘55

Quarterly measles incidence 44-55

0-1y1-3y3-5y

5-10y10-15y

15- y

Page 3: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Outline:

~ 1993

then –> now

current challenges

mostly anecdotal personal reflections

Page 4: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

~ 1993

Page 5: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

~1993 maturing* mathematical formalism incorporating: (cf yesterdays talks by Mick and Val)

– Seasonal forcing

– Age-structure & non-homogenous mixing

– Spatial diffusion & metapopulation dynamics

– Plausible scenarios of scaling of transmission with pop size

– Stochasticity

But, early days w.r.t. letting these general models loose on data …

* for directly transmitted persistent (SI), fully immunizing (SIR) or fully non-immunizing (SIS)

Page 6: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

~1993 Early days w.r.t. letting these general models loose on data …

… because of many challenges

The obvious:

–Absence of data on key state variables (eg susceptible)

–Disparity between key state variables and observed quantities (eg incidence is not prevalence)

The less obvious:

–Weariness regarding whether a detailed quantitative match to data should be a critical characteristic of mathematical models

Page 7: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

then –> now

Page 8: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Today’s expectation for match (1)

eg TSIR forecast for E&W measles (E&W)

IncidenceTSIR forecast

Page 9: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

),(~1

1

1

ttt

tttt

tttt

INegBinI

IS

IBSS

S - Susceptibles, B - Births , I - Infected and Infective, - epidemic intensity, - correction for time discretization, β - seasonal transmission rate

Host dynamics

Transmission dynamics

Stochasticity

TSIR: Discrete time ‘piecewise constant’ B-D process (cf Chain-binomial)If l is small relative to S, then the chain can be approximated by an unconstrained B-D process; The conditional distribution of It+1 is the sum of It Geometric distributions -> NegBin with clumping It

Page 10: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Today’s expectation for match (2)

eg age-structured TSIR forecast for rubella (South Africa)

Metcalf et al 2013

* difference in heat intensity is due to underreporting

*

Page 11: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Yesteryear’s expectations

Kot and Schaffer (1985) JTB:

“One way of resolving the problem is to view the motion in phase space,i.e. in a vector space whose axes are the state independent variables.However, for most real world ecological and epidemiological systems, thisrequirement is not easily met. It is often difficult even to enumerate all ofthe state variables, much less to follow their magnitudes over time. Put another way, the variables studied in nature are generally embedded in more complex systems. As a practical matter, it is unlikely that population dynamicists will ever be able to write down the complete governing equations for any natural system.”

ID complexities: age-structured mixing, age-specific seasonality in transmission, spatial heterogeneity, heterogeneities in susceptibility, etc, etc

Page 12: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Journey from there to here (1)

-> Many ‘Obvious’ challenges were painstakingly resolved along the way

Page 13: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Journey from there to here (1)

-> Many ‘Obvious’ challenges were painstakingly resolved along the way

1) Perhaps models may have some qualitative relevance?- Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c).

Page 14: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Journey from there to here (1)

-> Many ‘Obvious’ challenges were painstakingly resolved along the way

1) Perhaps models may have some qualitative relevance?- Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c).

2) If we can somehow reconstruct the unobserved susceptible class, would it be egregiously ambitious to compare model simulations and data? - Semiparametric models with smart embedding (Ellner &c) - Susceptible reconstruction (Bobashev &c; Finkenstadt &c)

Page 15: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Journey from there to here (1)

-> Many ‘Obvious’ challenges were painstakingly resolved along the way

1) Perhaps models may have some qualitative relevance?- Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c).

2) If we can somehow reconstruct the unobserved susceptible class, would it be egregiously ambitious to compare model simulations and data? - Semiparametric models with smart embedding (Ellner &c) - Susceptible reconstruction (Bobashev &c; Finkenstadt &c)

3) A seasonal chain-binomial model can in fact be recast as a non-autonomous autoregressive regression: I dear you! - Time-series SIR ver 1 (Finkenstadt & Grenfell) and TSIR ver 2

Page 16: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Journey from there to here (1)

-> Many ‘Obvious’ challenges were painstakingly resolved along the way

1) Perhaps models may have some qualitative relevance?- Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c).

2) If we can somehow reconstruct the unobserved susceptible class, would it be egregiously ambitious to compare model simulations and data? - Semiparametric models with smart embedding (Ellner &c) - Susceptible reconstruction (Bobashev &c; Finkenstadt &c)

3) A seasonal chain-binomial model can in fact be recast as a non-autonomous autoregressive regression: I dear you! - Time-series SIR ver 1 (Finkenstadt & Grenfell) and TSIR ver 2

4) Why in the world does the TSIR seem to fit measles in E&W? - ‘Emergent simplicity’ (Grenfell); Dynamic homogeneity (Earn &c)

Page 17: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Journey from there to here (2)

5) We believe! Real dynamics can be predicted by simple mechanistic models (that incorporates key idiosyncrasies) - POMP et al (King &c)

- Hierarchical models with observation process (Cauchemez &c). - Age-structured TSIR (Metcalf &c).….. (cf Simon’s talk)

Page 18: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Journey from there to here (2)

5) We believe! Real dynamics can be predicted by simple mechanistic models (that incorporates key idiosyncrasies) - POMP et al (King &c)

- Hierarchical models with observation process (Cauchemez &c). - Age-structured TSIR (Metcalf &c).….. (cf Simon’s talk)

Lessons from last 20 years: - ‘All models are wrong …’ Some much less than we expected.- Emergent simplicity once key idiosyncrasies are identified - ?Tactical/strategical? models may be more relevant than we expected.

[The prevailing notion that computation was the important driver in the field is wrong (Cambridge MRCs BUGS has been around since 20 years)]

Page 19: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Some current challenges

Page 20: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Some critical issues are:

(i) use nonlinear stochastic modeling to identify all potentially undesirable side effects of intervention-induced reduction in circulation.

-Rubella and CRS (cf Jess’ talk)

-Chikenpox vaccine and increased shingles incidence

-Whooping cough and the role of natural antigen circulation in maintaining immune memory. The possibility of long-term vaccine failure.

More case law!

Page 21: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

1920 1940 1960 1980 2000

Pertussis in Massachusetts

Years

Re

port

ed

pe

rtus

sis

case

s (/

yr.)

10

10

01

00

010

00

0

Cas

es (

/mo.

)

1990 1995 2000 2005

200

400 Recent cases

Vaccine introduced

The first decades of vaccine induced control was extremely successful …

Mass-vaccination introduced in most rich countries in mid ‘40s - early ’50s

Page 22: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

1920 1940 1960 1980 2000

Pertussis in Massachusetts

Years

Re

port

ed

pe

rtus

sis

case

s (/

yr.)

10

10

01

00

010

00

0

Cas

es (

/mo.

)

1990 1995 2000 2005

200

400 Recent cases

Vaccine introduced

The first decades of vaccine induced control was extremely successful …… Then even in very high cover areas throughout the developed world (e.g. Massachusetts with consistent >95% cover) the disease re-emerged!

Mass-vaccination introduced in most rich countries in mid ‘40s - early ’50s

Page 23: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Re-emergence is associated with a completely different core group

age class

dens

ity0.

000

.05

0.1

00

.15

0-1 5-6 10-15 15-60

pre-vaccine era

age class

dens

ity0.

000

.05

0.1

00

.15

0-1 5-6 12-13 20-21 30-31 40-41 50-51

vaccine era

Massachusets age-incidence patterns

Lavine, King and Bjornstad. 2011. PNAS

Page 24: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

The ‘anamnestic’ 4 compartment SIR model – re-exposure helps maintain immune memory

S – suceptible

I – Infected

R – Highly immune

W – Waning: resistant to infection and will get boosted or loose immunity depending on competing rates

- force of infection

- boosting coeffiecient

- recovery rate

- rate of loss of immunity

- rate of loss of immunity

Lavine, King and Bjornstad. 2011. PNAS

Page 25: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

As long as the anamnestic response is at least 10x greater than the naïve response:

Pre-vaccination prediction

Age

Page 26: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Post-vaccination prediction:

Lavine, King and Bjornstad. 2011. PNAS

Page 27: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

0.0 0.2 0.4 0.6 0.8 1.0

Vaccine coverage

Pro

po

rtio

n in

fect

ed Total incidence

0.0

008

0.0

011

0.0

014

? 0.5? 20? 5000

Natural immune boosting in pertussis dynamics and the potential for long-term vaccine failure

Inci

denc

e

Vaccine coverage

‘SIR’

‘SIS’

Page 28: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Pro

po

rtio

n ca

ses

0.0

00

.01

0.0

20.

03

0.0

4

Eff ect of booster v accine at a g e 15

1 3 5 7 9 11 13 15 17 19 21 23

Boost at 15No boost

Predicted public health consequences of a booster vaccine at age 15 …

… The booster may push circulation towards adults of childbearing age and increase perinatal infection and increase severe disease.

(cf CRS but different mechanism)

Page 29: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

Some critical issues are:

(ii) robust forecasting in the face of rapidly changing demographies and vaccination schedules

Page 30: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

(ii) robust forecasting in the face of rapidly changing demographies and vaccination schedules

e.g. Measles Incidence in China: 3 provinces

From Matt Ferrari

Page 31: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

(ii) robust forecasting in the face of rapidly changing demographies and vaccination schedules; *

* Perreti et al PNAS 2013 110:5253-5257 (‘Model-free forecasting outperforms the correct mechanistic model for simulated and experimental data’) is not the way to go

Page 32: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

(iii) probabilistically projecting possible/probable build-up of ‘susceptible pockets’ in the face of imperfect vaccination programs

Page 33: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

(iii) probabilistically projecting possible/probable build-up of ‘susceptible pockets’ in the face of imperfect vaccination programs

Burkina Faso: 2009

Malawi: 2010

•>135,000 cases following 10 years of low incidence

France, 2011

0

200

400

600

800

1000

1200

1986

Jul-

86

1987

Jul-

87

1988

Jul-

88

1989

Jul-

89

1990

Jul-

90

1991

Jul-

91

1992

Jul-

92

1993

Jul-

93

1994

Jul-

94

1995

Jul-

95

1996

Jul-

96

1997

Jul-

97

1998

Jul-

98

MES / AÑO

CA

SOS

1986

-96

0

2000

4000

6000

8000

10000

12000

CA

SOS

1997

-98

CAMPAÑA MAYO 87SARAMPIÓN 9M - 14 A

CAMPAÑA JUNIO 92M-M-R 1 - 10 A

Sao Paolo: 1997

Eg Measles (from Matt Ferrari)

Page 34: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

(iv) Important challenge:

We need accurate parametric anchoring of mechanistic models

-7000

-6000

-5000

-4000

-3000

-2000

-1000

2 e-04 4 e-04 6 e-04 8 e-04 1 e-03

2 e-04

4 e-04

6 e-04

8 e-04

1 e-03

intra.beta

inte

r.b

eta

Inference for mechanistic models usually reveal severe multicollinearity among parameters:-Various parameter combinations can fit observed data equally well,-but will not make the same out of sample predictions

Log-likelihood

Intrastage β

Inte

rsta

ge β

Eg 2-stage PDV model (cf Klepac et al. 2009)

Page 35: Some challenges to make current data- driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics,

- ‘All models are wrong …’ Some much less than we expected.- Emergent simplicity once key idiosyncrasies are identified. -?Tactical/strategical? models may be more relevant than we expected.

- We have an enormous arsenal of model fitting tools.- Multicollinearity makes anchoring of estimates a critical challenge.

Current modeling challenges:- Study unanticipated Public health consequences - Consequences of rapidly changing demographics- Understand build-up of susceptible pockets

Thank you!