a location-mixture autoregressive model for online...

26
A Location-Mixture Autoregressive Model for Online Forecasting of Lung Tumor Motion Dan Cervone, Natesh Pillai, Debdeep Pati, Ross Berbeco, John H. Lewis Harvard Statistics Department and Harvard Medical School August 6, 2014 Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 1 / 17

Upload: others

Post on 10-Feb-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

A Location-Mixture Autoregressive Model for OnlineForecasting of Lung Tumor Motion

Dan Cervone, Natesh Pillai, Debdeep Pati, Ross Berbeco, John H.Lewis

Harvard Statistics Department and Harvard Medical School

August 6, 2014

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 1 / 17

Page 2: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Introduction

External beam radiotherapy:

Lung tumor patients are given an implant (fiducial) at the location oftheir tumor.

X-ray tomography can reveal location of the fiducial, thus the tumor.

Radiotherapy is applied to the tumor location in a narrow beam,minimizing exposure within healthy tissue.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 2 / 17

Page 3: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Introduction

Fiducial External beam radiotherapy

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 3 / 17

Page 4: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Introduction

Our problem:

Patient’s respiration means the tumor is in constant motion.

Tracking the fidual lags 0.1-2s behind, depending on specificmachinery used.

We need to forecast the location of the tumor to overcome thislatency and ensure concentrated, accurate radiothrapy.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 4 / 17

Page 5: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

The data

We consider data from 11 patients:

multiple days receiving treatment.multiple radiotherapy sessions (“beams”) per treatment.

−9.0

−8.5

−8.0

−7.5

−7.0

−6.5

dat$t − min(dat$t)

X p

ositi

on (

mm

)

Patient 11, day 6 beam 3

0

5

10

15

20

25

dat$t − min(dat$t)

Y p

ositi

on (

mm

)

−14.5

−14.0

−13.5

−13.0

−12.5

−12.0

Z p

ositi

on (

mm

)

0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100time (s)

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 5 / 17

Page 6: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

The data

We consider data from 11 patients:

multiple days receiving treatment.multiple radiotherapy sessions (“beams”) per treatment.

−10

−5

0

5

10

15

20

dat$t − min(dat$t)

PC

1 p

ositi

on (

mm

)

Patient 11, day 6 beam 3

−1.0

−0.5

0.0

0.5

1.0

dat$t − min(dat$t)

PC

2 p

ositi

on (

mm

)

−0.4

−0.2

0.0

0.2

0.4

0.6

PC

3 p

ositi

on (

mm

)

0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100time (s)

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 5 / 17

Page 7: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Semiperiodic features

0 10 20 30 40

−5

05

10

time (s)

Firs

t PC

pos

ition

(m

m)

Signals at different frequencies.

Fluctuations in location/amplitude/periodicity.

Repeated motifs.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 6 / 17

Page 8: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Location-mixture autoregressive process

Assume Yi , i = 0, . . . , n is a time series in R, and for some p,

Yn|Yn−1,Yn−2, . . . ∼dn∑j=1

αn,jN(µn,j , σ2)

µn,j = µn,j +

p∑`=1

γ`Yn−`

αn,j are mixture weights:∑dn

j=1 αn,j = 1.

Mixture means have autoregressive component∑p

`=1 γ`Yn−`.

Mixture means have location shift component µn,j .

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 7 / 17

Page 9: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Location-mixture autoregressive process

Assume Yi , i = 0, . . . , n is a time series in R, and for some p,

Yn|Yn−1,Yn−2, . . . ∼dn∑j=1

αn,jN(µn,j , σ2)

µn,j = µn,j +

p∑`=1

γ`Yn−`

αn,j are mixture weights:∑dn

j=1 αn,j = 1.

Mixture means have autoregressive component∑p

`=1 γ`Yn−`.

Mixture means have location shift component µn,j .

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 7 / 17

Page 10: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Location-mixture autoregressive process

Let Yi−1:p be the subseries (Yi−1 . . . Yi−p)′, and define mixturecomponents:

αi ,j =exp

[−1

2(Yi−1:p − Yi−j−1:p)′Σ−1(Yi−1:p − Yi−j−1:p)]∑

`≤i−p exp[−1

2(Yi−1:p − Yi−`−1:p)′Σ−1(Yi−1:p − Yi−`−1:p)]

µi ,j = Yi−j − γ ′Yi−j−1:p + γ ′Yi−1:p

Or, using latent variables,

Yi |Mi = j ,Yi−1, . . . ,Y0 ∼ N(Yi−j + γ ′(Yi−1:p − Yi−j−1:p), σ2)

P(Mi = `|Yi−1, . . . ,Y0) ∝

exp

[−1

2(Yi−1:p − Yi−`−1:p)′Σ−1(Yi−1:p − Yi−`−1:p)

]

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 8 / 17

Page 11: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Location-mixture autoregressive process

Latent variables Mi instantiate time series motifs: let

Ω =

(τ2 γ ′

γ Σ

),

then Mi = j implies

(Yi−0:p − Yi−j−0:p)′Ω−1(Yi−0:p − Yi−j−0:p) = small.

Assuming

Yi−0:p|Mi = j ,Yi−j−0:p ∼ N(Yi−j−0:p,Ω)

yields the LMAR predictive distribution for Yi |Mi = j ,Yi−1, . . .

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 9 / 17

Page 12: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Location-mixture autoregressive process

Latent variables Mi instantiate time series motifs: let

Ω =

(τ2 γ ′

γ Σ

),

then Mi = j implies

(Yi−0:p − Yi−j−0:p)′Ω−1(Yi−0:p − Yi−j−0:p) = small.

Assuming

Yi−0:p|Mi = j ,Yi−j−0:p ∼ N(Yi−j−0:p,Ω)

yields the LMAR predictive distribution for Yi |Mi = j ,Yi−1, . . .

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 9 / 17

Page 13: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Location-mixture autoregressive process

Latent variables Mi instantiate time series motifs: let

Ω =

(τ2 γ ′

γ Σ

),

then Mi = j implies

(Yi−0:p − Yi−j−0:p)′Ω−1(Yi−0:p − Yi−j−0:p) = small.

Assuming

Yi−0:p|Mi = j ,Yi−j−0:p ∼ N(Yi−j−0:p,Ω)

yields the LMAR predictive distribution for Yi |Mi = j ,Yi−1, . . .

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 9 / 17

Page 14: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Illustration of time series motifs

40 50 60 70 80 90 100

−10

05

1015

20

time (s)

Fis

t PC

pos

ition

(m

m)

Yi−1:p most recent p values of times series.

Yi−j−1:p previous instances of this motif.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 10 / 17

Page 15: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Illustration of time series motifs

99.6 99.8 100.0 100.2 100.4

05

1015

time (s)

Firs

t PC

pos

ition

(m

m)

Current trajectorySimilar past trajectoriesFuture point

Yi−1:p most recent p values of times series.

Yi−j−1:p previous instances of this motif.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 10 / 17

Page 16: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Parameter estimation

We focus on estimating Ω:

1-1 correspondence between Ω and LMAR parameters γ,Σ, σ2.

Latent variables Mi yield estimating equation:

h(Ω) =∑i≥p−1

2log(|Ω|)−

1

2

∑j≤i−p

1[Mi = j ](Yi−0:p − Yi−j−0:p)′Ω−1(Yi−0:p − Yi−j−0:p)

Ω = argmax(h(Ω)) computable quickly using EM algorithm.

Not loglikelihood, but Ω enjoys same large-sample properties as MLE.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 11 / 17

Page 17: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

k step ahead predictive distributions

Given a point estimate Ω, k-step ahead predictive distributions can beapproximated by marginalizing over future steps 1 to (k − 1):

Yi+k |Yi ,Yi−1, . . . ∼∑j

αkj N(µkj , σ

2k).

Mixture components αkj , µ

kj , σ

2k available cheaply, analytically.

No need for sampling or Monte Carlo.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 12 / 17

Page 18: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

k step ahead predictive distributions

Given a point estimate Ω, k-step ahead predictive distributions can beapproximated by marginalizing over future steps 1 to (k − 1):

Yi+k |Yi ,Yi−1, . . . ∼∑j

αkj N(µkj , σ

2k).

Mixture components αkj , µ

kj , σ

2k available cheaply, analytically.

No need for sampling or Monte Carlo.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 12 / 17

Page 19: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Predictive performance on clinical data

We tested predictive performance using the following procedure for allclinical observations in our data set.

1 Train model on first 35s of data2 Fit prediction model in under 5 seconds3 Simulate predictions for the next 40 seconds of data using model fit4 Compare predictions with observed values

Predictions must be made in real time (30Hz)!

Alternative prediction models considered:

Our model: LMAR

Neural networks

Penalized AR model

LICORS (Georg and Shalizi, 2012)

For all methods, any tuning parameters were set to patient-independentoptimal values, using separate data.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 13 / 17

Page 20: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Predictive performance on clinical data

We tested predictive performance using the following procedure for allclinical observations in our data set.

1 Train model on first 35s of data2 Fit prediction model in under 5 seconds3 Simulate predictions for the next 40 seconds of data using model fit4 Compare predictions with observed values

Predictions must be made in real time (30Hz)!

Alternative prediction models considered:

Our model: LMAR

Neural networks

Penalized AR model

LICORS (Georg and Shalizi, 2012)

For all methods, any tuning parameters were set to patient-independentoptimal values, using separate data.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 13 / 17

Page 21: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Predictive performance on clinical data

We tested predictive performance using the following procedure for allclinical observations in our data set.

1 Train model on first 35s of data2 Fit prediction model in under 5 seconds3 Simulate predictions for the next 40 seconds of data using model fit4 Compare predictions with observed values

Predictions must be made in real time (30Hz)!

Alternative prediction models considered:

Our model: LMAR

Neural networks

Penalized AR model

LICORS (Georg and Shalizi, 2012)

For all methods, any tuning parameters were set to patient-independentoptimal values, using separate data.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 13 / 17

Page 22: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Example 1 (k = 6)

LICORS

0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40

−4

0

4

8

40 42 44 46 48 50 52 54 56 58 60 62 64 66 68 70 72 74 76 78 80

−4

0

4

8

40 42 44 46 48 50 52 54 56 58 60 62 64 66 68 70 72 74 76 78 80

−4

0

4

8

40 42 44 46 48 50 52 54 56 58 60 62 64 66 68 70 72 74 76 78 80

−4

0

4

8

−4

0

4

8

TRAINING DATA

LMAR

NN

PENAL. AR

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 14 / 17

Page 23: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Example 2 (k = 6)

0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40

−10

−5

0

5

10

−10

−5

0

5

10

40 42 44 46 48 50 52 54 56 58 60 62 64 66 68 70 72 74 76 78 80

−10

−5

0

5

10

40 42 44 46 48 50 52 54 56 58 60 62 64 66 68 70 72 74 76 78 80

−10

−5

0

5

10

40 42 44 46 48 50 52 54 56 58 60 62 64 66 68 70 72 74 76 78 80

−10

−5

0

5

10LICORS

TRAINING DATA

LMAR

NN

PENAL. AR

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 15 / 17

Page 24: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Quantitative summaries (point prediction)

For all methods, predictive performance varies by patient, but someexamples:

0.2s forecast (k = 6) 0.4s forecast (k = 12) 0.6s forecast (k = 18)

Pat. Method RMSE MAE RMSE MAE RMSE MAE

9

LMAR 0.58 0.22 1.29 0.52 2.03 0.90NNs 0.73 0.32 1.69 0.64 2.45 0.92Ridge 0.81 0.34 1.68 0.73 2.42 0.98

LICORS 1.35 0.53 2.20 0.98 2.64 1.19

10

LMAR 0.88 0.36 1.73 0.77 2.55 1.19NNs 1.09 0.44 2.16 0.93 2.98 1.35Ridge 0.95 0.45 1.84 0.94 2.67 1.41

LICORS 1.62 0.61 2.20 1.10 3.25 1.56

11

LMAR 1.13 0.44 2.59 1.06 3.70 1.49NNs 1.24 0.50 2.95 1.19 3.99 1.70Ridge 1.19 0.63 2.69 1.51 3.99 2.40

LICORS 1.64 0.57 3.04 1.09 4.21 1.65

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 16 / 17

Page 25: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Quantitative summaries (interval prediction)

Summaries of interval/distributional predictions:

0.2s Forecast (k = 6) 0.4s Forecast (k = 12) 0.6s Forecast (k = 18)

Patient Method Coverage Log PS Coverage Log PS Coverage Log PS

9

LMAR 0.89 0.87 0.90 1.65 0.92 2.07NNs 0.86 1.02 0.78 2.20 0.80 2.77Ridge 0.81 1.54 0.81 2.21 0.81 2.64

LICORS 0.86 1.62 0.81 1.98 0.79 2.31

10

LMAR 0.86 1.18 0.88 1.94 0.91 2.33NNs 0.84 1.23 0.76 2.25 0.79 2.65Ridge 0.83 1.35 0.84 2.03 0.84 2.44

LICORS 0.86 1.61 0.82 2.02 0.81 2.31

11

LMAR 0.85 1.38 0.87 2.13 0.91 2.36NNs 0.87 1.50 0.80 2.70 0.83 2.91Ridge 0.86 1.63 0.85 2.44 0.85 2.84

LICORS 0.88 1.56 0.83 1.99 0.82 2.25

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 17 / 17

Page 26: A Location-Mixture Autoregressive Model for Online ...dcervone.com/slides/lung_tumor_prediction.pdfLung tumor patients are given an implant ( ducial) at the location of their tumor

Conclusions

Future improvements:

Hierarchical models; information sharing between patients.

Code optimizations (current pipeline in R/C++)

Further reading:

D. Cervone, N. Pillai, D. Pati, R. Berbeco, J. H. Lewis, “ALocation-Mixture Autoregressive Model for Online Forecasting ofLung Tumor Motion”. Annals of Applied Statistics (to appear).arXiv:1309.4144.

Dan Cervone (Harvard) Online forecasting of lung tumor motion August 6, 2014 18 / 17