modeling fmri data with uncertain hemodynamic response or stimulus functions martin lindquist...

46
Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Upload: gwen-oneal

Post on 20-Jan-2016

225 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Modeling fMRI data with uncertain hemodynamic response or

stimulus functions

Martin LindquistDepartment of StatisticsColumbia University

Page 2: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Functional MRI

• Functional MRI (fMRI) performed using BOLD contrast can detect changes in blood oxygenation and flow that occur in response to neural activity.

• A primary goal of fMRI research is to use information provided by the BOLD signal to make conclusions about the underlying neuronal activity.

Page 3: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Overview

Stimulus Neuronal Activity

Hemodynamics

Given data and stimulus function, estimate the hemodynamic response function (HRF).

Given data only, estimate activity.

Part I:

Part II:

Page 4: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Modeling the Hemodynamics

• A number of methods exist for modeling the relationship between stimulus and BOLD response.

– Linear time invariant (LTI) system− BOLD response to events add linearly− Relatively simple to use

– Non-linear models (e.g. Balloon model) − Consists of a set of ODEs− More complicated/time-consuming than linear models

• Both types provide a means for estimating the HRF.

Page 5: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Estimating the HRF

• The ability to accurately model the evoked hemodynamic response to a neural event plays an important role in the analysis of fMRI data.

• When analyzing the shape of the estimated HRF, summary measures (e.g., amplitude, delay, and duration) can be extracted.

• They can be used to infer information regarding the intensity, onset latency, and duration of the underlying activity.

Page 6: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Summary Measures

Estimate amplitude (H), time-to-peak (T), and full-width at half-max (W).

Ideally, these parameters should be directly interpretable in terms of changes in neuronal activity, and estimated so that statistical power is maximized.

Page 7: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Interpretability

Solid – expected relationships

Dashed – relationships that complicate interpretation.

Q1. Do changes in parameters related to neural activity directly translate into changes in corresponding parameters of the HRF?

Q2. Does the HRF model recover the true parameters of the response?

Page 8: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

• BOLD physiology limits the interpretability of parameters in terms of neuronal and metabolic function.

• We treat the evoked BOLD response as the signal of interest, without making a direct quantitative link to neuronal activity.

• Here we focus on the ability of different models to recover differences in the height, time-to-peak, and width of the true BOLD response.

• Which model is most efficient while giving rise to the least amount of bias and misspecification?

Page 9: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

LTI System

• The dominant analysis strategy is to assume that BOLD responses to events add linearly (Boynton et al.1996) and use a set of smooth functions to model the underlying HRF.

• We model the relationship between stimuli and BOLD response using a linear time invariant (LTI) system.

• The stimulus acts as the input and the HRF acts as the impulse response function.

Page 10: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Convolution Examples

Hemodynamic Response Function

Predicted Response

Block Design

Experimental Stimulus Function

Event-Related

* *

= =

Page 11: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

General Linear Model

• The General linear model (GLM) approach treats the data as a linear combination of model functions (predictors) plus noise (error).

• The model functions are assumed to have known shapes, but their amplitudes are unknown and need to be estimated.

• The GLM framework encompasses many of the commonly used techniques in fMRI data analysis (and data analysis more generally).

Page 12: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

npnpp

p

p

n XX

XX

XX

Y

Y

Y

2

1

1

0

2

221

111

2

1

1

1

1

We can write the GLM model as

εXβY where

fMRI Data Design matrix

Model parameters

Noise

Matrix Formulation

Page 13: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Assume the model:

εXβY where

2)( Vε Var

If V is known the optimal solution for is:

YVXXVX 111ˆ TT

GLM - Solution

Inference is performed using linear combinations of .^

Page 14: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Basis Functions

• A linear combination of several basis functions can be used to account for possible delays and dispersions in the HRF.

• The stimulus function is convolved with each of the basis functions to give a set of regressors.

• The parameter estimates give the coefficients that determine the combination of basis functions that best model the HRF for the trial type and voxel in question.

Page 15: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

• Canonical HRF + derivatives

• Finite impulse response functions

• Many more…..

Examples:

Basis Functions

Page 16: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Basis Functions

Time (s)

ModelModelImage ofImage ofpredictorspredictors Data & FittedData & Fitted

Single HRFSingle HRF

HRF + HRF + derivativesderivatives

Finite Finite Impulse Impulse Response Response (FIR)(FIR)

Page 17: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Smooth FIR

• The FIR solution tends to be very noisy.

• To constrain the fit to be smoother (but otherwise of arbitrary shape), a Gaussian prior can be placed on the filter parameters .

• The maximum a posteriori estimate of gives a smoothed version of the filter.

Red – FIRBlue – Smooth FIR

Page 18: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Non-linear Models

• Alternatively, one can use non-linear models with free parameters for magnitude and onset/peak delay.

• Common criticisms of such approaches are their computational costs and potential convergence problems.

• However, increases in computational power make nonlinear estimation over the whole brain feasible.

Page 19: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Inverse Logit Model

• Superposition of three inverse logit (sigmoid) functions.

• Each function has three variable parameters dictating the amplitude, position and slope.

Lindquist & Wager (2007)

333

222

111

)(

)(

)()|(

DTtL

DTtL

DTtLth

1)1()( xexL

Page 20: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Flexibility

By shifting the position of the second IL function one can model differences in duration.

By shifting the position of all three IL functions one can model differences in onset.

Page 21: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Model fitting

• Model fitting is performed using either simulated annealing or gradient descent.

• We typically constrain the solution so that the fitted response begins and ends at 0, which leads to a model with 7 variable parameters.

• Alternatively, we use a 4 parameter model where only the position of each function and the total amplitude is allowed to vary.

Page 22: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Simulation Study

• We compare the different models ability to handle shifts in onset and duration. The models we studied were:

– The canonical HRF– The canonical HRF + 1st derivative– The canonical HRF + 1st & 2nd derivative– The FIR model– The Smooth FIR model– Non-linear Gamma– Inverse Logit Model

Lindquist & Wager (2007)Lindquist, Loh, Atlas & Wager (2008)

Non-linear

GLM based

Page 23: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Estimation

• After fitting each model we estimate H, T and W using closed form solutions (when available) or the fitted HRF.

• For models that include the canonical HRF and its derivatives it is common to only use the non-derivative term as an estimate of amplitude.

• However, this will be biased and instead we use “derivative boost” (Calhoun et al., 2004).

Page 24: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Stimulus function

Assumed in analysis of simulation data

Assumed (black) with delayed “true” (gray)

“True” response of extended duration

1

3

5

7

9

1 2 3 4 5

Onset shift

Duration

A B

C

Simulation

Page 25: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Simulation

• Datasets generated for 15 “subjects”, consisting of the “true” BOLD time series plus white noise, with plausible effect size (noise std equal to 1, Cohen’s d = 0.5).

• Estimates of amplitude (H), time-to-peak (T) and width (W) were obtained for each model. The average values across the 15 subjects were compared with the true values to assess model dependent bias in the estimates.

• In addition, for each subject and voxel the residuals were checked for model misspecification.

Page 26: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Detecting Mis-modeling

• Let r(i) be the whitened residuals and K(i) a Gaussian kernel.

• When no mis-modeling is present

is normal with mean 0 for all w, t.

• Calculate using random field theory.

1w+t

t=iw i)r(i)K(t=(t)Y

))(max( tYP wt

Loh, Lindquist & Wager (2008)

Page 27: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

T

W

Mis

-mo

de

ling

GAM TD DD FIR sFIR NL IL

H

1

3

5

7

9

1 2 3 4 5

Onset shift

Duration No

Significant Bias

Negative Bias

Positive Bias

Page 28: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Inference

• We perform population inference using the estimated amplitude for each subject and one of the following methods:

- Summary Statistics Approach- Assumes normality

- Bootstrap- Non-parametric- Use the bias-corrected accelerated (BCa) version

- Sign-permutation test- Non-parametric

Page 29: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Results

Page 30: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Conclusions

• The canonical HRF based models (GAM, TD & DD) are highly susceptible to model misspecification.

• The FIR models (FIR & sFIR) and the IL model provide the most flexibility to handle differences in onset and duration.

• The IL model performs best in terms of bias and model misspecification, but is computationally more demanding than the FIR models.

Page 31: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Overview

Stimulus Neuronal Activity

Hemodynamics

Given data and stimulus function, estimate the hemodynamic response function (HRF).

Given data only, estimate activity.

Part I:

Part II:

Page 32: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Overview

Stimulus Neuronal Activity

Hemodynamics

Given data and stimulus function, estimate the hemodynamic response function (HRF).

Given data only, estimate activity.

Part I:

Part II:

?

Page 33: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Unknown Stimulus Functions

• Most statistical analysis of fMRI data assume that the timing and duration of the psychological processes are known.

• However, in many studies, it is hard to specify this information a priori (e.g., threat/emotional experience and drug uptake).

• In these situations using a standard GLM-based analysis is not appropriate and alternatives need to be explored.

Page 34: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Change Point Analysis

• Our approach uses change point analysis to detect changes in brain activity without prior knowledge of the exact onset or duration.

• We can make population inferences on whether, when, and for how long an fMRI time series deviates from a baseline level.

• We can then characterize brain responses in terms of their relationship to physiological changes (e.g. reported stress).

Page 35: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Change Point Analysis

• We propose a three step procedure:

1. Use HEWMA (Hierarchical EWMA), to determine voxels with time courses that deviate from baseline in the population (Lindquist, Waugh, & Wager 2007).

2. Estimate voxel-specific distributions of onset times and durations from the fMRI response.

3. Perform spatial clustering of voxels according to onset and duration characteristics, and anatomical location using a hidden Markov random field model.

Page 36: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Two states: Active/Inactive

1)1( ttt zxz nt ,1

Calculate smoothed EWMA statistic across time (t) for each subject:

m

i

iim

i

ipop ZWWZ

1

1

1

1** biiW

Use the weighted average of each subjects EWMA statistic to get group results:

Monte Carlo simulation provides correction for multiple comparisons

HEWMA

Search across time for deviations from baseline (inactive) state.

Page 37: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Estimating Change Points

• Each subject’s time series is a sequence of normally distributed observations that may at some unknown time i undergo a shift in mean.

• This in turn may be followed by a return to baseline at i + i where i is also unknown.

• Both i and i are random variables drawn from unknown population distributions: g(t) = P(i=t) and g(t) = P(i=t), respectively.

Page 38: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

• The distributions are estimated assuming no functional form, and allowing for the possibility of no response.

• We assume contiguous observations come from the same component except at i and i + i.

),( 211 N

Active:

Inactive:

),( 222 N

i i+ i

i ~ g

i ~ g

Estimate: 1=(1, 1), 2=(2, 2), g, g.

Page 39: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

By treating i and i as missing data, we can employ the EM algorithm to calculate the MLE of g(t) and g(t).

Conditional likelihood (i and i known):

Baseline state Active state

Joint likelihood (i and i unknown):

Baseline state

OnsetDistribution

DurationDistribution

Page 40: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

BA

C

D

Using the estimated distributions we can calculate the probability of activation as a function of time:

t

j

jtPjPtP1

)()() at time activation(

Simulated data

Page 41: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Spatial Clustering

• A Hidden Markov Random Field model is used to cluster voxels based on onset and duration characteristics.

• While the data Y from each voxel is observed, the cluster labels X are unobserved.

• Conditional on a neighborhood of voxels, a voxel’s cluster membership is independent of all non-neighbors:

• We use the ICM algorithm to approximate the maximum a posteriori estimate of X:

Page 42: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Experiment

• 24 participants were scanned in a 3T GE magnet.

• Participants were informed that they were to be given 2 min to prepare a 7 min speech, the topic of which would be revealed to them during scanning. After the scan, the speech would be delivered to a panel of expert judges.

• During a run, 215 images were acquired (TR = 2s).

Stress and increased heart rate were reported

throughout the entire preparation interval.

Page 43: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

ResultsTransientSustained

Onset of speech task

90-160 s

45-90 s

Lindquist, Waugh, & Wager 2007Lindquist & Wager, in press

Page 44: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

1

2

3

4

5

Visual cue | Speech preparation

12345

HR

1. Visual cortex2. Superior temporal sulci3. Ventral striatum4. Superior temporal sulci5. Ventromedial PFC

MPFC only area with sustained activation throughout speech preparation.

Page 45: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Summary

• In many experiments the exact form of the stimulus and/or HRF are not known a priori.

• There exist a number of linear and non-linear techniques for estimating the HRF, but there are substantial differences in terms of power, bias and parameter confusibility.

• Using change point methods we can make inference about activation with unknown onset and duration.

Page 46: Modeling fMRI data with uncertain hemodynamic response or stimulus functions Martin Lindquist Department of Statistics Columbia University

Comments

• Collaborators:

– Lauren Atlas– Ji-Meng Loh– Lucy Robinson– Tor Wager

• Matlab implementation of HEWMA freely available at:

– http://www.columbia.edu/cu/psychology/tor/