inference in dynamic environments mark steyvers scott brown uc irvine this work is supported by a...

36
Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research (AFOSR grant number FA9550-04-1-0317)

Post on 22-Dec-2015

216 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Inference in Dynamic Environments

Mark Steyvers

Scott Brown

UC Irvine

This work is supported by a grant from the US Air Force Office of Scientific Research (AFOSR grant number FA9550-04-1-0317)

Page 2: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Overview

• Experiments with dynamically changing environments

– How do observers adapt their decision-making strategies?

– How quickly can observers detect changes?

• Theory development

– Bayesian ideal observers

– Process models

– Measure individual differences in adaptation

Page 3: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Part I Decision Criteria Adaption

( a quick overview from last year’s presentation)

Page 4: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

DifficultBlock

EasyBlock

Decision Environment

parameter

Subject’sdecision criterion

Decision lag

EasyBlock

DifficultBlock

Time

Environments alternating between easy and hard blocks

Time

Page 5: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Realistic 3D gaming tasks

• Decide which of two types of missiles is approaching

• How quickly can a participant adapt to changes in the similarity between the two missiles?

A=“not puffy”

B=“very puffy”

Page 6: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Modeling Results

• Developed a lagged signal detection model

• Model estimates that participants take an average of 9 trials to switch to a new decision criterion

• Individual differences

lag

0 10 20 30 40

#Par

tici

pan

ts

Page 7: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Part IIPrediction and Change Detection

Page 8: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Change detection and prediction

• Prediction of future observations in time-series data

• Changepoint models– E.g. changing to different coins at random times

• Accurate prediction requires detection of change– stock market– “hot hand” players

Page 9: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Basic Task

• Given a sequence of random numbers, predict the next one

Page 10: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

= observed data

= prediction

Two-dimensional prediction task

• Touch screen monitor

• 1500 trials • Self-paced • Same sequence

for all subjects

Page 11: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

0

5

10

0

5

10

Subject 4

40 50 60 70 80 90 100 110 120 1300

5

10

Time

Subject 12

Sequence Generation

• (x,y) locations are drawn from a binomial distribution of size 10, and parameter θ

• At every time step, probability 0.1 of changing θ to a new random value in [0,1]

• Example sequence:

Time

θ=.12 θ=.95 θ=.46 θ=.42 θ=.92 θ=.36

Page 12: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

0

5

10

Optimal Bayesian Solution

0

5

10

Subject 4

40 50 60 70 80 90 100 110 120 1300

5

10

Time

Subject 12

= observed sequence

Optimal Bayesian Solution= prediction

0

5

10

Optimal Bayesian Solution

0

5

10

Subject 4

40 50 60 70 80 90 100 110 120 1300

5

10

Time

Subject 12

Subject 4 – change detection too slow0

5

10

Optimal Bayesian Solution

0

5

10

Subject 4

40 50 60 70 80 90 100 110 120 1300

5

10

Time

Subject 12

Subject 12 – change detection too fast

(sequence from block 5)

Page 13: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Tradeoffs

• Detecting the change too slowly will result in lower accuracy and less variability in predictions than an optimal observer.

• Detecting the change too quickly will result in false detections, leading to lower accuracy and higher variability in predictions.

Page 14: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

0.5 1 1.5 2 2.5 3 3.52.7

2.8

2.9

3

3.1

3.2

3.3

3.4

3.5

Mean Absolute Movement

Me

an

Abs

olu

te T

ask

Err

or

12

34

5

6

7

8 9

OPTIMAL SOLUTION

Average Error vs. Movement

= subject

Relatively many changes

Relatively few changes

Page 15: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

A simple model

1. Make new prediction some fraction α of the way between old prediction and recent outcome. α = change proportion

2. Fraction α is a linear function of the error made on last trial

3. Two free parameters: A, B

A<B bigger jumps with higher error

A=B constant smoothing

1 (1 )t t tp y p

t t tError y p

α

0

1

A

B

BA

Page 16: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Average Error vs. Movement

0.5 1 1.5 2 2.5 3 3.52.7

2.8

2.9

3

3.1

3.2

3.3

3.4

3.5

Mean Absolute Movement

Me

an

Abs

olu

te T

ask

Err

or

12

34

5

6

7

8 9

OPTIMAL SOLUTION

= subject

= model

Page 17: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

One-dimensional Prediction Task

1

2

3

4

5

6

7

8

9

10

11

12

12 PossibleLocations

• Where will next blue square arrive on right side?

Page 18: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Average Error vs. Movement

0 0.5 1 1.5 2

1.2

1.4

1.6

1.8

2

2.2

2.4

Mean Absolute Movement

Me

an

Abs

olu

te T

ask

Err

or

12

3

4

56

7

8

910

11

12

13

14

15

16

17

1819

2021

OPTIMAL SOLUTION

= subject

= model

Page 19: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Inference and Prediction Judgments

• Many errors in prediction

– too much movement

• What is causing this?

– Faulty change detection or faulty prediction judgments?

– Gambler’s fallacy: people predict too many alternations

• New experiments:

– ask directly about the generating process

– inference judgment: what currently is the state of the system?

Page 20: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Tomato Cans Experiment

• Cans roll out of pipes A, B, C, or D

• Machine perturbs position of cans (normal noise)

• At every trial, with probability 0.1, change to a new pipe (uniformly chosen)

(real experiment has response buttons and is subject paced)

A B C D

Page 21: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Tomato Cans Experiment

(real experiment has response buttons and is subject paced)

A B C D • Cans roll out of pipes A, B, C, or D

• Machine perturbs position of cans (normal noise)

• At every trial, with probability 0.1, change to a new pipe (uniformly chosen)

• Curtain obscures sequence of pipes

Page 22: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Tasks

A B C D• Inference:

what pipe produced the last can?

A, B, C, or D?

• Prediction: in what region will the next can arrive?

1, 2, 3, or 4?

1 2 3 4

Page 23: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Experiment 1

• 63 subjects

• 12 blocks

– 6 blocks of 50 trials for inference task

– 6 blocks of 50 trials for prediction task

– Identical trials for inference and prediction

Page 24: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

INFERENCE PREDICTION

Sequence

0 0.2 0.4 0.6 0.8 10

0.2

0.4

0.6

0.8

1

1

2

3

4

5

6

1 2 3 4 5 6A B C D

Trial

Page 25: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

INFERENCE PREDICTION

Sequence

Ideal Observer

0 0.2 0.4 0.6 0.8 10

0.2

0.4

0.6

0.8

1

1

2

3

4

5

6

1 2 3 4 5 6A B C D

Trial

Page 26: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

INFERENCE PREDICTION

Sequence

Ideal Observer

Individualsubjects

Trial0 0.2 0.4 0.6 0.8 1

0

0.2

0.4

0.6

0.8

1

1

2

3

4

5

6

1 2 3 4 5 6A B C D

Page 27: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

INFERENCE PREDICTION

Sequence

Ideal Observer

Trial0 0.2 0.4 0.6 0.8 1

0

0.2

0.4

0.6

0.8

1

1

2

3

4

5

6

1 2 3 4 5 6A B C D

Individualsubjects

Page 28: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

INFERENCE PREDICTION

Sequence

Ideal Observer

Trial0 0.2 0.4 0.6 0.8 1

0

0.2

0.4

0.6

0.8

1

1

2

3

4

5

6

1 2 3 4 5 6A B C D

Individualsubjects

Page 29: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

0 20 40 6030

40

50

60

70

80

90

100

Changes (%)

Acc

ura

cy

0 20 40 6030

40

50

60

70

80

90

100

Changes (%)

Acc

ura

cy

= Subjectideal

ideal

INFERENCE PREDICTION

Page 30: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

0 20 40 6030

40

50

60

70

80

90

100

Changes (%)

Acc

ura

cy

ideal

0 20 40 6030

40

50

60

70

80

90

100

Changes (%)

Acc

ura

cy

ideal

INFERENCE PREDICTION

= Process model

= Subject

Page 31: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

50

60

70

80

90

100low alpha

50

60

70

80

90

100

% A

ccu

racy

(ag

ain

st T

rue

)

med alpha

0 10 20 30 40 50 6050

60

70

80

90

100

% Changes

high alpha

Varying Change Probability

• 136 subjects

• Inference judgments only

Subjects track changes in alpha

ideal

ideal

ideal

Prob. = .08

Prob. = .16

Prob. = .32

Page 32: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Number of Perceived Changes per Subject

Low medium high

Change Probability

(Red line shows ideal number of changes)

Subject #1

Page 33: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Number of Perceived Changes per Subject

55% of subjects show increasing pattern

45% of subjects show non-increasing pattern

Low, medium, high change probability Red line shows ideal number of changes

Page 34: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Conclusion

• Adaptation in non-stationary decision

environments

• Subjects are able to track changes in dynamic decision environments

• Individual differences

– Over-reaction: perceiving too much change

– Under-reaction: perceiving too little change

Page 35: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Potential deliverable outcomes

• Simple measurement models of decision making in dynamic environments.

• Classify different observers’ decision making abilities under different kinds of environments

– E.g., does this observer respond too slowly to changes in their environment?

– Is another observer close to the theoretically optimal decision mechanism?

– Conversely, classify different observers as optimally suited to different tasks, depending on dynamic properties of decision making environments in those tasks.

Page 36: Inference in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific Research

Publications

• Brown, S.D., & Steyvers, M. (2005). The Dynamics of Experimentally Induced Criterion Shifts. Journal of Experimental Psychology: Learning, Memory & Cognition, 31(4), 587-599.

• Steyvers, M., & Brown, S. (2005). Prediction and Change Detection. In: Advances in Neural Information Processing Systems, 19

• Navarro, D.J., Griffiths, T.L., Steyvers, M., & Lee, M.D. (2006). Modeling individual differences using dirichlet processes. Journal of Mathematical Psychology, 50, 101-122.

• Wagenmakers, E.J., Grunwald, P., & Steyvers, M. (2006). Accumulative prediction error and the selection of time series models. Journal of Mathematical Psychology, 50, 149-166.