oliver b. martin brian c. williams {omartin, williams}@mit

31
July 13, 2005 AAAI-05 Diagnosis as Approximate Belief State Enumeration for Probabilistic Concurrent Constraint Automata Oliver B. Martin Brian C. Williams {omartin, williams}@mit.edu Massachusetts Institute of Technology Michel D. Ingham [email protected] a.gov Jet Propulsion Laboratory

Upload: maxima

Post on 04-Feb-2016

45 views

Category:

Documents


2 download

DESCRIPTION

Diagnosis as Approximate Belief State Enumeration for Probabilistic Concurrent Constraint Automata. Oliver B. Martin Brian C. Williams {omartin, williams}@mit.edu Massachusetts Institute of Technology. Michel D. Ingham [email protected] Jet Propulsion Laboratory. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Diagnosis as Approximate Belief State Enumeration for Probabilistic Concurrent Constraint Automata

Oliver B. MartinBrian C. Williams{omartin, williams}@mit.eduMassachusetts Institute of Technology

Michel D. [email protected] Propulsion Laboratory

Page 2: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Space Shuttle Discovery Launch

Wednesday, July 13, 2005 @ 3:51pm ET

Page 3: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Paper Contribution

Problem1. Mode estimates were approximated by tracking a set

of most likely trajectories

2. No observation function

Solution1. Compute estimates by using the HMM belief state

update equations directly in the utility function

2. Generate a compact set of observation probability rules offline using a model counting method

Page 4: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

ClosedClosed

ValveValve

OpenOpenStuckStuckopenopen

StuckStuckclosedclosed

OpenOpen CloseClose

0. 010. 01

0. 010. 01

0.010.01

0.010.01

inflow = outflow = 0

Page 5: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Estimation of PCCA

Belief State Evolution visualized with a Trellis Diagram

Compute the belief state for each estimation cycle

0.7

0.2

0.1

Complete history knowledge is captured in a single belief state by exploiting the Markov property

Belief states are computed using the HMM belief state update equations

closed open stuck-open

cmd=open, obs=flow cmd=closed, obs=no-flow

Page 6: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

HMM Belief State Update Equations Propagation Equation

Update Equationt t+1

Page 7: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Page 8: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Approximations to PCCA Estimation

1. The belief state can be accurately approximated by maintain the k most likely estimates

2. The probability of each state can be accurately approximated by the most likely trajectory to that state

3. The observation probability can be reduced to 1.0 for observations consistent with the state, and 0.0 for observations inconsistent with the state

t+11.0 or 0.0

Page 9: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Previous Estimation Approach

Livingstone Williams and Nayak, AAAI-96

Livingstone 2 Kurien and Nayak, AAAI-00

Titan Williams et al., IEEE ‘03

Best-First Trajectory Enumeration (BFTE)

Deep Space One

Earth Observing One

0.4

0.2

Page 10: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Increased estimator accuracy Maintain compact belief state encoding

Maintain estimator responsiveness Solved efficiently as an Optimal Constraint

Satisfaction Problem (OCSP) using Conflict-directed A* with a tight admissible heuristic

Best-First Belief State Enumeration

0.7

0.2

Page 11: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Conflict-directed A* heuristic HMM propagation equation:

Split into admissible heuristic for partial assignments:

Page 12: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Accuracy Results EO-1 Model (12 components) 30 estimation cycles (nominal operations)

Page 13: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Complexity Analysis

Best-First Belief State Enumeration (BFBSE) n·k arithmetic computations 1 OCSP

Best-First Trajectory Enumeration (BFTE) n arithmetic computation k OCSPs

Recall A* best case: n·b, worst case: bn

Page 14: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Performance Results Heap Memory Usage

Page 15: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Performance Results Run-time (1.7 GHz Pentium M, 512MB RAM)

Page 16: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Current Work

Extend BFBSE to use both HMM belief state update equations

Use observation probabilities within the conflict-directed search to avoid unlikely candidates

Done efficiently using a conditional probability table

(Published in S.M. Thesis and i-SAIRAS ’05)

Page 17: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Conclusion

Best-First Belief State Enumeration

Increased PCCA estimator accuracy by computing the Optimal Constraint Satisfaction Problem (OCSP) utility function directly from the HMM propagation equation.

Improved PCCA estimator performance by framing estimation as a single OCSP.

Page 18: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Backup Slides

Page 19: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Simple Two Switch Scenario Sw1=on

Sw2=on

Sw1=bknSw2=bkn

Sw1=onSw2=on

Sw1=onSw2=bkn

0.7

0.3

0.343

0.063

0.147

0.147

(0.7)(0.7)

(0.7)(0.3)

(1)(1) Most likely trajectories (k=2) Very reactive Gross lower-bound Extraneous computation

Multiple OCSP instances Estimates generated and thrown away Sw1=bkn

Sw2=bkn

0.3

Sw1=bknSw2=on

Sw1=bknSw2=bkn

(0.3)(0.7)

(0.3)(0.3)

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Page 20: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Approximate Belief State Enumeration (k=2)

Sw1=onSw2=on

Sw1=bknSw2=bkn

Sw1=onSw2=on

Sw1=onSw2=bkn

Sw1=bknSw2=on

Sw1=bknSw2=bkn

0.7

0.3

0.343

0.363

0.147

0.147

(0.7)(0.7)

(0.7)(0.3)

(0.3)(0.3)

(0.3)(0.7)

(1)(1)

Complete probability Assuming approximate belief state

is the true belief state Tighter lower bound

Single OCSP Best-first order using A* Unfortunately, Not MPI

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Page 21: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Belief State Update Complete probability distribution is calculated using the

Hidden Markov Model Belief State Update equations A Priori Probability:

Solved as single OCSP A* Cost Function:

( )1 0, 0, 1 0, 0, 1P( | , ) P( | , ) P( | , )t tt tk ii

t t t t t t t t tj k k k k i

x ss

s o x v x v s ob

m m m+ < > < > + < > < - >

ÎÎ

æ ö÷ç ÷¢= = =ç ÷ç ÷÷çè øå Õ

Sw1=onSw2=on

Sw1=bknSw2=bkn

Sw1=bknSw2=bkn

0.7

0.3

0.363(0.3)(0.3)

(1)(1)

( ) ( )1 1 0, 0, 1

( )( ) P( | , ) max P( | , ) P( | , )

t tt t h hg hi

t t t t t t t t tg g g g h h h h i

v xx n x ns

f n x v x v x v x v s ob

m m m+ + < > < - >

¢ÎÎ ÏÎ

æ ö÷ç ÷¢ ¢ç= = = × = = × ÷ç ÷ç ÷è øå Õ Õ

D

g(n) h(n)

Page 22: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Three Switch EnumerationExample

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Sw1=onSw2=bknSw3=on

0.7

0.3

0.343

0.147

0.21

0.153

(0.7)(0.7)(1)(0.7)

Assume No commands No observations

Enumeration scheme same as most likely trajectories Expand tree by adding mode assignments Only difference is the cost function

0.147

Sw1=bknSw2=onSw3=bkn

Sw1=onSw2=bknSw3=on

Sw1=bknSw2=onSw3=bkn

Sw1=bknSw2=bknSw3=bkn

Sw1=bknSw2=bknSw3=on

Sw1=onSw2=bknSw3=bkn

(0.3)(0.3)(1)(0.3)

(1)(1)(0.7)(1)

(0.3)(0.3)(1)(0.7)

(0.7)(0.7)(1)(0.3)

(1)(1)(0.3)(1)

Page 23: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Why max in f(n)?

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Sw1=onSw2=bknSw3=on

0.7

0.3Sw1=bknSw2=onSw3=bkn

Initial Approximate Belief State and Transition Probabilities

A* Cost Function

{ }

0.357 {Sw1 = bkn} ? {Sw1 = on}

f(Sw1=bkn) = f(Sw1=bkn) = (0.3)(0.3)( ? )( ? )·0.7 + (1.0)(1.0)( ? )( ? )·0.3 = 0.51

Which mode assignment next? Sw2=bkn?

f(Sw1=bkn) = f(Sw1=bkn) = (0.3)(0.3)(1.0)( ? )·0.7 + (1.0)(1.0)(0.3)( ? )·0.3 = 0.3 Or Sw2=on?

f(Sw1=bkn) = f(Sw1=bkn) = (0.3)(0.3)(0.0)( ? )·0.7 + (1.0)(1.0)(0.7)( ? )·0.3 = 0.21

f(Sw1=bkn) = f(Sw1=bkn) = (0.3)(0.3)(1.0)(0.3)·0.7 + (1.0)(1.0)(0.3)(1.0)·0.3 = 0.153 Best Best f(n)f(n)??

Combinatorial Problem{Sw1 = bkn}: 0.51

{Sw2 = bkn}: 0.3 {Sw2 = on}: 0.21

{Sw3 = bkn}0.153

{Sw3 = on}0.147

{Sw3 = bkn}0.21

{Sw3 = on}0.0

( ) ( )1 1 0, 0, 1

( )( ) P( | , ) max P( | , ) P( | , )

t tt t h hg hi

t t t t t t t t tg g g g h h h h i

v xx n x ns

f n x v x v x v x v s ob

m m m+ + < > < - >

¢ÎÎ ÏÎ

æ ö÷ç ÷¢ ¢ç= = = × = = × ÷ç ÷ç ÷è øå Õ Õ

D

Page 24: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

GuaranteedAdmissible

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Sw1=onSw2=bknSw3=on

0.7

0.3Sw1=bknSw2=onSw3=bkn

Initial Approximate Belief State and Transition Probabilities

A* Cost Function

{ }

0.357 {Sw1 = bkn} ? {Sw1 = on}

f(Sw1=bkn) = f(Sw1=bkn) = (0.3)(0.3)( ? )( ? )·0.7 + (1.0)(1.0)( ? )( ? )·0.3 = 0.51

Maximize each term regardless of mode assignment to sw2

f(Sw1=bkn) = f(Sw1=bkn) = (0.3)(0.3)(1.0)( ? )·0.7 + (1.0)(1.0)(0.7)( ? )·0.3 = 0.42

f(Sw1=bkn) = f(Sw1=bkn) = (0.3)(0.3)(1.0)(0.7)·0.7 + (1.0)(1.0)(0.7)(1.0)·0.3 = 0.357

Best Best f(n)f(n)??

Combinatorial Problem{Sw1 = bkn}: 0.51

{Sw2 = bkn}: 0.3 {Sw2 = on}: 0.21

{Sw3 = bkn}0.153

{Sw3 = on}0.147

{Sw3 = bkn}0.21

{Sw3 = on}0.0

( ) ( )1 1 0, 0, 1

( )( ) P( | , ) max P( | , ) P( | , )

t tt t h hg hi

t t t t t t t t tg g g g h h h h i

v xx n x ns

f n x v x v x v x v s ob

m m m+ + < > < - >

¢ÎÎ ÏÎ

æ ö÷ç ÷¢ ¢ç= = = × = = × ÷ç ÷ç ÷è øå Õ Õ

D

Page 25: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Tree Expansion

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Sw1=onSw2=bknSw3=on

0.7

0.3Sw1=bknSw2=onSw3=bkn

Initial Approximate Belief State and Transition Probabilities

A* Cost Function

{ }

0.357 {Sw1 = bkn} 0.343 {Sw1 = on}

f(Sw1=on) = f(Sw1=on) = (0.7)(0.7)(1.0)(0.7)·0.7 + (0.0)(0.0)(0.7)(1.0)·0.3 = 0.343

Due to no MPI, all children of each node must be placed on the queue before deciding which node to expand next

- Node is in the queue- Node is in the queue

- Node was dequeued and expanded- Node was dequeued and expanded

( ) ( )1 1 0, 0, 1

( )( ) P( | , ) max P( | , ) P( | , )

t tt t h hg hi

t t t t t t t t tg g g g h h h h i

v xx n x ns

f n x v x v x v x v s ob

m m m+ + < > < - >

¢ÎÎ ÏÎ

æ ö÷ç ÷¢ ¢ç= = = × = = × ÷ç ÷ç ÷è øå Õ Õ

D

Page 26: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Tree Expansion

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Sw1=onSw2=bknSw3=on

0.7

0.3Sw1=bknSw2=onSw3=bkn

Initial Approximate Belief State and Transition Probabilities

A* Cost Function

{ }

0.357 {Sw1 = bkn} 0.343 {Sw1 = on}

f(Sw1=bkn, Sw2=bkn) = f(Sw1=bkn, Sw2=bkn) = (0.3)(0.3)(1.0)(0.7)·0.7 + (1.0)(1.0)(0.3)(1.0)·0.3 = 0.237

Sw1=bkn has the best cost so it is dequeued and its children are expanded

0.237 {Sw1 = bkn, Sw2=bkn} 0.21 {Sw1 = bkn, Sw2=on}

f(Sw1=bkn, Sw2=on) = f(Sw1=bkn, Sw2=on) = (0.3)(0.3)(0.0)(0.7)·0.7 + (1.0)(1.0)(0.7)(1.0)·0.3 = 0.21

( ) ( )1 1 0, 0, 1

( )( ) P( | , ) max P( | , ) P( | , )

t tt t h hg hi

t t t t t t t t tg g g g h h h h i

v xx n x ns

f n x v x v x v x v s ob

m m m+ + < > < - >

¢ÎÎ ÏÎ

æ ö÷ç ÷¢ ¢ç= = = × = = × ÷ç ÷ç ÷è øå Õ Õ

D

Page 27: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Tree Expansion

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Sw1=onSw2=bknSw3=on

0.7

0.3Sw1=bknSw2=onSw3=bkn

Initial Approximate Belief State and Transition Probabilities

A* Cost Function

{ }

0.357 {Sw1 = bkn} 0.343 {Sw1 = on}

f(Sw1=on, Sw2=bkn) = f(Sw1=on, Sw2=bkn) = (0.7)(0.7)(1.0)(0.7)·0.7 + (0.0)(0.0)(0.3)(1.0)·0.3 = 0.343

Sw1=on is now the next best and is dequeued and expanded

0.237 {Sw1 = bkn, Sw2=bkn} 0.21 {Sw1 = bkn, Sw2=on}

f(Sw1=on, Sw2=on) = f(Sw1=on, Sw2=on) = (0.7)(0.7)(0.0)(0.7)·0.7 + (0.0)(0.0)(0.3)(1.0)·0.3 = 0.0

0.343 {Sw1 = on, Sw2=bkn} 0.0 {Sw1 = on, Sw2=on}

( ) ( )1 1 0, 0, 1

( )( ) P( | , ) max P( | , ) P( | , )

t tt t h hg hi

t t t t t t t t tg g g g h h h h i

v xx n x ns

f n x v x v x v x v s ob

m m m+ + < > < - >

¢ÎÎ ÏÎ

æ ö÷ç ÷¢ ¢ç= = = × = = × ÷ç ÷ç ÷è øå Õ Õ

D

Page 28: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Tree Expansion

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Sw1=onSw2=bknSw3=on

0.7

0.3Sw1=bknSw2=onSw3=bkn

Initial Approximate Belief State and Transition Probabilities

A* Cost Function

{ }

0.357 {Sw1 = bkn} 0.343 {Sw1 = on}

{Sw1=on, Sw2=bkn} is now the next best

0.237 {Sw1 = bkn, Sw2=bkn} 0.21 {Sw1 = bkn, Sw2=on} 0.343 {Sw1 = on, Sw2=bkn} 0.0 {Sw1 = on, Sw2=on}

0.147 {Sw1 = on, Sw2=bkn, Sw3=bkn} 0.343 {Sw1 = on, Sw2=bkn, Sw3=on}

( ) ( )1 1 0, 0, 1

( )( ) P( | , ) max P( | , ) P( | , )

t tt t h hg hi

t t t t t t t t tg g g g h h h h i

v xx n x ns

f n x v x v x v x v s ob

m m m+ + < > < - >

¢ÎÎ ÏÎ

æ ö÷ç ÷¢ ¢ç= = = × = = × ÷ç ÷ç ÷è øå Õ Õ

D

Page 29: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Tree Expansion

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Sw1=onSw2=bknSw3=on

0.7

0.3Sw1=bknSw2=onSw3=bkn

Initial Approximate Belief State and Transition Probabilities

A* Cost Function

{ }

0.357 {Sw1 = bkn} 0.343 {Sw1 = on}

{Sw1=on, Sw2=bkn, Sw3=on} is the best estimate!

0.237 {Sw1 = bkn, Sw2=bkn} 0.21 {Sw1 = bkn, Sw2=on} 0.343 {Sw1 = on, Sw2=bkn} 0.0 {Sw1 = on, Sw2=on}

0.147 {Sw1 = on, Sw2=bkn, Sw3=bkn} 0.343 {Sw1 = on, Sw2=bkn, Sw3=on}

Continue expanding to get more estimates in best-first order

( ) ( )1 1 0, 0, 1

( )( ) P( | , ) max P( | , ) P( | , )

t tt t h hg hi

t t t t t t t t tg g g g h h h h i

v xx n x ns

f n x v x v x v x v s ob

m m m+ + < > < - >

¢ÎÎ ÏÎ

æ ö÷ç ÷¢ ¢ç= = = × = = × ÷ç ÷ç ÷è øå Õ Õ

D

Page 30: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Three Switch EnumerationExample

Switch ont+1 bknt+1

ont 0.7 0.3

bknt 0 1

Sw1=onSw2=bknSw3=on

0.7

0.3

0.343

0.147

0.21

0.153

(0.7)(0.7)(1)(0.7)

Assume No commands No observations

Enumeration scheme same as most likely trajectories Expand tree by adding mode assignments Only difference is the cost function

0.147

Sw1=bknSw2=onSw3=bkn

Sw1=onSw2=bknSw3=on

Sw1=bknSw2=onSw3=bkn

Sw1=bknSw2=bknSw3=bkn

Sw1=bknSw2=bknSw3=on

Sw1=onSw2=bknSw3=bkn

(0.3)(0.3)(1)(0.3)

(1)(1)(0.7)(1)

(0.3)(0.3)(1)(0.7)

(0.7)(0.7)(1)(0.3)

(1)(1)(0.3)(1)

Page 31: Oliver B. Martin Brian C. Williams {omartin, williams}@mit

July 13, 2005

AAAI-05

Summary Approximate belief state enumeration successfully

framed as an OCSP with an admissible heuristic for A* Still only considers a priori probability

Performance Impact Uses same OCSP scheme as the most likely trajectories

algorithm Changed only by using a different cost function Conflicts can still be used to prune branches A* is worse case exponential in the depth of the tree but the tree

depth will not change Only one OCSP solver necessary

No redundant conflicts to be generated No extraneous estimates generated and thrown out

MPI no longer holds Queue will contain a larger number of implicants