Continuous Time and Resource Uncertainty
CSE 574 Lecture, Spring '03
Stefan B. Sigurdsson



(Big Mars Rover Picture)

Lecture Overview

Context
– Classical planning
– The Mars Rover domain
– Relaxing the assumptions
– Q: What's so different?

Innovation
Discussion

(Shakey Picture)

Slide shamelessly lifted from http://www.cs.nott.ac.uk/~bsl/G53DIA/Slides/Deliberative-architectures-I.pdf

STRIPS-Like Planning

World Description
– Propositional logic
– Closed world assumption
– Finite and static
– Complete knowledge
– Discrete time
– No exogenous effects

Goal Description
– Attainment: "win or lose"
– Conjunctions of positive literals

Actions
– Conjunctive precondition
– STRIPS operators
– Conjunctive effect (add/delete)
– Instantaneous
– Sequential
– Deterministic

Plan…
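As a concrete sketch of this model (hypothetical code, not from the lecture): states are sets of positive literals under the closed world assumption, and an operator is a conjunctive precondition plus add/delete lists.

```python
# Minimal sketch of a STRIPS-style operator. States are sets of positive
# literals; anything not in the set is false (closed world assumption).

def applicable(state, pre):
    """Conjunctive precondition: every literal must hold in the state."""
    return pre <= state

def apply_op(state, pre, add, delete):
    """Instantaneous, deterministic effect: remove the delete list, add the add list."""
    if not applicable(state, pre):
        raise ValueError("precondition not satisfied")
    return (state - delete) | add

# Goal attainment is "win or lose": a conjunction of positive literals.
state = {"at(rover, base)", "charged"}
new_state = apply_op(
    state,
    pre={"at(rover, base)", "charged"},
    add={"at(rover, rock)"},
    delete={"at(rover, base)"},
)
print(sorted(new_state))  # ['at(rover, rock)', 'charged']
```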

(Big Mars Rover Picture)

The Mars Rover Domain

Robot control, with…
– Positioning and navigation
– Complex choices (goals and actions)
– Rich utility model
– Continuous time and concurrency
– Uncertain resource consumption
– Metric quantities
– Very high stakes!

But alone in a finite, static universe

Resources? Metric Quantities?

What are those?

Various flavors:
– Exclusive (camera arm)
– Shared (OS scheduling)
– Metric quantity (fuel, power, disk space)

Uncertainty

Alright, What's It Really Mean?

Is This Really A Planning Problem?

Better suited to OR/DT-type scheduling?

– Time, resources, metric quantities, concurrency, complicated goals/rewards…

Complex, inter-dependent activities
– Select, calibrate, use, reuse, recalibrate sensors
– OR-type scheduling can't handle rich choices

Insight: Maybe we can borrow some tricks?

Can Planners Scale Up?

Large plans
– Sequences of ~100 actions

Where do we start?
– POP? (Branch factors are too big)
– MDP? (Complete policy is too large)
– Graph/SATplan? (Discrete representations)

Which Extensions First?

Metric quantities
– Time
– Resources

Resource uncertainty
Concurrency

What about non-determinism? Reasonable for Graphplan?

A (Very Incomplete) Research Timeline

1971 STRIPS (Fikes/Nilsson)
1989 ADL (Pednault)
1991 PEDESTAL (McDermott)
1992 UCPOP (Penberthy/Weld); SENSp (Etzioni et al.); CNLP (Peot/Smith)
1993 Buridan (Kushmerick et al.)
1994 C-Buridan (Draper et al.); JIC Scheduling (Drummond et al.); HSTS (Muscettola); Zeno (Penberthy/Weld); Softbots (Weld/Etzioni); MDP (Williamson/Hanks)
1995 DRIPS (Haddawy et al.); IxTeT (Laborie/Ghallab)
1997 IPP (Koehler et al.)

(Timeline annotations: not implemented; ADL implementation; sensing; conformant; contingent; planning + scheduling; metric time/resources; safe planning; decision-theory goals; uncertain utility; shared resources)

1998 PGraphplan (Blum/Langford); Weaver (Blythe); PUCCINI (Golden); CGP (Smith/Weld); SGP (Weld et al.)
1999 Mahinur (Onder/Pollack); ILP-PLAN (Kautz/Walser); TGP (Smith/Weld); LPSAT (Wolfman/Weld)
2000 T-MDP (Boyan/Littman); HSTS/RA (Jónsson et al.)

Since then?

(Timeline annotations: uncertain/dynamic; sensing; conformant; contingent; resources)

Domain Assumptions

– Expressive logic
– Non-determinism
– Observation
– Goal model
– Plan utility
– Durative actions
– Complex concurrence
– Continuous time
– Metric quantities
– Branching factor
– Resource uncertainty
– Resource constraints
– Goal selection
– Safe planning
– Exogenous events

(Chart: planners STRIPS, UCPOP, CGP, CNLP, SENSp, Buridan, Weaver, C-Buridan, MDP, PO-MDP, S-MDP, T-MDP, F-MDP, and LPSAT plotted along the assumption axes, from "Classical" to "Bleeding edge," with the Mars Rover domain at the far end)

Select contingencies

Serialized goals?

Brain-teaser: Domain Spec

State space S
– Cartesian product of continuous and discrete axes (time, position, achievements, energy…)

Initial state si
– Probability distribution

Domain theory
– Concurrent, non-deterministic, uncertain

What else? (S, si, , …)
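One way to take the brain-teaser: a spec would at least need the mixed state space, the initial state as a distribution, and a concurrent, uncertain domain theory. A hypothetical sketch (all names below are illustrative, not from the paper):

```python
# Hypothetical sketch of what a domain spec tuple (S, si, ...) might bundle.
from dataclasses import dataclass
from typing import Callable

@dataclass
class RoverDomainSpec:
    continuous_axes: list        # part of S: e.g. time, position, energy
    discrete_axes: list          # part of S: e.g. achievements
    sample_initial_state: Callable[[], dict]  # si given as a probability distribution
    actions: list                # domain theory: concurrent, non-deterministic, uncertain

spec = RoverDomainSpec(
    continuous_axes=["time", "position", "energy"],
    discrete_axes=["achievements"],
    sample_initial_state=lambda: {"time": 0.0, "energy": 100.0},
    actions=[],
)
print(spec.sample_initial_state())  # {'time': 0.0, 'energy': 100.0}
```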

Brain-teaser: Kalman Filters

Curiously missing from the paper we read (?)

1983 Kalman filters paper: Voyager enters Jupiter orbit through a 30-second window after 11 years in space

Hugh Durrant-Whyte’s robots

Why not for the Mars Rover?

Context Summary

Complex, exciting domain
Pushes the planning envelope
– Expression
– Scaling

Where do we start?

Lecture Overview

Context
Innovation
– Just-in-case planning
– Incremental contingency planning

Discussion

Just-In-Case Planning

Motivated by domain characteristics
– Metric quantities
– Large branch factors

Implications
– Not plan, not policy
– Expanded plan

What about concurrency?

Branch Heuristics

Most probable failure point (scheduling)
Highest utility branch point (planning)

What is the intrinsic difference?

When To Execute A Contingency?

Incremental Contingency Planning Algorithm
Input: Domain description and master plan
Output: Highest-utility branch point
Algorithm:
– Compute value, estimate resources during master plan
– Approximate branch point utilities
– Select highest-utility branch point
– Solve w/ new initial, goal conditions
– Repeat while necessary
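The loop above can be sketched in toy form. Everything here is a hypothetical stand-in for the paper's machinery: branch utilities are taken as precomputed, and "planning" a contingency just returns a placeholder branch.

```python
# Toy sketch of incremental contingency planning: repeatedly select the
# highest-utility branch point, plan a contingency for it, repeat while
# any candidate is still worth expanding.

def incremental_contingency(master_plan, branch_utility, make_branch, threshold=0.0):
    """Greedily attach contingency branches to points of the master plan."""
    contingencies = {}
    candidates = {p: branch_utility(p) for p in master_plan}
    while candidates:
        point = max(candidates, key=candidates.get)  # highest-utility branch point
        if candidates[point] <= threshold:
            break  # no remaining branch point is worth a contingency
        # "Solve w/ new initial, goal conditions" -- stubbed out here.
        contingencies[point] = make_branch(point)
        del candidates[point]
    return contingencies

# Usage with invented branch points and utilities:
utils = {"after_drive": 3.0, "after_dig": 1.5, "after_image": -0.5}
result = incremental_contingency(
    master_plan=["after_drive", "after_dig", "after_image"],
    branch_utility=utils.get,
    make_branch=lambda p: f"contingency-at-{p}",
)
print(result)  # {'after_drive': 'contingency-at-after_drive', 'after_dig': 'contingency-at-after_dig'}
```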

Branch Utility Approximation

… without constructing a plan:
– Construct a plan graph
– Back-propagate utility functions through the plan graph, instead of regression searching
– Compute branch point utilities throughout the input plan
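A minimal sketch of the flavor of this back-propagation, assuming only min/max duration intervals rather than the full distributions the paper propagates:

```python
# Back-propagate time bounds through a linear plan: given a completion
# deadline and per-action (min, max) durations, compute the start window
# before each action, working from the last action backwards.

def back_propagate(deadline, durations):
    """Return the (earliest, latest) start bound before each action."""
    windows = []
    lo, hi = deadline, deadline
    for d_lo, d_hi in reversed(durations):
        # Interval subtraction: starting before this action, we need
        # between d_lo and d_hi time units to reach the previous bound.
        lo, hi = lo - d_hi, hi - d_lo
        windows.append((lo, hi))
    windows.reverse()
    return windows

# Duration bounds in the style of the lecture's example: (1,5), (3,3), (2,2).
print(back_propagate(30, [(1, 5), (3, 3), (2, 2)]))  # [(20, 24), (25, 25), (28, 28)]
```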

Back-Propagating Distributions

Mausam:

“Some parts of the paper are tersely written, which make it a little harder to understand. I got quite confused in the discussion of utility propagation. It would have been nicer had they given some theorems about the soundness of their method.”

Well, me too

Back-Propagating Distributions

(Worked example, developed over many slide builds: a plan with actions A, B, C, D, E, duration bounds (1, 5), (3, 3), (10, 15), (10, 15), (2, 2), propositions p, q, r, s, t, and goals g, g'. Time/utility bounds are back-propagated node by node through the plan graph, yielding intermediate values such as 5, 15, 12, 25 and candidate action orderings such as (CDE), (CDE, ABDE), and (DCE, AB, DABE).)

Utility Estimation

(Slide builds comparing branch utilities of the orderings (CDE, ABDE), (DCE, AB, DABE), and (DCE, ABDE), with values such as 18, 16, 25, 26.)

MAX operator:

(Then combine w/ Monte Carlo results)
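A toy sketch of the MAX combination: each candidate ordering gives utility as a function of remaining time, and the branch's estimated value is the pointwise maximum over candidates. The step functions and numbers below are illustrative, loosely echoing the slides, not the paper's actual utility functions.

```python
# MAX operator over candidate utility functions: the value at time t is
# the best achievable over all candidate orderings.

def max_utility(candidates, t):
    return max(u(t) for u in candidates)

# Step-function utilities: an ordering pays off only if enough time remains.
cde_abde = lambda t: 18 if t >= 25 else 6   # ordering (CDE, ABDE)
dce_abde = lambda t: 16 if t >= 25 else 6   # ordering (DCE, ABDE)

print([max_utility([cde_abde, dce_abde], t) for t in (10, 30)])  # [6, 18]
```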

Lecture Overview

Context
Innovation
Discussion
– Q: Evaluation? Inference?

Evaluation

Optimal branch selection? (Greedy…)

Incremental Contingencies…

Sometimes adding one contingency at a time is non-optimal

Examples?

Incremental Contingencies…

(Figure: a Rain/Shine example with values 1.0, 0, 0.5, 0.5, 0, 1.0 across the actions Work, Go climbing, Exercise)

Sometimes adding one contingency at a time is non-optimal
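A made-up numeric illustration of that pitfall: greedy one-at-a-time selection can lock in a first branch that blocks a better pair. All utilities here are invented for the example; A overlaps with B and C, while B and C are complementary.

```python
# Joint utility of sets of contingency branches (hypothetical values).
joint = {
    frozenset(): 0,
    frozenset("A"): 5, frozenset("B"): 4, frozenset("C"): 4,
    frozenset("AB"): 6, frozenset("AC"): 6, frozenset("BC"): 9,
}

def greedy(budget=2):
    """Add one contingency at a time, always the best single addition."""
    chosen = frozenset()
    for _ in range(budget):
        options = [chosen | {x} for x in "ABC" if x not in chosen]
        chosen = max(options, key=lambda s: joint[s])
    return chosen

best = max((s for s in joint if len(s) == 2), key=lambda s: joint[s])
print(sorted(greedy()), joint[greedy()])  # ['A', 'B'] 6
print(sorted(best), joint[best])          # ['B', 'C'] 9
```

Greedy picks A first (best alone at 5), then can only reach 6; choosing B and C jointly reaches 9.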

Evaluation

Optimal branch selection?
What else?

Inference

Where can we take these ideas?
What can we add to them?
– Optimal branch selection
– Optimistic branching
– Mutexes in plan graph
– Noisy/costly sensors
