decision making in the presence of uncertaintymilos/courses/cs1571-fall06/lectures/cla… · cs...

25
CS 1571 Intro to AI M. Hauskrecht CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht [email protected] 5329 Sennott Square Decision making in the presence of uncertainty CS 1571 Intro to AI M. Hauskrecht Administration Solutions to HW-9 and HW-10 will be posted on the course web page Office hours: Milos: Thursday 2:00-3:30pm Swapna: Thursday 5:45 pm - 7:15 pm Final exam: December 11, 2006 12:00-1:50pm, 5129 Sennott Square

Upload: others

Post on 18-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

1

CS 1571 Intro to AI M. Hauskrecht

CS 1571 Introduction to AILecture 26

Milos [email protected] Sennott Square

Decision making in the presenceof uncertainty

CS 1571 Intro to AI M. Hauskrecht

Administration• Solutions to HW-9 and HW-10

– will be posted on the course web page

• Office hours:– Milos: Thursday 2:00-3:30pm– Swapna: Thursday 5:45 pm - 7:15 pm

• Final exam:– December 11, 2006– 12:00-1:50pm, 5129 Sennott Square

Page 2: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

2

CS 1571 Intro to AI M. Hauskrecht

Decision-making in the presence of uncertainty

• Computing the probability of some event may not be our ultimate goal

• Instead we are often interested in making decisions about our future actions so that we satisfy goals

• Example: medicine– Diagnosis is typically only the first step – The ultimate goal is to manage the patient in the best

possible way. Typically many options available:• Surgery, medication, collect the new info (lab test)• There is an uncertainty in the outcomes of these

procedures: patient can be improve, get worse or even die as a result of different management choices.

CS 1571 Intro to AI M. Hauskrecht

Decision-making in the presence of uncertainty

Main issues:• How to model the decision process with uncertain

outcomes in the computer ?• How to make decisions about actions in the presence of

uncertainty?

The field of decision-making studies ways of making decisions in the presence of uncertainty.

Page 3: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

3

CS 1571 Intro to AI M. Hauskrecht

Decision making example.Assume we want to invest $100 for 6 months • We have 4 choices:

1. Invest in Stock 12. Invest in Stock 23. Put money in bank4. Keep money at home

Stock 1

Stock 2

Bank Stock 1 value can go up or down:Up: with probability 0.6Down: with probability 0.4

0.6

0.4

110

90

Home

100

(up)

(down)

CS 1571 Intro to AI M. Hauskrecht

Decision making example.Assume we want to invest $100 for 6 months • We have 4 choices:

1. Invest in Stock 12. Invest in Stock 23. Put money in bank4. Keep money at home

Stock 1

Stock 2

Bank Stock 1 value can go up or down:Up: with probability 0.6Down: with probability 0.4

0.6

0.4

110

90

Monetary Outcomesfor up and down states

Home

100

(up)

(down)

Page 4: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

4

CS 1571 Intro to AI M. Hauskrecht

Decision making example.

Investing of $100 for 6 months

Stock 1

Stock 2

Bank

0.6

0.4

110

90

Monetaryoutcomesfor differentstates

0.4

0.6

140

80

1011.0

1001.0

Home

(up)

(down)

(up)

(down)

CS 1571 Intro to AI M. Hauskrecht

Decision making example.

We need to make a choice whether to invest in Stock 1 or 2, put money into bank or keep them at home. But how?

Stock 1

Stock 2

Bank

0.6

0.4

110

90

Monetaryoutcomesfor differentscenarios

0.4

0.6

140

80

1011.0

1001.0

Home

?(up)

(down)

(up)

(down)

Page 5: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

5

CS 1571 Intro to AI M. Hauskrecht

Decision making example.Assume the simplified problem with the Bank and Home

choices only.The result is guaranteed – the outcome is deterministic

What is the rational choice assuming our goal is to make money?

Bank 1011.0

1001.0Home

CS 1571 Intro to AI M. Hauskrecht

Decision making. Deterministic outcome.Assume the simplified problem with the Bank and Home

choices only.These choices are deterministic.

Our goal is to make money. What is the rational choice?Answer: Put money into the bank. The choice is always

strictly better in terms of the outcomeBut what to do if we have uncertain outcomes?

Bank 1011.0

1001.0Home

Page 6: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

6

CS 1571 Intro to AI M. Hauskrecht

Decision making. Stochastic outcome

Stock 1

0.6

0.4

110

100

Stock 2

Bank101

1.0

• How to quantify the goodness of the stochastic outcome?We want to compare it to deterministic and other stochastic outcomes.

?

90

(up)

(down)

CS 1571 Intro to AI M. Hauskrecht

Decision making. Stochastic outcome

Stock 1

0.6

0.4

110

100

Stock 2

Bank101

1.0

• How to quantify the goodness of the stochastic outcome?We want to compare it to deterministic and other stochastic outcomes.

Idea: Use expected value of the outcome

90

(up)

(down)

Page 7: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

7

CS 1571 Intro to AI M. Hauskrecht

Expected value

Stock 1

0.6

0.4

110

100

• Let X be a random variable representing the monetary outcome with a discrete set of values .

• Expected value of X is:

Intuition: Expected value summarizes all stochastic outcomes into a single quantity.

• Example:

• What is the expected value of the outcome of Stock 1 option?

∑Ω∈

==Xx

xXxPXE )()(

90

CS 1571 Intro to AI M. Hauskrecht

Expected value

Stock 1

0.6

0.4

110

90100

• Let X be a random variable representing the monetary outcome with a discrete set of values .

• Expected value of X is:

• Expected value summarizes all stochastic outcomes into a single quantity

• Example:

Expected value for the outcome of the Stock 1 option is:

∑Ω∈

==Xx

xXxPXE )()(

1023666904.01106.0 =+=×+×

102

Page 8: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

8

CS 1571 Intro to AI M. Hauskrecht

Expected values

Investing $100 for 6 months

Stock 1

Stock 2

Bank

0.6

0.4

110

90

0.4

0.6

140

80

1011.0

1001.0

Home

102

1023666904.01106.0

=+=×+×

?

(up)

(down)

(up)

(down)

CS 1571 Intro to AI M. Hauskrecht

Expected values

Investing $100 for 6 months

Stock 1

Stock 2

Bank

0.6

0.4

110

90

0.4

0.6

140

80

1011.0

1001.0

Home

102

1023666904.01106.0

=+=×+×

1044856806.01404.0

=+=×+×

104

(up)

(down)

(up)

(down)

Page 9: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

9

CS 1571 Intro to AI M. Hauskrecht

Expected values

Investing $100 for 6 months

Stock 1

Stock 2

Bank

0.6

0.4

110

90

0.4

0.6

140

80

1011.0

1001.0

Home

102

1023666904.01106.0

=+=×+×

1044856806.01404.0

=+=×+×

104

?

(up)

(down)

(up)

(down)

CS 1571 Intro to AI M. Hauskrecht

Expected values

Investing $100 for 6 months

Stock 1

Stock 2

Bank

0.6

0.4

110

90

0.4

0.6

140

80

1011.0

1001.0

Home

102

1023666904.01106.0

=+=×+×

1044856806.01404.0

=+=×+×

104

101

1011010.1 =×

(up)

(down)

(up)

(down)

Page 10: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

10

CS 1571 Intro to AI M. Hauskrecht

Expected values

Investing $100 for 6 months

Stock 1

Stock 2

Bank

0.6

0.4

110

90

0.4

0.6

140

80

1011.0

1001.0

Home

102

1023666904.01106.0

=+=×+×

1044856806.01404.0

=+=×+×

104

101

1011010.1 =×

?

(up)

(down)

(up)

(down)

CS 1571 Intro to AI M. Hauskrecht

Expected values

Investing $100 for 6 months

Stock 1

Stock 2

Bank

0.6

0.4

110

90

0.4

0.6

140

80

1011.0

1001.0

Home

102

1023666904.01106.0

=+=×+×

1044856806.01404.0

=+=×+×

104

101

1011010.1 =×

1001000.1 =×

100

(up)

(down)

(up)

(down)

Page 11: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

11

CS 1571 Intro to AI M. Hauskrecht

Selection based on expected valuesThe optimal action is the option that maximizes the

expected outcome:

Stock 1

Stock 2

Bank

0.6

0.4

110

90

0.4

0.6

140

80

1011.0

1001.0

Home

102

104

101

100

(up)

(down)

(up)

(down)

CS 1571 Intro to AI M. Hauskrecht

Relation to the game search• Game search: minimax algorithm

• considers the rational opponent and its best move• Decision making: maximizes the expectation

• play against the nature - stochastic non-malicious “opponent”

Stock 1

Stock 2

Bank

0.6

0.4

110

90

0.4

0.6

140

80

1011.0

1001.0

Home

102

104

101

100

(up)

(down)

(up)

(down)

Page 12: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

12

CS 1571 Intro to AI M. Hauskrecht

(Stochastic) Decision tree

• Decision tree:

• decision node• chance node• outcome (value) node

Stock 1

Stock 2

Bank

0.6

0.4

110

90

0.4

0.6

140

80

1011.0

1001.0

Home

102

104

101

100

(up)

(down)

(up)

(down)

CS 1571 Intro to AI M. Hauskrecht

Sequential (multi-step) problems

The decision tree can be build to capture multi-step decision problems:

• Choose an action• Observe the stochastic outcome• And repeat

How to make decisions for multi-step problems?• Start from the leaves of the decision tree (outcome nodes)• Compute expectations at chance nodes• Maximize at the decision nodesAlgorithm is sometimes called expectimax

Page 13: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

13

CS 1571 Intro to AI M. Hauskrecht

Assume:• Two investment periods• Two actions: stock and bank

Multi-step problem example

Stock

Bank

0.5

0.4

0.6

1.0

0.5

1.0

1.0

1.0

0.5

0.5

0.50.5

(up)

(down)

(up)

(down)

(up)

(down)

(up)

(down)

200

130

60

90

140

100

125

80

105

Stock

Bank

Stock

Bank

Stock

Bank

CS 1571 Intro to AI M. Hauskrecht

Assume:• Two investment periods• Two actions: stock and bank

Multi-step problem example

Stock

Bank

0.4

0.6

1.0

1.0

1.0

1.0

0.5

0.5

0.50.5

150

0.5

0.5(up)

(down)

(up)

(down)

(up)

(down)

(up)

(down)

200

130

60

90

140

100

125

80

105

Stock

Bank

Stock

Bank

Stock

Bank

Page 14: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

14

CS 1571 Intro to AI M. Hauskrecht

Assume:• Two investment periods• Two actions: stock and bank

Multi-step problem example

Stock

Bank

0.4

0.6

1.0

1.0

1.0

1.0

0.5

0.5

0.50.5

125

150

95

90

110

105

0.5

0.5(up)

(down)

(up)

(down)

(up)

(down)

(up)

(down)

200

130

60

90

140

100

125

80

105

Stock

Bank

Stock

Bank

Stock

Bank

CS 1571 Intro to AI M. Hauskrecht

Assume:• Two investment periods• Two actions: stock and bank

Multi-step problem example

Stock

Bank

0.4

0.6

1.0

150

1.0

1.0

1.0

0.5

0.5

0.50.5

125

110

150

95

90

110

105

95

0.5

0.5(up)

(down)

(up)

(down)

(up)

(down)

(up)

(down)

200

130

60

90

140

100

125

80

105

Stock

Bank

Stock

Bank

Stock

Bank

Page 15: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

15

CS 1571 Intro to AI M. Hauskrecht

Assume:• Two investment periods• Two actions: stock and bank

Multi-step problem example

Stock

Bank

0.4

0.6

1.0

117

110

150

1.0

1.0

1.0

0.5

0.5

0.50.5

125

110

150

95

90

110

105

95

0.5

0.5(up)

(down)

(up)

(down)

(up)

(down)

(up)

(down)

200

130

60

90

140

100

125

80

105

Stock

Bank

Stock

Bank

Stock

Bank

CS 1571 Intro to AI M. Hauskrecht

Assume:• Two investment periods• Two actions: stock and bank

Multi-step problem example

Stock

Bank

0.4

0.6

1.0

117

110

150

1.0

1.0

1.0

0.5

0.5

0.50.5

125

110

150

95

90

110

105

95

0.5

0.5(up)

(down)

(up)

(down)

(up)

(down)

(up)

(down)

200

130

60

90

140

100

125

80

105

Stock

Bank

Stock

Bank

Stock

Bank

Page 16: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

16

CS 1571 Intro to AI M. Hauskrecht

• Notice that the probability of stock going up and down in the 2nd

step is independent of the 1st step (=0.5)

Multi-step problems. Conditioning.

Stock

Bank

0.4

0.6

1.0110

1.0

1.0

1.0

0.50.5

0.50.5

0.5

0.5(up)

(down)

(up)

(down)

(up)

(down)

(up)

(down)

200

130

60

90

140

100

125

80

105

Stock

Bank

Stock

Bank

Stock

Bank

CS 1571 Intro to AI M. Hauskrecht

• But this may not be the case. In decision trees:– Later outcomes can be conditioned on the earlier

stochastic outcomes and actionsExample: stock movement probabilities. Assume:P(1st=up)=0.4P(2nd=up|1st=up)=0.4P(2nd=up|1st=down)=0.5

Conditioning in the decision tree

Stock

Bank

0.6200

130

0.4

0.6

60

90

100

0.4

1.0

1.0

0.50.5

125

(2nd up)

(2nd down)

(2nd up)

(2nd down)

(1st up)

(1st down)

Stock

Bank

Stock

Bank

Page 17: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

17

CS 1571 Intro to AI M. Hauskrecht

Multi-step problems. Conditioning.Tree Structure: every observedstochastic outcome = 1 branchP(1st=up)=0.4P(2nd=up|1st=up)=0.4P(2nd=up|1st=down)=0.5

140

105

0.6

0.4

0.6

0.4

1.0

1.0

0.5

0.5

(2nd up)

(2nd down)

(2nd up)

(2nd down)

(1st up)

(1st down)

140

80

105

80

Stock

Bank

Stock

Bank

Bank

Stock

0.6

0.4

0.6

0.4

1.0

1.0

0.5

0.5

(2nd up)

(2nd down)

(2nd up)

(2nd down)

(1st up)

(1st down)

200

130

60

90

100

125

Stock

Bank

Stock

Bank

CS 1571 Intro to AI M. Hauskrecht

Trajectory payoffs• Outcome values at leaf nodes (e.g. monetary values)

– Rewards and costs for the path trajectoryExample: stock fees and gains. Assume:Fee per period: $5 paid at the beginningGain for up: 15%, loss for down 10%

Stock

Bank

0.6

0.4

0.6

0.4

1.0

1.0

0.50.5

(2nd up)

(2nd down)

(2nd up)

(2nd down)

(1st up)

(1st down)

Stock

Bank

Stock

Bank

1000

1000-5

(1000-5)*1.15

(1000-5)*1.15-5

[(1000-5)*1.15-5]*1.15=1310.14

[(1000-5)*1.15-5]*0.9=1025.33

1310.14

1025.33

Page 18: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

18

CS 1571 Intro to AI M. Hauskrecht

• The decision tree is rarely given to you directly.– Part of the problem is to construct the tree.

Example: stocks, bonds, bank for k periodsStock:– Probability of stocks going up in the first period: 0.3– Probability of stocks going up in subsequent periods:

• P(kth step=Up| (k -1)th step =Up)=0.4• P(kth step =Up| (k -1)th step=Down)=0.5

– Return if stock goes up: 15 % if down: 10%– Fixed fee per investment period: $5Bonds:– Probability of value up: 0.5, down: 0.5– Return if bond value is going up: 7%, if down: 3%– Fee per investment period: $2Bank:– Guaranteed return of 3% per period, no fee

Constructing a decision tree

CS 1571 Intro to AI M. Hauskrecht

Review

Page 19: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

19

CS 1571 Intro to AI M. Hauskrecht

KR and logic

• Knowledge representation:– Syntax (how sentences are build), Semantics (meaning of

sentences), Computational aspect (how sentences are manipulated)

• Logic:– A formal language for expressing knowledge and ways of

reasoning– Three components:

• A set of sentences• A set of interpretations• The valuation (meaning) function

CS 1571 Intro to AI M. Hauskrecht

Propositional logic

• A language for symbolic reasoning• Language:

– Syntax, Semantics• Satisfiability of a sentence: at least one interpretation under

which the sentence can evaluate to True.• Validity of a sentence: True in all interpretations• Entailment:

is true in all worlds in which KB is true • Inference procedure

– Soundness– Completeness

α=|KBα

α=|KBαi

KBIf thenIf thenα=|KB α

iKB

Page 20: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

20

CS 1571 Intro to AI M. Hauskrecht

Propositional logic

• Logical inference problem: ?– Does KB entail the sentence ?

• Logical inference problem for the propositional logic is decidable.– A procedure (program) that stops in finite time exists

• Approaches:– Truth table approach– Inference rule approach– Resolution refutation

• Normal forms: DNF, CNF, Horn NF (conversions)

αα=|KB

α=|KB if and only if )( α¬∧KB is unsatisfiable

CS 1571 Intro to AI M. Hauskrecht

First order logic

• Deficiencies of propositional logic• First order logic (FOL): allows us to represent objects, their

properties, relations and statements about them– Variables, predicates, functions, quantifiers– Syntax and semantics of the sentences in FOL

• Logical inference problem– Undecidable. No procedure that can decide the entailment

for all possible input sentences in a finite number of steps.• Inference approaches:

– Inference rules– Resolution refutation

α=|KB ?

Page 21: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

21

CS 1571 Intro to AI M. Hauskrecht

First order logic

• Methods for making inferences work with variables:– Variable substitutions– Unification process that takes two similar sentences and

computes the substitution that makes that makes them look the same, if it exists

• Conversions to CNF with universally quantified variables– Used by resolution refutation

• The procedure is refutation- complete

CS 1571 Intro to AI M. Hauskrecht

Knowledge-based systems with HNF

• KBs in Horn normal form:– Not all sentences in FOL can be translated to HNF– Modus ponens is complete for Horn databases

• Inferences with KBs in Horn normal form (HNF)– Forward chaining– Backward chaining

• Production systems– Conflict resolution

Page 22: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

22

CS 1571 Intro to AI M. Hauskrecht

Planning

• Find a sequence of actions that lead to a goal– Much like path search, but for very large domains– Need to represent the dynamics of the world

• Two basic approaches planning problem representation:– Situation calculus

• Explicitly represents situations (extends FOL)• Solving: theorem proving

– STRIPS• Add and delete list• Solving: Search

(Goal progression, Goal regression)• Frame problem

CS 1571 Intro to AI M. Hauskrecht

Planning

• Divide and conquer approach (Sussman’s anomaly)

• State space vs. plan space search

• Partial order (non-linear) planners:– Search the space of partially build plans– Progressive or regressive mode

• Hierarchical planners

Page 23: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

23

CS 1571 Intro to AI M. Hauskrecht

Uncertainty

• Basics of probability:– random variable, values, probability distribution

• Joint probability distribution– Over variables in a set, full joint over all variables– Marginalization (summing out)

• Conditional probability distribution

• Product rule• Chain rule• Bayes rule

0)( s.t. )(

),()|( ≠= BPBPBAPBAP

)()|(),( BPBAPBAP =

)(

)()|()|(BP

APABPBAP =

CS 1571 Intro to AI M. Hauskrecht

Uncertainty

Full joint probability distribution– Over variables in a set, full joint over all variables

Two important things to remember:• Any probabilistic query can be computed from the full joint

distribution• Full joint distribution can be expressed as a product of

conditionals via the chain rule

Page 24: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

24

CS 1571 Intro to AI M. Hauskrecht

Bayesian belief networks• Full joint distribution over all random variables defining the

domain can be very large– Complexity of a model, inferences, acquisition

• Solution: Bayesian belief networks (BBNs)• Two components of BBNs:

– Structure (directed acyclic graph)– Parameters (conditional prob. distributions)

• BBN build upon conditional independence relations:

• Joint probability distribution for BBNs:– Product of local (variable-parents) conditionals

)|()|()|,( CBPCAPCBAP =

))(|(),..,,(,..1

21 ∏=

=ni

iin XpaXXXX PP

CS 1571 Intro to AI M. Hauskrecht

Bayesian belief networks

• Model of joint distribution:– Reduction in the number of parameters

• Inferences:– Queries on joint probabilities– Queries on conditionals expressed as ratios of joint

probabilities– Joint probabilities can be expressed in terms of full joints– Full joints are product of local conditionals

• Smart way to do inferences:– Interleave sums and products (variable elimination)

Page 25: Decision making in the presence of uncertaintymilos/courses/cs1571-Fall06/lectures/Cla… · CS 1571 Introduction to AI Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott

25

CS 1571 Intro to AI M. Hauskrecht

Decision-making in the presence of uncertainty

• Decision tree:– Decision nodes (choices are made)– Chance nodes (reflect stochastic outcome)– Outcomes (value) nodes (value of the end-situation)

• Rational choice:– Decision-maker tries to optimize the expected value