bayesian networks using random variables to represent objects and events in the world –various...

13
Bayesian Networks • Using random variables to represent objects and events in the world – Various instantiations to these variables can model the current state of the world • Computing joint probabilities over these variables • Estimating joint probabilities over all random variables often leads to a combinatorial explosion – Estimating probabilities of every combination of values for the variables involved

Upload: morgan-briggs

Post on 17-Dec-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Bayesian Networks• Using random variables to represent

objects and events in the world– Various instantiations to these variables can

model the current state of the world• Computing joint probabilities over these variables

• Estimating joint probabilities over all random variables often leads to a combinatorial explosion– Estimating probabilities of every combination

of values for the variables involved

Page 2: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Conditional independence• Use the chain rule to replace joint

probabilities with conditional ones:P(A1, A2, A3, A4, A5) = P(A1 | A2, A3, A4,A5) · P(A2 | A3, A4, A5) · P(A3 | A4, A5) · P(A5)

• Bayesian Networks allow to reduce the terms even further, by taking into account conditional independence:– P(A|C1, …, Cn, U) = P(A|C1, …, Cn) for some

set U of random variables– Given {C1, …, Cn}, A is independent of U

Page 3: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Conditional independence (cont’d)• The structure of a Bayesian Network

reflects a number of independence assumptions (d-separation criterion)

• Directed arcs between random variables represent conditional dependencies

• Combined with the chain rule, conditional independencies allow us to manipulate smaller conditional probability tables to estimate the joint probabilities

Page 4: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Ind( Xi ; {X1,…,Xi-1}\Pai | Pai )

where Pai are parents of Xi

Formal notation• Each random variable A is conditionally

independent of all other random variables that are not descendants of A, given A’s parents

n

iiin xpxxp

11 )|(),,( pa

n

iiin xxxpxxp

1111 ),|(),,(

Page 5: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Example: modeling real world• “Mary walks outside and finds that the street and

lawn are wet. She concludes that it has just rained recently. Furthermore, she decides that she doesn’t need to water her roses.”

• Mary’s logic:– rain or sprinklers street = wet– rain or sprinklers lawn = wet– lawn = wet soil = moist– soil = moist roses = OK

Page 6: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Mary’s world (cont’d)• Let’s compute the probability of the

following (intuitively unlikely) world event:– The roses are OK– The soil is dry– The lawn is wet– The street is wet– The sprinklers are off– It’s raining

Page 7: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Where are the missing probabilities, e.g., P(sprinklers=F) or P(street=dry|rain=T, sprinklers=T) ?

Page 8: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Computing the probability of a complex event

• P(sprinklers = F, rain = T, street = wet, lawn = wet, soil = dry, roses = OK) =

P(roses = OK | soil = dry) *

P(soil = dry | lawn = wet) *

P(lawn = wet | rain = T, sprinklers = F) * P(street = wet | rain = T, sprinklers = F) * P(sprinklers = F) * P(rain = T) =

0.2 * 0.1 * 1.0 * 1.0 * 0.6 * 0.7 = 0.0084

The event is quite unlikely !

Page 9: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Usage scenarios

• There are 2 types of computations performed with Bayesian Networks– Belief updating

• Computation of probabilities over random variables

– Belief revision• Finding maximally probably global assignment• Given some evidence or observation, our task is to

come up with a set of hypotheses (variable assignments) that together constitute the most satisfactory explanation/interpretation of the evidence at hand.

Page 10: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Belief revision• Let W be a set of all random variables in

the network

• Let e be the evidence (e is a subset of W)

• Any instantiation of all the variables in W that is consistent with e is called an explanation or interpretation of e

• The problem is to find an explanation w* such that P(w* | e) = max P(w | e)– w* is called the most probable explanation

Page 11: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Belief updating• Computing marginal probabilities of a

subset of random variables given the evidence

• Typically, the task is to determine the best instantiation of a single random variable given the evidence

Page 12: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Belief revision - example• Evidence: e = {roses = OK}

• Goal: determine the assignment to all the random variables that maximizes P(W|e)– P(e) is constant, and e is a subset of W

it’s sufficient to maximize P(W)

– Intuitively, non-evidence random variables in W can be viewed as possible hypotheses for e

• Solution: P(sprinklers=F, rain=T, street=wet, lawn=wet, soil=wet, roses=OK) = 0.2646

Page 13: Bayesian Networks Using random variables to represent objects and events in the world –Various instantiations to these variables can model the current

Belief updating - example• Evidence: e = {roses = OK}

• Goal: determine the probability that the lawn is either wet or dry given this observation– P(lawn=dry | roses = OK) = 0.1190– P(lawn=wet | roses = OK) = 0.8810