1 chapter 5 belief updating in bayesian networks bayesian networks and decision graphs finn v....

Chapter 5 Belief Updating in Bayesian Networks

Bayesian Networks and Decision GraphsFinn V. Jensen

Qunyuan ZhangDivision. of Statistical Genomics, CGS

Statistical Genetics ForumMay 7,2007

Contents of the Book

I A practical Guide to Normative Systems

1 Causal and Bayesian Network

2 Building Models

3 Learning, Adaption, and Tuning

4 Decision Graphs

II Algorithms for Normative Systems

5 Belief Updating in Bayesian Network

6 Bayesian Network Analysis Tools

7 Algorithms for Influence Diagrams

Structure of the Book

1 Causal and Bayesian Network

2 Building Models3 Learning, Adaption, and Tuning

5 Belief Updating in Bayesian Network 6 Bayesian Network Analysis Tools

4 Decision Graphs7 Algorithms for Influence Diagrams

I. What is BN?

II. How to create a BN?

III. What can we use BN to do? and how?[to know sth.]

Prob.(a single variable | BN)Joint Prob.(a set variables | BN)Importance of varibales

evidence sensitivity parameter sensitivity

Data conflict analysis[to make decision]

Optimal decision (cost & gain)

BN & Decision Tree

U=V1+V2

AXC V1+V2

P(A,C|D1,T,D2)

“BN” of the Book

Concept of BN

Model Biulding

(known part of structure)

BN Learning

(uncertain part of structure)

(structure & parameters)

Rules & Theories Data & Algorithms

Probability Calculation

Knowing, Understanding & Explaining

Decisions

Actions Cost & Gain

Changes

Chapter 5 Belief Updating in Bayesian Networks

Belief = Probability

Belief updating = Probability calculating based on a BN

(model, parameters and/or evidences)

Linear Model BN

Logistic Model

exxxy 3322110

3322110

1),|1( 3,21 xxx

exxxyP

X1 X2 X3 e

Conditional ProbabilityP(Y| X1,X2,X3)

Marginal ProbabilityP(Y) =∑[-Y] φ

Marginal Probability Calculation in BN

I. Simplification (5.5)

II. Marginalization (5.2),(5.3),(5.4),(5.6)

III. Simulation (5.7)

I. Simplifications

Graph-theoretic Representation

Definitions, Propositions & Theorems

Barren Nodes

DA B C

d-separation

By excluding the non-informative nodes (white nodes)

II. Marginalization

Calculating sums of products of potentials by eliminating variables repeatedly

Marginal Probabilities

BA1 A2 P(B)

B1 p1 p2p1+p2

B2 p3 p4p3+p4

P(A)p1+p3

Joint Probabilities

An Example of Marginalization/Elimination

BN parameters (potentials) :

φ1=P (A1) , φ2=P (A2|A1) , φ3=P (A3|A1), φ4=P (A4|A2)

φ5=P (A5|A2, A3), φ6=P (A6|A3)

P(A4)=?

A4 A5 A6

6532165321

),(),,(

),(),(),()(

3633253

13324412211

,,,,654321

AAAAAAAAAA

AAAAAAA

Distributive Law

Marginalization/Elimination Order

),(),(),()(

)(),(),(),(),()(

)(),,(),(),(),()(

),(),,(),(),(),()(

41'211

21'324412211

'513324412211

3'6325313324412211

363325313324412211

AAAAAAA

AAAAAAAAAA

AAAAAAAAAAA

AAAAAAAAAAAA

A4 A5 A6

Variable Elimination Order

)( 412356 APAAAAA

Marginalization/Elimination

Domain: a set of variables in BN

Potential: a real-valued probabilistic table over a domain

φ1=P (A1) , φ2=P (A2|A1) , φ3=P (A3|A1), φ4=P (A4|A2)

φ5=P (A5|A2, A3), φ6=P (A6|A3)

A4 A5 A6

Definition 5.1 (Elimination)Let Фbe a set of potentials, and let X be a variable. X is eliminated from Ф by:

1.Remove all potentials in Ф with X in their domains. Call the removed set ФX

X= A3 => ФX=(φ3, φ5, φ6 ), Ф=(φ1, φ2, φ4 )

2.Calculate φ-X = ∑x ΠФX = ∑A3 φ3φ5φ6

3.Add φ-X to Ф. Call the result set Ф-X =(φ1, φ2, φ4 , φ-X )

P(Y) is calculated by repeatedly eliminating the variables except Y

Question : how to find an efficient/optimal elimination order?

Domain Graphs

BN graph

6 domains

φ1 (A1) , φ2 (A2,A1) ,

φ3 (A3,A1), φ4 (A4,A2)

φ5 (A5,A2,A3), φ6(A6,A3)

A4 A5 A6

Domain graph

6 domains

φ1 (A1) , φ2 (A2,A1) ,

φ3 (A3,A1), φ4 (A4,A2)

φ5 (A5,A2,A3), φ6(A6,A3)

A4 A5 A6

Perfect Elimination Sequence

Fill-ins (red links)

Perfect Elimination Sequence

An elimination sequence without introducing fill-ins.

A6, A5, A3, A1, A2 down to A4 => P(A4)

A5, A6, A3, A1, A2 down to A4 => P(A4)

A1, A5, A6, A3, A2 down to A4 => P(A4)

A4 A5 A6

Domain Set of Elimination Sequence

The domain set of an elimination sequence is the set of domains of potentials produced during the elimination where potentials that are subsets of other potentials are removed.

For the sequence

A6, A5, A3, A1, A2 down to A4 => P(A4)

the set of domains is

{(A6,A3),(A2,A3,A5),(A1,A2,A3), (A1,A2),(A2,A4)}

Domain set reflects the complexity of an elimination sequence.

Question: how to find the smallest domain set ?

Set of Cliques

All perfect elimination sequences produce the same the domain set, namely the set of cliques of the domain graph.

all the sequences

A6, A5, A3, A1, A2 down to A4

produce the domain set

{(A6,A3),(A2,A3,A5),(A1,A2,A3), (A1,A2),(A2,A4)}

which contains 5 domains / cliques

Any perfect elimination sequence is optimal.

Cliques are a set of domains produce by perfect elimination sequences.

Clique set is the optimal set of domains.

Question: how to determine the set of cliques?

Triangulated Graphs

An undirected graph with a perfect elimination sequence is called a triangulated graph.

A triangulated graph A nontriangulated graph

Perfect elimination sequence No perfect elimination sequence

A5, A2, A4, A3 down to A1

Cliques in Triangulated Graphs

X : a node in domain graph

Fx : the set of neighbor nodes of X plus X

Simplicial: nodes with a complete neighbor set are called simplicial

To determine the set of cliques in a triangulated graph

1. Eliminate a simplicial node X. Fx is a clique candidate.

2. If Fx does not include all remaining nodes, go to 1.

3. Prune the set of cliques candidates by removing sets that are subsets of other clique candidates.

4. The resulting set is the set of cliques.

Question: given a set of cliques, how to determine the perfect elimination order?

Join Tree

An organized tree of cliques, in which all nodes on the path between V and W contain the intersection of V and W.

ABCDV1

CGHJV5

BCDEV10

BCDGV1

DEFIV3

A domain graph

Cliques (V) and Separators (S)

A join tree

Elimination sequence

A,F,I,H,J,G,B,C,D down to E

Not a join tree

Propagation Junction Trees

A junction tree is a join tree with the following structure:

1. Each potential is attached to a clique containing the domain of this potential (cliques)

2. Each link has the appropriate separator attached (separable)

3. Each separator contains two “mailboxes”, one for each direction (mutual communication)

φ1,φ2,φ3

V4: A1, A2, A3

V6: A2, A4φ5

V2: A2, A3, A5

V1: A3, A6

↑ ↓S4:A2

↑ ↓S2:A2,A3

↑ ↓S1:A3

Collect evidence to V6

distribute evidence from V6

Junction trees provide a general framework for finding optimal elimination sequence for triangulated graphs.

Question: what if a graph is non-triangulated?

Triangulations

Convert a non-triangulated graph into a triangulated one by adding new link(s)

BN non-triangulated graph triangulated graph

Optimal triangulation? Minimal fill-in size?

Heuristic approach: eliminate repeatedly a smplicial node, and if this is not possible, eliminate a node X with minimal size of Fx.

III. Stochastic Simulations

Forward Sampling

1. P(A) => A

2. P(B|A)=>B, P(C|A)=>C

3. P(D|B)=>D

4. P(E|C,D)=>E

5. Repeat steps 1~4

Gibbs Sampling

Evidence: B=n, E=n; P(B=n,E=n) is rare

P(A)=?

P(C| B=n,E=n, A=a0, D=d0) => c1

P(D| B=n,E=n,C=c1,A=a0) => d1

P(A| B=n,E=n, D=d1,C=c1) => a1

P(C| B=n,E=n, A=a1, D=d1) => c2

. discard

P(C| B=n,E=n, A=at-1, D=dt-1) => ct

. collect

1 chapter 5 belief updating in bayesian networks bayesian networks and decision graphs finn v....

b e d f slide

b c f e e e d

b c f e d

b c f e g e d

bn potential

b c f e e dseparation

bn decision tree

d2d2 axc

Documents

learning bayesian networks in r · 2013-07-10 · bayesian...

learning bayesian networks

probabilistic reasoning bayesian belief networks...

bayesian networks darwiche

bayesian learning of bayesian networks with informative...

dynamic bayesian networks

bayesian belief networks compound bayesian decision theory

bayesian network modelling · bayesian networks in genetics...

bayesian learning and learning bayesian networks

sampling bayesian networks

learning bayesian networks and causal...

overview on bayesian networks applications for ... · pdf...

bayesian optimization with robust bayesian neural networks

bayesian networks - intro -

bayesian networks, introduction

bayesian networks

bayesian networks. graphical models bayesian networks...

bayesian belief networks - linköping...

bayesian learning and learning bayesian networks

06 bayesian networks