lecture 4: variable elimination - github pages · lecture 4: variable elimination theo rekatsinas...

CS839: Probabilistic Graphical Models

Lecture 4: Variable Elimination

Theo Rekatsinas

Recap: P-maps, BNs, MRFs

Section 1

• A DAG G is a perfect map (P-map) for a distribution P if I(P) = I(G)

• There are independence assumptions that can be captured by a BN and not by an MRF, and vice versa

Partially Directed Acyclic Graphs

• A.k.a. chain graphs
• Nodes can be partitioned into disjoint chain components
• An edge within the same chain component must be undirected
• An edge between two nodes in different chain components must be directed

Probabilistic Inference and Learning

• We discussed compact representations of probability distributions: Graphical Models (GMs)
• A GM (fully parameterized) M describes a unique probability distribution P
• Tasks we can perform with a GM:
  • Task 1 (Inference): What is the probability P_M(X | Y)? Y: evidence
  • Task 2 (Learning): How do we estimate a plausible model M from data D?

• Learning refers to obtaining a point estimate of M
• Bayesians are looking for P(M | D), which is actually an inference problem.
• Missing data: to compute a point estimate of M we need to perform inference to impute the missing data.

Likelihood

• Most queries involve evidence
• Evidence e is an assignment of values to a set E of variables in the domain
• Without loss of generality E = {X_{k+1}, …, X_n}

• Example: compute the probability of evidence e

• A.k.a. compute the likelihood of e

$$P(e) = \sum_{x_1} \cdots \sum_{x_k} P(x_1, \ldots, x_k, e)$$

Conditional Probability

• Often we are interested in the conditional probability distribution of a variable given the evidence:

$$P(X \mid e) = \frac{P(X, e)}{P(e)} = \frac{P(X, e)}{\sum_{x} P(X = x, e)}$$

• A.k.a. a posteriori belief in X, given evidence e
• Most of the time we only care about a subset Y of all domain variables X = {Y, Z} and do not care about the remaining Z:

$$P(Y \mid e) = \sum_{z} P(Y, Z = z \mid e)$$

• The process of summing out “do not care” variables is called marginalization, and the resulting probability is called a marginal probability
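The two formulas for P(X | e) and P(Y | e) can be checked on a small table. The following is a minimal sketch in plain Python; the joint distribution and the helper name `posterior_y` are assumptions made up for illustration and do not come from the lecture.

```python
from collections import defaultdict

# Hypothetical joint P(Y, Z, E) over three binary variables, keyed by (y, z, e).
# The numbers are illustrative only; they sum to 1.
joint = {
    (0, 0, 0): 0.10, (0, 0, 1): 0.05,
    (0, 1, 0): 0.20, (0, 1, 1): 0.15,
    (1, 0, 0): 0.05, (1, 0, 1): 0.20,
    (1, 1, 0): 0.15, (1, 1, 1): 0.10,
}

def posterior_y(joint, e):
    """Return P(Y | E = e) by summing out Z and dividing by P(e)."""
    unnorm = defaultdict(float)
    for (y, z, e_val), p in joint.items():
        if e_val == e:
            unnorm[y] += p          # marginalization: sum over z of P(y, z, e)
    p_e = sum(unnorm.values())      # P(e) = sum over y of P(y, e)
    return {y: p / p_e for y, p in unnorm.items()}

post = posterior_y(joint, e=1)      # P(Y = 0 | e) = 0.4, P(Y = 1 | e) = 0.6
```

Note that the "do not care" variable Z never appears in the result: it is summed out before normalizing.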

Applications of a posteriori belief

• Prediction: what is the probability of an outcome given the starting condition?
  • The query node is a descendant of the evidence

• Diagnosis: what is the probability of a disease/fault given symptoms?
  • The query node is an ancestor of the evidence
• Learning under partial observations
  • Expectation maximization algorithm (later in class)

MPA: Most Probable Assignment

• What is the most probable joint assignment (MPA) for some variables of interest?

• We are again given some evidence e and we ignore (the values of) “do not care” variables z

$$\mathrm{MPA}(Y \mid e) = \arg\max_{y} P(y \mid e) = \arg\max_{y} \sum_{z} P(y, z \mid e)$$

• A.k.a. maximum a posteriori assignment of y (MAP inference)

Applications of MPA

• Classification: find the most likely label, given evidence
• Explanation: what is the most likely scenario, given the observed evidence?

• The MAP state of a variable depends on the set of variables that are jointly queried

• Example:
  • MAP of Y1
  • MAP of (Y1, Y2)

Y1   Y2   P(Y1, Y2)
0    0    0.05
0    1    0.35
1    0    0.3
1    1    0.3
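To see that the MAP state depends on which variables are queried jointly, the table above can be evaluated directly. This is a small illustrative sketch; the variable names are chosen here and are not part of the lecture.

```python
# Joint distribution P(Y1, Y2) from the table above.
joint = {(0, 0): 0.05, (0, 1): 0.35, (1, 0): 0.30, (1, 1): 0.30}

# MAP of Y1 alone: marginalize Y2 first, then take the argmax.
p_y1 = {}
for (y1, y2), p in joint.items():
    p_y1[y1] = p_y1.get(y1, 0.0) + p
map_y1 = max(p_y1, key=p_y1.get)

# MAP of (Y1, Y2) jointly: argmax over the full table.
map_y1_y2 = max(joint, key=joint.get)

print(map_y1)      # 1, since P(Y1 = 1) = 0.6 > P(Y1 = 0) = 0.4
print(map_y1_y2)   # (0, 1), with probability 0.35
```

The single-variable MAP picks Y1 = 1, while the joint MAP assigns Y1 = 0, so the joint MAP cannot be read off from per-variable MAP states.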

Complexity of Inference

• Thm: Computing P(X = x | e) in a GM is NP-hard

• However, for many practical instances of GMs we can solve inference efficiently (in polynomial time)
• NP-hardness implies that, unless P = NP, there is no general procedure that works efficiently for arbitrary GMs
• For certain GM families we have provably efficient exact inference procedures.

Approaches to Inference

• Exact inference
  • Variable elimination
  • Message-passing algorithm (sum-product, belief propagation)
  • Junction tree algorithms

• Approximate inference techniques
  • Stochastic simulation / sampling methods
  • Markov chain Monte Carlo methods
  • Variational algorithms

Operations: Marginalization and Elimination

• Consider the following GM (a chain): A → B → C → D → E

• What is the likelihood that E is true?

• Query: P(e)

$$P(e) = \sum_d \sum_c \sum_b \sum_a P(a, b, c, d, e)$$


A naïve summation needs to enumerate over an exponential number of terms.

• Chain decomposition:

$$P(e) = \sum_d \sum_c \sum_b \sum_a P(a)\, P(b \mid a)\, P(c \mid b)\, P(d \mid c)\, P(e \mid d)$$

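Elimination on Chains

• Reorder terms

$$
\begin{aligned}
P(e) &= \sum_d \sum_c \sum_b \sum_a P(a)\, P(b \mid a)\, P(c \mid b)\, P(d \mid c)\, P(e \mid d) \\
     &= \sum_d \sum_c \sum_b P(c \mid b)\, P(d \mid c)\, P(e \mid d) \sum_a P(a)\, P(b \mid a)
\end{aligned}
$$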

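• Perform the innermost summation

$$
\begin{aligned}
P(e) &= \sum_d \sum_c \sum_b P(c \mid b)\, P(d \mid c)\, P(e \mid d) \sum_a P(a)\, P(b \mid a) \\
     &= \sum_d \sum_c \sum_b P(c \mid b)\, P(d \mid c)\, P(e \mid d)\, p(b)
\end{aligned}
$$

• This summation eliminates one variable from our summation argument at a local cost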

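• Perform the next innermost summation

$$
\begin{aligned}
P(e) &= \sum_d \sum_c \sum_b P(c \mid b)\, P(d \mid c)\, P(e \mid d)\, p(b) \\
     &= \sum_d \sum_c P(d \mid c)\, P(e \mid d) \sum_b P(c \mid b)\, p(b) \\
     &= \sum_d \sum_c P(d \mid c)\, P(e \mid d)\, p(c)
\end{aligned}
$$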

• Eliminate nodes one-by-one all the way to the end

$$P(e) = \sum_d P(e \mid d)\, p(d)$$

• Complexity (for a chain of k variables, each with n values):
  • Each step costs O(|Dom(X_i)| · |Dom(X_{i+1})|) operations, for a total of O(k n^2)
  • Compare with the naïve O(n^k)
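The chain elimination can be sketched in a few lines of plain Python and compared against the naive sum. The CPT values and the function names `eliminate_chain` and `brute_force` are assumptions introduced here for illustration only.

```python
from itertools import product

# Hypothetical CPTs for a binary chain A -> B -> C -> D -> E (illustrative numbers).
p_a = [0.6, 0.4]                      # P(a)
cpts = [                              # each table is indexed as cpt[parent][child]
    [[0.7, 0.3], [0.2, 0.8]],         # P(b | a)
    [[0.9, 0.1], [0.4, 0.6]],         # P(c | b)
    [[0.5, 0.5], [0.1, 0.9]],         # P(d | c)
    [[0.8, 0.2], [0.3, 0.7]],         # P(e | d)
]

def eliminate_chain(prior, cpts):
    """Push sums inward: each step costs O(n^2) for domain size n."""
    msg = prior                       # starts as p(a), then p(b), p(c), ...
    for cpt in cpts:
        n_child = len(cpt[0])
        # new_msg(child) = sum over parent of cpt[parent][child] * msg(parent)
        msg = [sum(cpt[u][v] * msg[u] for u in range(len(msg)))
               for v in range(n_child)]
    return msg                        # marginal of the last node in the chain

def brute_force(prior, cpts):
    """Naive sum over all joint assignments: O(n^k) terms."""
    n = len(prior)
    out = [0.0] * n
    for a, b, c, d, e in product(range(n), repeat=5):
        out[e] += (prior[a] * cpts[0][a][b] * cpts[1][b][c]
                   * cpts[2][c][d] * cpts[3][d][e])
    return out

p_e = eliminate_chain(p_a, cpts)      # P(E = 0) = 0.48, P(E = 1) = 0.52
```

Both functions return the same marginal; the elimination version touches 4 tables of 4 entries each, while the brute-force version enumerates all 32 joint assignments.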

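Undirected Chains

• Rearranging terms for the undirected chain A — B — C — D — E:

$$
\begin{aligned}
P(e) &= \sum_d \sum_c \sum_b \sum_a \frac{1}{Z}\, \phi(b, a)\, \phi(c, b)\, \phi(d, c)\, \phi(e, d) \\
     &= \frac{1}{Z} \sum_d \sum_c \sum_b \phi(c, b)\, \phi(d, c)\, \phi(e, d) \sum_a \phi(b, a) \\
     &= \ldots
\end{aligned}
$$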
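The remaining steps mirror the directed case. As a sketch, one way to continue is to name the intermediate terms m_a(b), m_b(c), m_c(d); these names are introduced here for illustration and are not from the slides:

$$
\begin{aligned}
m_a(b) &= \sum_a \phi(b, a), \\
m_b(c) &= \sum_b \phi(c, b)\, m_a(b), \\
m_c(d) &= \sum_c \phi(d, c)\, m_b(c), \\
P(e) &= \frac{1}{Z} \sum_d \phi(e, d)\, m_c(d), \qquad Z = \sum_e \sum_d \phi(e, d)\, m_c(d).
\end{aligned}
$$

Unlike the directed chain, the intermediate terms are not marginal probabilities, and the partition function Z has to be obtained by one extra summation over e.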

Conditional Random Fields

The Sum-Product Operation

• In general, we want to compute the value of an expression of the form:

$$\sum_{z} \prod_{\phi \in F} \phi$$

• where F is a set of factors

• We call this task the sum-product inference task.

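Inference via Variable Elimination

• General idea:
  • Write the query in the form

$$P(X_1, e) = \sum_{x_n} \cdots \sum_{x_3} \sum_{x_2} \prod_i P(x_i \mid \mathrm{pa}_i)$$

  • Iteratively
    • Move all irrelevant terms outside of the innermost sum
    • Perform the innermost sum, getting a new term
    • Insert the new term into the product

• Back to the original query

$$P(X_1 \mid e) = \frac{\phi(X_1, e)}{\sum_{x_1} \phi(X_1 = x_1, e)}$$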

Outcome of elimination

• Let X be some set of variables
• Let F be a set of factors such that for each $\phi \in F$, $\mathrm{Scope}[\phi] \subseteq X$
• Let $Y \subset X$ be a set of query variables and Z = X − Y be the variables to be eliminated

• The result of eliminating Z is a factor

$$\tau(Y) = \sum_{z} \prod_{\phi \in F} \phi$$

• This does not necessarily correspond to any probability or conditional probability

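Evidence and Sum-Product

• Evidence potential:

$$\delta(E_i, e_i) = \begin{cases} 1 & \text{if } E_i = e_i \\ 0 & \text{otherwise} \end{cases}$$

• Total evidence potential:

$$\delta(E, e) = \prod_{i \in I_E} \delta(E_i, e_i)$$

• Introducing evidence (restricted factors), where e' ranges over the values of the evidence variables E:

$$\tau(Y, e) = \sum_{z} \sum_{e'} \Big( \prod_{\phi \in F} \phi \Big) \cdot \delta(E = e', e)$$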
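Multiplying by an evidence potential and summing over the evidence variable gives the same result as simply restricting the factor to the observed value. The sketch below illustrates this for a single factor; the factor values and the helper names `delta`, `sum_with_evidence`, and `restrict` are assumptions made for this example.

```python
# A factor phi(E, F) over two binary variables, stored as {(e, f): value}.
# Values are illustrative only.
phi = {(0, 0): 0.9, (0, 1): 0.4, (1, 0): 0.1, (1, 1): 0.6}

def delta(value, observed):
    """Evidence potential: 1 if the variable takes the observed value, else 0."""
    return 1.0 if value == observed else 0.0

def sum_with_evidence(phi, observed_e):
    """tau(F, e) = sum over e' of phi(e', F) * delta(e', e)."""
    tau = {}
    for (e_val, f_val), v in phi.items():
        tau[f_val] = tau.get(f_val, 0.0) + v * delta(e_val, observed_e)
    return tau

def restrict(phi, observed_e):
    """Restricted factor: keep only the entries consistent with E = e."""
    return {f_val: v for (e_val, f_val), v in phi.items() if e_val == observed_e}

tau_sum = sum_with_evidence(phi, 1)
tau_restricted = restrict(phi, 1)     # both are {0: 0.1, 1: 0.6}
```

In practice the restricted form is cheaper, since it never touches the entries that the indicator would zero out.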

Variable Elimination Algorithm

Procedure Elimination (
  G, // the GM
  E, // evidence
  Z, // set of variables to be eliminated
  X  // query variable(s)
)

1. Initialize(G, Z)
2. Evidence(E)
3. Sum-Product-Variable-Elimination(F, Z, $\prec$)
4. Normalization(F)

Procedure Initialize(G, Z)
1. Let $Z_1, \ldots, Z_k$ be an ordering of Z such that $Z_i \prec Z_j$ iff $i < j$
2. Initialize F with the full set of factors

Procedure Evidence(E)
1. For each $i \in I_E$, $F = F \cup \{\delta(E_i, e_i)\}$

Procedure Sum-Product-Variable-Elimination(F, Z, $\prec$)

1. For i = 1, …, k
     $F \leftarrow$ Sum-Product-Eliminate-Var(F, $Z_i$)
2. $\phi^* \leftarrow \prod_{\phi \in F} \phi$
3. return $\phi^*$
4. Normalization($\phi^*$)

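Procedure Sum-Product-Eliminate-Var(F, Z)

1. $F' \leftarrow \{\phi \in F : Z \in \mathrm{Scope}[\phi]\}$
2. $F'' \leftarrow F - F'$
3. $\psi \leftarrow \prod_{\phi \in F'} \phi$
4. $\tau \leftarrow \sum_{Z} \psi$
5. return $F'' \cup \{\tau\}$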

Procedure Normalization($\phi^*$)

$$P(X \mid E) = \frac{\phi^*(X)}{\sum_{x} \phi^*(x)}$$
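The procedures above can be sketched compactly in Python. This is a minimal illustration under stated assumptions, not the lecture's implementation: a factor is represented as a `(scope, table)` pair with the table keyed by value tuples, evidence is passed in as an indicator factor, and all function names are chosen here.

```python
from itertools import product

def multiply(f1, f2, dom):
    """Factor product; a factor is (scope, table) with table keyed by value tuples."""
    (s1, t1), (s2, t2) = f1, f2
    scope = s1 + tuple(v for v in s2 if v not in s1)
    table = {}
    for vals in product(*(dom[v] for v in scope)):
        a = dict(zip(scope, vals))
        table[vals] = (t1[tuple(a[v] for v in s1)]
                       * t2[tuple(a[v] for v in s2)])
    return scope, table

def sum_out(f, var):
    """Marginalize var out of factor f."""
    scope, table = f
    i = scope.index(var)
    new_table = {}
    for vals, p in table.items():
        key = vals[:i] + vals[i + 1:]
        new_table[key] = new_table.get(key, 0.0) + p
    return scope[:i] + scope[i + 1:], new_table

def eliminate_var(factors, var, dom):
    """Sum-Product-Eliminate-Var: multiply factors mentioning var, then sum it out."""
    used = [f for f in factors if var in f[0]]
    rest = [f for f in factors if var not in f[0]]
    if not used:
        return rest
    psi = used[0]
    for f in used[1:]:
        psi = multiply(psi, f, dom)
    return rest + [sum_out(psi, var)]

def variable_elimination(factors, order, dom):
    """Eliminate variables in the given order, then multiply and normalize."""
    for var in order:
        factors = eliminate_var(factors, var, dom)
    result = factors[0]
    for f in factors[1:]:
        result = multiply(result, f, dom)
    scope, table = result
    z = sum(table.values())
    return scope, {k: v / z for k, v in table.items()}

# Tiny hypothetical model A -> B with evidence B = 1 as an indicator factor.
dom = {"A": (0, 1), "B": (0, 1)}
p_a = (("A",), {(0,): 0.6, (1,): 0.4})
p_b_given_a = (("A", "B"), {(0, 0): 0.7, (0, 1): 0.3, (1, 0): 0.2, (1, 1): 0.8})
evidence_b = (("B",), {(0,): 0.0, (1,): 1.0})
scope, posterior = variable_elimination([p_a, p_b_given_a, evidence_b], ["B"], dom)
# posterior is P(A | B = 1): {(0,): 0.36, (1,): 0.64}
```

Dictionary-based tables keep the sketch short; a practical implementation would typically use dense arrays and restrict factors to the evidence before multiplying.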

Variable Elimination Example

Query: P(A | h)
• Need to eliminate: B, C, D, E, F, G, H
Initial factors: P(A) P(B) P(C | B) P(D | A) P(E | C, D) P(F | A) P(G | E) P(H | E, F)

Step 1:
• Conditioning on evidence (fix H to h):

$$p_H(E, F) = P(H = h \mid E, F)$$

• Same as a marginalization step:

$$p_H(E, F) = \sum_{h'} P(H = h' \mid E, F)\, \delta(h' = h)$$

=> P(A) P(B) P(C | B) P(D | A) P(E | C, D) P(F | A) P(G | E) p_H(E, F)

Step 2: Eliminate G

$$p_G(E) = \sum_{g} P(G = g \mid E) = 1$$

Since p_G(E) = 1, the factor P(G | E) simply drops out:

=> P(A) P(B) P(C | B) P(D | A) P(E | C, D) P(F | A) p_H(E, F)

Step 3: Eliminate F

$$p_F(A, E) = \sum_{f} P(F = f \mid A)\, p_H(E, F = f)$$

=> P(A) P(B) P(C | B) P(D | A) P(E | C, D) p_F(A, E)

Step 4: Eliminate E

$$p_E(A, C, D) = \sum_{e} P(E = e \mid C, D)\, p_F(A, E = e)$$

=> P(A) P(B) P(C | B) P(D | A) p_E(A, C, D)

Step 5: Eliminate D

$$p_D(A, C) = \sum_{d} P(D = d \mid A)\, p_E(A, C, D = d)$$

=> P(A) P(B) P(C | B) p_D(A, C)

Step 6: Eliminate C

$$p_C(A, B) = \sum_{c} P(C = c \mid B)\, p_D(A, C = c)$$

=> P(A) P(B) p_C(A, B)

Step 7: Eliminate B

$$p_B(A) = \sum_{b} P(B = b)\, p_C(A, B = b)$$

=> P(A) p_B(A)

Step 8: Wrap-up

$$P(A, h) = P(A)\, p_B(A), \qquad P(h) = \sum_{a} P(A = a)\, p_B(A = a)$$

$$P(A \mid h) = \frac{P(A, h)}{P(h)}$$
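The eight steps can be traced numerically. The sketch below assumes all variables are binary and uses made-up CPT values (the lecture does not give any); the helper `bern` and the `P_*` functions are hypothetical names. A brute-force enumeration is included only as a sanity check.

```python
from itertools import product

B2 = (0, 1)  # all variables are assumed binary in this sketch

def bern(p1, x):
    """P(X = x) for a binary X with P(X = 1) = p1."""
    return p1 if x == 1 else 1.0 - p1

# Hypothetical CPTs (illustrative numbers only), each built from P(child = 1 | parents).
def P_A(a): return bern(0.3, a)
def P_B(b): return bern(0.6, b)
def P_C(c, b): return bern((0.2, 0.7)[b], c)
def P_D(d, a): return bern((0.1, 0.5)[a], d)
def P_E(e, c, d): return bern(((0.1, 0.4), (0.6, 0.9))[c][d], e)
def P_F(f, a): return bern((0.3, 0.8)[a], f)
def P_G(g, e): return bern((0.2, 0.6)[e], g)
def P_H(h, e, f): return bern(((0.05, 0.5), (0.4, 0.95))[e][f], h)

h = 1  # observed evidence H = h

# Step 1: condition on H = h.
pH = {(e, f): P_H(h, e, f) for e, f in product(B2, B2)}
# Step 2: eliminate G; the sum is 1 for every e, so the factor can be dropped.
pG = {e: sum(P_G(g, e) for g in B2) for e in B2}
# Step 3: eliminate F.
pF = {(a, e): sum(P_F(f, a) * pH[e, f] for f in B2) for a, e in product(B2, B2)}
# Step 4: eliminate E.
pE = {(a, c, d): sum(P_E(e, c, d) * pF[a, e] for e in B2)
      for a, c, d in product(B2, B2, B2)}
# Step 5: eliminate D.
pD = {(a, c): sum(P_D(d, a) * pE[a, c, d] for d in B2) for a, c in product(B2, B2)}
# Step 6: eliminate C.
pC = {(a, b): sum(P_C(c, b) * pD[a, c] for c in B2) for a, b in product(B2, B2)}
# Step 7: eliminate B (B has no parents, so its factor is P(B)).
pB = {a: sum(P_B(b) * pC[a, b] for b in B2) for a in B2}
# Step 8: wrap-up.
p_A_h = {a: P_A(a) * pB[a] for a in B2}         # P(A, h)
p_h = sum(p_A_h.values())                       # P(h)
posterior = {a: p_A_h[a] / p_h for a in B2}     # P(A | h)

def brute_force_posterior():
    """Check against naive enumeration of the full joint."""
    unnorm = {0: 0.0, 1: 0.0}
    for a, b, c, d, e, f, g in product(B2, repeat=7):
        unnorm[a] += (P_A(a) * P_B(b) * P_C(c, b) * P_D(d, a) * P_E(e, c, d)
                      * P_F(f, a) * P_G(g, e) * P_H(h, e, f))
    z = sum(unnorm.values())
    return {a: v / z for a, v in unnorm.items()}
```

The largest intermediate factor is p_E(A, C, D) with three variables, which is what drives the cost of this particular elimination order.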

Complexity of variable elimination

• Suppose in one elimination step we compute

$$\tau(y_1, \ldots, y_k) = \sum_{x} \psi(x, y_1, \ldots, y_k)$$

$$\psi(x, y_1, \ldots, y_k) = \prod_{i=1}^{k} p_i(x, y_{C_i})$$

• This requires $k \cdot |Val(X)| \cdot \prod_i |Val(Y_{C_i})|$ multiplications
  • For each value of $x, y_1, \ldots, y_k$ we do k multiplications

• and $|Val(X)| \cdot \prod_i |Val(Y_{C_i})|$ additions
  • For each value of $y_1, \ldots, y_k$ we do |Val(X)| additions


Complexity is exponential in the number of variables in the intermediate factor.
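The operation counts can be turned into a small calculator. This is a sketch that follows the two formulas literally; the function name `elimination_step_cost` and its argument layout are assumptions made for illustration.

```python
from math import prod

def elimination_step_cost(card_x, scope_cards):
    """Operation counts for one elimination step, per the formulas above.

    card_x      -- |Val(X)|, domain size of the variable being eliminated
    scope_cards -- one entry per factor p_i: |Val(Y_{C_i})|, the number of
                   joint values of the other variables in that factor
    """
    k = len(scope_cards)                  # number of factors multiplied
    entries = card_x * prod(scope_cards)
    return k * entries, entries           # (multiplications, additions)

# Hypothetical usage: eliminating a binary E from P(E | C, D) and p_F(A, E),
# all variables binary, so |Val(Y_{C_1})| = 4 and |Val(Y_{C_2})| = 2.
mults, adds = elimination_step_cost(2, [4, 2])
print(mults, adds)   # 32 16
```

Each additional binary variable in the intermediate factor doubles both counts, which is the exponential dependence stated above.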

Summary

• The simple Eliminate algorithm captures the key algorithmic operation underlying probabilistic inference:
  • We take a sum over a product of potential functions

• The complexity of the algorithm depends on the size of the summands that appear in the sequence of summation operations.

• The computational complexity of the Eliminate algorithm can be reduced to purely graph-theoretic considerations: treewidth (next class).

• By reasoning about the treewidth we can design improved inference algorithms.
