carnegie mellon parallel splash belief propagation joseph e. gonzalez yucheng low carlos guestrin...
TRANSCRIPT
![Page 1: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/1.jpg)
Carnegie Mellon
Parallel Splash Belief Propagation
Joseph E. GonzalezYucheng Low
Carlos GuestrinDavid O’Hallaron
Computers which worked on this project:
BigBro1, BigBro2, BigBro3, BigBro4, BigBro5, BigBro6, BiggerBro, BigBroFSTashish01, Tashi02, Tashi03, Tashi04, Tashi05, Tashi06, …, Tashi30,
parallel, gs6167, koobcam (helped with writing)
![Page 2: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/2.jpg)
Why talk about parallelism now?
Change in the Foundation of ML
Past Sequentia
l Perfo
rmance
Future SequentialPerformance
Log
(Speed
in G
Hz)
2
Future Parallel PerformanceFuture Parallel Performance
Release Date
![Page 3: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/3.jpg)
3
Why is this a Problem?
Sophistication
Para
llelis
m
Nearest Neighbor[Google et al.]
Basic Regression[Cheng et al.]
Graphical Models[Mendiburu et al.]
Support Vector Machines[Graf et al.]
Want to behere
![Page 4: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/4.jpg)
Why is it hard?
4
Algorithmic Efficiency
Parallel Efficiency
ImplementationEfficiency
Eliminate wastedcomputation
Expose independentcomputation
Map computation toreal hardware
![Page 5: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/5.jpg)
The Key Insight
5
![Page 6: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/6.jpg)
6
The Result
Nearest Neighbor[Google et al.]
Basic Regression[Cheng et al.]
Support Vector Machines[Graf et al.]
GoalSplash Belief Propagation
Graphical Models[Gonzalez et al.]
Sophistication
Para
llelis
m
Graphical Models[Mendiburu et al.]
![Page 7: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/7.jpg)
OutlineOverviewGraphical Models: Statistical StructureInference: Computational Structureτε - Approximate Messages: Statistical StructureParallel Splash
Dynamic SchedulingPartitioning
Experimental ResultsConclusions
7
![Page 8: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/8.jpg)
Graphical Models and Parallelism
Graphical models provide a common language for general purpose parallel algorithms in machine learning
A parallel inference algorithm would improve:
8
Protein Structure Prediction
Inference is a key step in Learning Graphical Models
Computer VisionMovie Recommendation
![Page 9: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/9.jpg)
Overview of Graphical Models
Graphical represent of local statistical dependencies
9
Observed Random Variables
Late
nt
Pix
el V
ari
able
sLa
tent
Pix
el V
ari
able
s Loca
l Dependencie
sLo
cal D
ependencie
s
Noisy Picture
InferenceInferenceWhat is the probability that this pixel is black?
What is the probability that this pixel is black?
“True” Pixel Values
Continuity Assumptions
![Page 10: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/10.jpg)
Synthetic Noisy Image Problem
Overlapping Gaussian noiseAssess convergence and accuracy
Noisy Image Predicted Image
![Page 11: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/11.jpg)
Protein Side-Chain Prediction
Model side-chain interactions as a graphical model
11
Protein Backbone
What is the most likely orientation?What is the most likely orientation?
InferenceInference
![Page 12: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/12.jpg)
Protein Side-Chain Prediction
276 Protein Networks:Approximately:
700 Variables1600 Factors70 Discrete orientations
Strong Factors
12
Protein B
ackbon
e
![Page 13: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/13.jpg)
Smokes(A) Cancer(A)Smokes(A) Cancer(A) Smokes(B) Cancer(B)Smokes(B) Cancer(B)
Friends(A,B) And Smokes(A) Smokes(B)
Friends(A,B) And Smokes(A) Smokes(B)
Markov Logic NetworksRepresent Logic as a graphical model
13
Cancer(A)Cancer(A) Cancer(B)Cancer(B)
Smokes(A)Smokes(A) Smokes(B)Smokes(B)
Friends(A,B)Friends(A,B)A: AliceB: Bob
True/False?
Pr(Cancer(B) = True | Smokes(A) = True & Friends(A,B) = True) = ?
Pr(Cancer(B) = True | Smokes(A) = True & Friends(A,B) = True) = ?
InferenceInference
![Page 14: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/14.jpg)
Markov Logic Networks
14
Smokes(A) Cancer(A)Smokes(A) Cancer(A) Smokes(B) Cancer(B)Smokes(B) Cancer(B)
Friends(A,B) And Smokes(A) Smokes(B)
Friends(A,B) And Smokes(A) Smokes(B)
Cancer(A)Cancer(A)Cancer(B)Cancer(B)
Smokes(A)Smokes(A) Smokes(B)Smokes(B)
Friends(A,B)Friends(A,B)A: AliceB: Bob
True/False?UW-Systems Model8K Binary Variables406K Factors
Irregular degree distribution:
Some vertices with high degree
![Page 15: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/15.jpg)
OutlineOverviewGraphical Models: Statistical StructureInference: Computational Structureτε - Approximate Messages: Statistical StructureParallel Splash
Dynamic SchedulingPartitioning
Experimental ResultsConclusions
15
![Page 16: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/16.jpg)
The Inference Problem
NP-Hard in General Approximate Inference:
Belief Propagation
16
Smokes(A) Cancer(A)Smokes(A) Cancer(A) Smokes(B) Cancer(B)Smokes(B) Cancer(B)
Friends(A,B) And Smokes(A) Smokes(B)
Friends(A,B) And Smokes(A) Smokes(B)
Cancer(A)Cancer(A) Cancer(B)Cancer(B)
Smokes(A)Smokes(A) Smokes(B)Smokes(B)
Friends(A,B)Friends(A,B)A: AliceB: Bob
True/False?
Protein BackboneWhat is the probability that Bob Smokes given
Alice Smokes?
What is the best configuration of the protein side-chains?
What is the probabilitythat each pixel is black?
![Page 17: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/17.jpg)
Belief Propagation (BP)Iterative message passing algorithm
Naturally Parallel Algorithm
17
![Page 18: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/18.jpg)
Parallel Synchronous BPGiven the old messages all new messages can be computed in parallel:
18
NewMessages
NewMessages
OldMessages
OldMessages
CPU 2
CPU 1
CPU 3
CPU n
Map-Reduce Ready!
![Page 19: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/19.jpg)
Sequential Computational Structure
19
![Page 20: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/20.jpg)
Hidden Sequential Structure
20
![Page 21: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/21.jpg)
Hidden Sequential Structure
Running Time:
21
EvidenceEvidence
Time for a singleparallel iterationTime for a singleparallel iteration Number of IterationsNumber of Iterations
![Page 22: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/22.jpg)
Optimal Sequential Algorithm
Forward-Backward
Naturally Parallel
2n2/p
p ≤ 2n
22
RunningTime
2n
Gap
p = 1
Optimal Parallel
n
p = 2
![Page 23: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/23.jpg)
Key Computational Structure
Naturally Parallel
2n2/p
p ≤ 2n
23
RunningTime
Optimal Parallel
n
p = 2
Gap Inherent Sequential Structure
Requires Efficient Scheduling
![Page 24: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/24.jpg)
OutlineOverviewGraphical Models: Statistical StructureInference: Computational Structureτε - Approximate Messages: Statistical StructureParallel Splash
Dynamic SchedulingPartitioning
Experimental ResultsConclusions
24
![Page 25: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/25.jpg)
Parallelism by Approximation
τε represents the minimal sequential structure
25
True Messages
τε -Approximation
11 22 33 44 55 66 77 88 99 1010
1
![Page 26: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/26.jpg)
Tau-Epsilon StructureOften τε decreases quickly:
26
Markov LogicNetworks
Protein Networks
Mess
ag
e A
pp
roxim
ati
on E
rror
in L
og S
cale
![Page 27: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/27.jpg)
Running Time Lower Bound
27
Theorem: Using p processors it is not possible to obtain a τε approximation in time less than:
Theorem: Using p processors it is not possible to obtain a τε approximation in time less than:
ParallelComponent
SequentialComponent
![Page 28: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/28.jpg)
A single processor can only make k-τε +1 vertices left aware in k-iterationsA single processor can only make k-τε +1 vertices left aware in k-iterations
Consider one direction using p/2 processors (p≥2):
28
11 nn
τε τε τε τε τε τε τε
τε
…
n - τε
We must make n - τε vertices τε left-awareWe must make n - τε vertices τε left-aware
Proof: Running Time Lower Bound
![Page 29: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/29.jpg)
Optimal Parallel SchedulingProcessor 1 Processor 2 Processor 3
29
Theorem: Using p processors this algorithm achieves a τε approximation in time:
Theorem: Using p processors this algorithm achieves a τε approximation in time:
![Page 30: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/30.jpg)
Proof: Optimal Parallel Scheduling
All vertices are left-aware of the left most vertex on their processor
After exchanging messages
After next iteration:
After k parallel iterations each vertex is (k-1)(n/p) left-aware
30
![Page 31: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/31.jpg)
Proof: Optimal Parallel Scheduling
After k parallel iterations each vertex is (k-1)(n/p) left-awareSince all vertices must be made τε left aware:
Each iteration takes O(n/p) time:
31
![Page 32: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/32.jpg)
Comparing with SynchronousBP
Processor 1 Processor 2 Processor 3
32
Synchronous Schedule Optimal Schedule
Gap
![Page 33: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/33.jpg)
OutlineOverviewGraphical Models: Statistical StructureInference: Computational Structureτε - Approximate Messages: Statistical StructureParallel Splash
Dynamic SchedulingPartitioning
Experimental ResultsConclusions
33
![Page 34: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/34.jpg)
The Splash OperationGeneralize the optimal chain algorithm:
to arbitrary cyclic graphs:
~34
1) Grow a BFS Spanning tree with fixed size
2) Forward Pass computing all messages at each vertex
3) Backward Pass computing all messages at each vertex
![Page 35: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/35.jpg)
Local StateLocal State
CPU 2
Local StateLocal State
CPU 3
Local StateLocal State
CPU 1
Running Parallel Splashes
Partition the graphSchedule Splashes locallyTransmit the messages along the boundary of the partition
35
Splash SplashSplash
Key Challenges:1) How do we schedules Splashes?2) How do we partition the Graph?
![Page 36: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/36.jpg)
Local StateLocal State
Sch
ed
ulin
g
Queu
e
Where do we Splash?Assign priorities and use a scheduling queue to select roots:
SplashSplash
SplashSplash
??
?
CPU 1
How do we assign priorities?
![Page 37: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/37.jpg)
Message SchedulingResidual Belief Propagation [Elidan et al., UAI 06]:
Assign priorities based on change in inbound messages
11
37
Message
Message
Message
22
Message
Message
Message
Large ChangeLarge ChangeSmall ChangeSmall Change
Small ChangeSmall Change
Large ChangeLarge Change
Small Change:Expensive No-Op
Large Change:Informative Update
![Page 38: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/38.jpg)
Problem with Message Scheduling
Small changes in messages do not imply small changes in belief:
38
Small change inall message
Small change inall message
Large change inbelief
Large change inbelief
Message
Message
BeliefMessage
Message
![Page 39: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/39.jpg)
Problem with Message Scheduling
Large changes in a single message do not imply large changes in belief:
39
Large change ina single messageLarge change in
a single messageSmall change
in beliefSmall change
in belief
Message
BeliefMessageMessage
Message
![Page 40: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/40.jpg)
Belief Residual Scheduling
Assign priorities based on the cumulative change in belief:
1 1
+1
+rv =
MessageChange
MessageChange
40
A vertex whose belief has changed substantially
since last being updatedwill likely produce
informative new messages.
A vertex whose belief has changed substantially
since last being updatedwill likely produce
informative new messages.
![Page 41: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/41.jpg)
Message vs. Belief Scheduling
Belief Scheduling improves accuracy and convergence
41
Bett
er
![Page 42: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/42.jpg)
Splash PruningBelief residuals can be used to dynamically reshape and resize Splashes:
LowBeliefs
Residual
![Page 43: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/43.jpg)
Splash SizeUsing Splash Pruning our algorithm is able to dynamically select the optimal splash size
43
Bett
er
![Page 44: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/44.jpg)
Example
Synthetic Noisy Image
Factor Graph
Vertex Updates
ManyUpdate
s
FewUpdate
s
Algorithm identifies and focuses on hidden sequential structure
44
![Page 45: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/45.jpg)
Parallel Splash Algorithm
Partition factor graph over processorsSchedule Splashes locally using belief residualsTransmit messages on boundary
Local StateLocal State
CPU 1
SplashSplash
Local StateLocal State
CPU 2
Local StateLocal State
CPU 3
SplashSplash
Fast Reliable Network
SplashSplash
45
SchedulingQueue
SchedulingQueue
SchedulingQueue
Given a uniform partitioning of the chain graphical model, Parallel Splash will run in time:
retaining optimality.
Theorem:
![Page 46: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/46.jpg)
CPU 1 CPU 2
Partitioning ObjectiveThe partitioning of the factor graph determines:
Storage, Computation, and Communication
Goal: Balance Computation and Minimize Communication
46
EnsureBalanceEnsureBalanceComm.
costComm.
cost
![Page 47: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/47.jpg)
The Partitioning ProblemObjective:
Depends on:
NP-Hard METIS fast partitioning heuristic
Work:
Comm:
47
Minimize Communication
Minimize Communication
Ensure BalanceEnsure Balance
Update counts are not known!
![Page 48: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/48.jpg)
Unknown Update Counts Determined by belief schedulingDepends on: graph structure, factors, …Little correlation between past & future update counts
48
Noisy Image Update CountsSimple Solution:
Uninformed Cut
![Page 49: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/49.jpg)
Uniformed Cuts
Greater imbalance & lower communication cost
Update Counts Uninformed Cut Optimal Cut
49
Bett
er
Bett
er
Too Much Work
Too Little Work
![Page 50: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/50.jpg)
Over-PartitioningOver-cut graph into k*p partitions and randomly assign CPUs
Increase balanceIncrease communication cost (More Boundary)
CPU 1CPU 1 CPU 2CPU 2 CPU 2CPU 2
CPU 1CPU 1 CPU 1CPU 1 CPU 2CPU 2
CPU 1CPU 1 CPU 2CPU 2 CPU 1CPU 1
CPU 2CPU 2 CPU 1CPU 1 CPU 2CPU 2
Without Over-Partitioning
k=650
![Page 51: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/51.jpg)
Over-Partitioning ResultsProvides a simple method to trade between work balance and communication cost
51
Bett
er
Bett
er
![Page 52: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/52.jpg)
CPU UtilizationOver-partitioning improves CPU utilization:
52
![Page 53: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/53.jpg)
Parallel Splash Algorithm
Over-Partition factor graph Randomly assign pieces to processors
Schedule Splashes locally using belief residualsTransmit messages on boundary
Local StateLocal State
CPU 1
SplashSplash
Local StateLocal State
CPU 2
Local StateLocal State
CPU 3
SplashSplash
Fast Reliable Network
SplashSplash
53
SchedulingQueue
SchedulingQueue
SchedulingQueue
![Page 54: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/54.jpg)
OutlineOverviewGraphical Models: Statistical StructureInference: Computational Structureτε - Approximate Messages: Statistical StructureParallel Splash
Dynamic SchedulingPartitioning
Experimental ResultsConclusions
54
![Page 55: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/55.jpg)
ExperimentsImplemented in C++ using MPICH2 as a message passing API
Ran on Intel OpenCirrus cluster: 120 processors
15 Nodes with 2 x Quad Core Intel Xeon ProcessorsGigabit Ethernet Switch
Tested on Markov Logic Networks obtained from Alchemy [Domingos et al. SSPR 08]
Present results on largest UW-Systems and smallest UW-Languages MLNs
55
![Page 56: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/56.jpg)
Parallel Performance (Large Graph)
56
UW-Systems8K Variables406K Factors
Single Processor Running Time:
1 Hour
Linear to Super-Linear up to 120 CPUs
Cache efficiency
Linear
Bett
er
![Page 57: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/57.jpg)
Parallel Performance (Small Graph)
UW-Languages1K Variables27K Factors
Single Processor Running Time:
1.5 Minutes
Linear to Super-Linear up to 30 CPUs
Network costs quickly dominate short running-time
57
Linear
Bett
er
![Page 58: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/58.jpg)
OutlineOverviewGraphical Models: Statistical StructureInference: Computational Structureτε - Approximate Messages: Statistical StructureParallel Splash
Dynamic SchedulingPartitioning
Experimental ResultsConclusions
58
![Page 59: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/59.jpg)
Summary
59
Algorithmic Efficiency
Parallel Efficiency
ImplementationEfficiency
Independent Parallel Splashes
Splash Structure +
Belief Residual Scheduling
Distributed QueuesAsynchronous Communication
Over-Partitioning
Experimental results on large factor graphs:
Linear to super-linear speed-up using up to 120 processors
![Page 60: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/60.jpg)
Sophistication
Para
llelis
m
Parallel Splash Belief Propagation
We are here
We were here
Conclusion
![Page 61: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/61.jpg)
Questions
61
![Page 62: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/62.jpg)
Protein Results
62
1 2 3 4 5 6 7 80
1
2
3
4
5
6
7
8
9
10
Number of Cores
Rel
ativ
e S
peed
up
Residual BP
Linear Speedup
ResidualSplash BP
MapReduce BP
![Page 63: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/63.jpg)
3D Video Task
63
1 2 3 4 5 6 7 80
1
2
3
4
5
6
7
8
Number of Cores
Rel
ativ
e S
peed
up
Residual BP
ResidualSplash BP
Linear Speedup
MapReduce BP
![Page 64: Carnegie Mellon Parallel Splash Belief Propagation Joseph E. Gonzalez Yucheng Low Carlos Guestrin David O’Hallaron TexPoint fonts used in EMF. Read the](https://reader035.vdocument.in/reader035/viewer/2022062804/5697bf911a28abf838c8e907/html5/thumbnails/64.jpg)
Distributed Parallel Setting
Opportunities:Access to larger systems: 8 CPUs 1000 CPUsLinear Increase:
RAM, Cache Capacity, and Memory Bandwidth
Challenges:Distributed state, Communication and Load Balancing
64
Fast Reliable Network
NodeNode
CPU BusMemoryCache
NodeNode
CPU BusMemoryCache