construction of dependent dirichlet processes based on ... · construction of dependent dirichlet...

35
Construction of Dependent Dirichlet Processes based on Poisson Processes Dahua Lin Eric Grimson John Fisher CSAIL MIT NIPS 2010 Outstanding Student Paper Award Presented by Shouyuan Chen 1 / 31

Upload: others

Post on 04-Aug-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Construction of Dependent DirichletProcesses based on Poisson Processes

Dahua Lin Eric Grimson John Fisher

CSAILMIT

NIPS 2010 Outstanding Student Paper AwardPresented by Shouyuan Chen

1 / 31

Page 2: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Outline

MotivationsDynamic Mixture ModelsDependent Dirichlet Process

Model ConstructionKey ideaThree operationsDiscussions

Inference and ExperimentsInferenceExperiments

Conclusions

2 / 31

Page 3: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Outline

MotivationsDynamic Mixture ModelsDependent Dirichlet Process

Model ConstructionKey ideaThree operationsDiscussions

Inference and ExperimentsInferenceExperiments

Conclusions

3 / 31

Page 4: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Mixture Models: From Static to Dynamic

▶ Evolutionary clustering▶ add/remove clusters▶ movement of clusters

▶ Document modeling▶ add/remove topics▶ evolution of topics

▶ Other applications▶ image modeling▶ location base services▶ financial analysis

4 / 31

Page 5: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Dynamic Mixture Models

Model the behavior of latent components overtime▶ Creation of new components.▶ Removal of existing components.▶ Variation of component parameters.

Components can be▶ Clusters→ Dynamic Gaussian Mixture Model▶ Topics→ Dynamic Topic Model

5 / 31

Page 6: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Outline

MotivationsDynamic Mixture ModelsDependent Dirichlet Process

Model ConstructionKey ideaThree operationsDiscussions

Inference and ExperimentsInferenceExperiments

Conclusions

6 / 31

Page 7: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Dirichlet Process

Dirichlet process (DP) ≈ infinite limit of Dirichlet distribution.▶ Finite mixture models.

▶ Prior: Dir(�⃗): k -dimensional Dirichlet distribution▶ Pre-specified number of components k .

▶ Dirichlet process mixture models (DPMM).▶ Prior: DP(�,H): "infinite dimensional Dirichlet distribution"▶ Learn hidden k automatically.

7 / 31

Page 8: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Extending DP to Dependent DPs

A Single DP

D

X

A Markov chain of t DPs.

D1 D2 Dt

X (1) X (2) X (t)

⋅ ⋅ ⋅

Key problem

How to design the Markov chain to support 3 key dependenciesbetween Dt−1 → Dt :

Creation Add a new componentRemoval Remove an existing component

Transition Varying component parameters

8 / 31

Page 9: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Outline

MotivationsDynamic Mixture ModelsDependent Dirichlet Process

Model ConstructionKey ideaThree operationsDiscussions

Inference and ExperimentsInferenceExperiments

Conclusions

9 / 31

Page 10: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Several ways to Dirichlet Process

Equivalent constructions for DP

Random measure Basic definitionPosterior Chinese restaurant process

Atomic construction Stick breaking process

Construct DP(�) by ΓP and PP

▶ Generate compound poisson process PP(�× )

▶ Gamma process ΓP(�) is transformed from compoundpoisson process

▶ Dirichlet process DP(�) is normalized Gamma process

10 / 31

Page 11: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Poisson, Gamma and Dirichlet Process

Given a measurable space (Ω,Σ, �)

▶ Compound Poisson Process

Π∗ ∼ PP(�× ), (dw) = w−1e−wdw

Π is a point process (collection of infinite random points) onproduct space �×

Π =∞∑

i=0

�(�,!�)

▶ Gamma Process: Transformed from compound poissonprocess

G ≜∑

(�,!�)∈Π

!��� ∼ ΓP(�)

▶ Dirichlet Process: Normalized gamma process

D ≜ G/G(�) ∼ DP(�)

11 / 31

Page 12: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Key Idea for Transforming DPsDP

PP

New DP

New PPOperations?

Complete randomness

A random measure of which the measure values of disjointsubsets are independent.

Complete Randomness Preserving Operations

Applying any operations that preserve complete randomness toPoisson processes results in a new Poisson process.

▶ Superposition two PP▶ Subsampling a PP▶ Mapping a PP point by point

12 / 31

Page 13: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Constructing a Chain of DPs

Dt−1

Πt−1

Removeexisting

components

Vary re-maining

componentsAdd new

components

Subsampling Transition Superposition

Dt

Πt

Ht

new components

13 / 31

Page 14: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Outline

MotivationsDynamic Mixture ModelsDependent Dirichlet Process

Model ConstructionKey ideaThree operationsDiscussions

Inference and ExperimentsInferenceExperiments

Conclusions

14 / 31

Page 15: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Subsampling

Subsampling via Independent Bernoulli Trail

∀� = (�,p�), z� ∼ Bernoulli(q), D =∑

� p��� ∼ DP(�)

Sq(D) ≜1∑

z�=1 p�

∑z�=1

p���

Theorem (Subsampling)

Sq(D) ∼ DP(q�)

Proof sketch:▶ DP→ PP: D → Π ∼ PP(� ).▶ Subsampling PP: Sq(Π) = {� ∈ Π : z� = 1} ∼ PP(q� ).▶ PP→ DP: Sq(Π)→ Sq(D) ∼ DP(q�)

15 / 31

Page 16: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Transition

Independent movement of each point

T (⋅, ⋅): probabilistic transition kernel D =∑

� p��� ∼ DP(�)

T (D) ≜∑

p��T (�)

Theorem (Transition)

T (D) ∼ DP(T�)

Proof sketch:▶ DP→ PP: D → Π ∼ PP(�× ).▶ Mapping PP:

T (Π) = {(T (�), !�) : (�, !�) ∈ Π} ∼ PP(T�× ).▶ PP→ DP: T (Π)→ T (D) ∼ DP(T�)

16 / 31

Page 17: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Superposition

Sum of independent DPs

Dk ∼ DP(�k ), k = 1, . . . ,m be independent,(c1, . . . , cm) ∼ Dir(�1(Ω), . . . , �m(Ω))

Theorem (Superposition)

∑k

ckDk ∼ DP(�1 + . . .+ �m)

Proof sketch:▶ DP→ PP: Dk → Πk ∼ PP(�k × ).▶ Mapping PP:

∑k gk Πk ∼ PP(

∑k gk�k × ).

▶ PP→ DP: 1∑k gk

∑gkDk =

∑ckDk ∼ DP(

∑k �k )

17 / 31

Page 18: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Outline

MotivationsDynamic Mixture ModelsDependent Dirichlet Process

Model ConstructionKey ideaThree operationsDiscussions

Inference and ExperimentsInferenceExperiments

Conclusions

18 / 31

Page 19: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Are All Poisson Things Necessary?

Basic definition of DPD ∼ DP(�) is a DP if for any partition A1, . . . ,An of space Ω

(D(A1), . . . ,D(An)) ∼ Dir(�(A1), . . . , �(An))

Alternate proof of superposition theorem

Let D =∑

k ckDk , consider any partition A1, . . . ,An of space Ω,

(D(A1), . . . ,D(An)) =

(∑k

ckDk (A1), . . . ,∑

k

ckDk (An)

)∼ Dir(

∑k

�k (A1), . . . ,∑

k

�k (An))

The second step is from the property of Dirichlet distribution,and it concludes that D ∼ DP(

∑k �k ).

19 / 31

Page 20: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Proof by basic definition

▶ By defining DP on an extended space over functions, wecan directly model all three operations: subsampling,transition and superposition without appealing to Poissonprocess.

▶ Such construction also allows DDP to be constructed overany measurable space. This paper is exactly a specialcase if the space is fixed to be a discrete Markov chain.

20 / 31

Page 21: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Outline

MotivationsDynamic Mixture ModelsDependent Dirichlet Process

Model ConstructionKey ideaThree operationsDiscussions

Inference and ExperimentsInferenceExperiments

Conclusions

21 / 31

Page 22: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Inference Scheme

▶ Gibbs sampling. Sample one latent variable from posteriorat each step. Consider time 1, . . . , t sequentially.

▶ Update labels. Samples survived components (withprobability q) and component assignments

▶ Update parameters. Samples component parameter fromT (.)

▶ Iterates between step 2 and 3. Then move on to next timet + 1, and never estimate earlier distributions.

Sequential sampling

This paper doesn’t derive a batch sampling algorithm. Earliersamples would likely be less accurate.

22 / 31

Page 23: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Posterior computation

For simplicity of notation, and without loss of generality, assumethe expectation of new components equals with removedcomponents.

▶ Given a set of samples Φ ∼ Dt : �i appears ci times▶ (By DP posterior) Dt ∣Φ ∼ DP(�+

∑k ck��k )

▶ (This paper) Dt+1∣Φ ∼ DP(�+∑

k qck�T (�k ))

Is that true?

23 / 31

Page 24: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Argument against it

Let Dt =∑

k pk��k , Dt+1 = 1∑zk =1 pk

∑zk =1 pk��k . � ∼ Dt is one

sample from Dt .Fact: Dt+1∣� is not DP.

Mixture of DP is not DPConsider z� ∼ Bernoulli(q). There are two different cases forDt+1∣�:

▶ z� = 1. Thus � is not removed. Thus � is equivalentlyobserved in Dt+1. Dt+1∣�, z� = 1 ∼ DP(�+ ��)

▶ z� = 0. In this case � is removed. Dt+1∣�, z� = 0 ∼ DP(�)

Hence Dt+1∣� is a mixture of DPs:

Dt+1∣� = qDP(�+ ��) + (1− q)DP(�)

It is proved NOT a DP. [1]

24 / 31

Page 25: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

A deeper argument

▶ The observation is censored.▶ Only knows � is not removed at now.▶ The complete lifespan of a component � is not observed.▶ Posterior of DP under censored observations is a mixture

of DP.

25 / 31

Page 26: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Outline

MotivationsDynamic Mixture ModelsDependent Dirichlet Process

Model ConstructionKey ideaThree operationsDiscussions

Inference and ExperimentsInferenceExperiments

Conclusions

26 / 31

Page 27: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Synset dataset

Setup

▶ Simulated over 80 phases.▶ Gaussian mixture models with 2 components initially.▶ The speed of introducing new components (one new

component per 20 phases in average) and removingexisting components is equal.

▶ Mean of component has a Brownian motion.▶ 1000 samples per components at each phase.

BaselinesFinite mixture models with K = 3,5,10. DPM is not comparedwith.

27 / 31

Page 28: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Results

28 / 31

Page 29: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Real World Applications

Evolutionary Topic Model

▶ Model topic evolution of research paper▶ Data: all NIPS papers over years▶ Method: feature extraction to generate 12 dimensions

feature per document. Then use Gaussian mixture model.

People Flow

▶ The motion of people in New York Grand Central station.▶ Data: 90,000 frames in one hour, divided into 60 phases.▶ Try to group people tracks into flows depending on their

motion patterns

29 / 31

Page 30: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Summary

▶ Propose a principled methodology to construct dependentDirichlet processes based on the theoretical connectionsbetween Poisson, Gamma and Dirichlet processes.

▶ Develop a framework of evolving mixture model, whichallows creation and removal of mixture components, aswell as variation of parameters.

▶ Derive a Gibbs sampling algorithm for inferring mixturemodel parameters from observations.

▶ Test the approach on both synthetic data and realapplications.

30 / 31

Page 31: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

My Summary

▶ Poisson process is not essential for constructing DDP.▶ Sequential sampling may damage the performance.▶ Posterior of this model should be MDP rather than DP.

31 / 31

Page 32: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

For Further Reading I

C. E. AntoniakMixtures of Dirichlet processes with applications toBayesian nonparametric problemsAnnals of Statistics, 2(6):1152-1174, 1974.

32 / 31

Page 33: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

Dirichlet ProcessStick-breaking representation

A sample from DP is almost surely discrete.

Stick-breaking representation

Let D ∼ DP(�,H) is a Dirichlet process. Then, almost surely

D =∞∑

i=1

pi��i

p1...∞ ∼ GEM(�)

∀i , �i ∼ H, i.i.d

The GEM distribution is called "stick-breaking" distribution.

33 / 31

Page 34: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

My Experiments

Setup

▶ Simulated over 30 phases.▶ Gaussian mixture models with 2 components initially.▶ The speed of introducing new components (0.4 new

component per phase in average) and removing existingcomponents is equal.

▶ Mean of component has a Brownian motion.▶ 200 samples per components at each phase.▶ Bias in posterior is fixed

BaselinesDPM, Sequential sampling (Markov-DPM), Batch algorithm(F-DPM)

34 / 31

Page 35: Construction of Dependent Dirichlet Processes based on ... · Construction of Dependent Dirichlet Processes based on Poisson Processes Author: Dahua Lin, Eric Grimson, John Fisher

My Experiment Results

0 5 10 15 20 25 30time

0.5

0.0

0.5

1.0

1.5

2.0

2.5vari

ati

on o

f in

form

ati

on F-DPM

Markov-DPM

DPM

0 5 10 15 20 25 30time

0

5

10

15

20

25

30

num

ber

of

clust

ers

F-DPM

Markov-DPM

DPM

Ground Truth

35 / 31