latent feature models for network data over time jimmy foulds advisor: padhraic smyth

26
Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth (Thanks also to Arthur Asuncion and Chris Dubois)

Upload: oprah

Post on 05-Jan-2016

41 views

Category:

Documents


0 download

DESCRIPTION

Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth (Thanks also to Arthur Asuncion and Chris Dubois). Overview. The task Prior work – Miller, Van Gael, Indian Buffet Processes The DRIFT model Inference Preliminary results Future work. The Task. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Latent Feature Models for Network Data over Time

Jimmy FouldsAdvisor: Padhraic Smyth

(Thanks also to Arthur Asuncion and Chris Dubois)

Page 2: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Overview

The task Prior work – Miller, Van Gael, Indian Buffet

Processes The DRIFT model Inference Preliminary results Future work

Page 3: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

The Task

Modeling Dynamic (time-varying) Social Networks

Interested in prediction Model interpretation for sociological

understanding Continuous time relational events versus

panel data?

Page 4: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Applications

Predicting Email Communications

Page 5: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth
Page 6: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Applications

Predicting Paper Co-authorship NIPS data

Page 7: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth
Page 8: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Prior Work

Erdos-Renyi Models are “pseudo-dynamic” Continuous Markov Process Models (Snijders

2006) The network stochastically optimizes ERGM

likelihood function Dynamic Latent Space Model (Sarkar & Moore,

2005) Each node (actor) is associated with a point in a low

dimensional space (Raftery et al. 2002). Link probability is a function of distance between points

Gaussian jumps in latent space in each timestep

Page 9: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Prior Work

Nonparametric Latent Feature Relational Model (Miller et al. 2009)

Each actor is associated with an unbounded sparse vector of binary latent features, generated from an Indian Buffet Process prior

The probability of a link between two actors is a function of the latent features of those actors (and additional covariates)

Page 10: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Prior Work

Nonparametric Latent Feature Relational Model (Miller et al. 2009) generative process:

Z ~ IBP W

kk' ~ N(0,

w)

Yij ~ (Z

iWZ

j+ covariate terms)

A kind of blockmodel with overlapping classes

Page 11: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

(The next few slides are from Zoubin Ghahramani's NIPS 2009 Workshop talk)

Page 12: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth
Page 13: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth
Page 14: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth
Page 15: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

How to Make this Model Dynamic For Longitudinal Data?

We would like the Zs to change over time, modeling changing interests, community memberships, …

Want to maintain sparsity property, but model persistence, generation of new features, ...

Page 16: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Infinite Factorial Hidden Markov Models (Van Gael et al., 2010)

A variant of the IBP A probability distribution over a potentially

infinite number of binary Markov chains Sparsity: At each timestep, introduce new

features using the IBP distribution Persistence: A coin flip determines whether

each feature persists to the next timestep Hidden Markov structure: the latent features

are hidden but we observe something at each timestep.

Page 17: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

DRIFT: the Dynamic Relational Infinite FeaTure Model

The iFHMM models the evolution of one actor's features over time

We use an iFHMM for each actor, but share the transition probabilities

Observed graphs generated via (Miller et al. 2009)'s latent feature model Y

ij ~ (Z

iWZ

j+...)

Page 18: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

DRIFT: the Dynamic Relational Infinite FeaTure Model

Page 19: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Inference

Markov chain Monte Carlo inference Use “slice sampling” trick with the stick-

breaking construction of the IBP to effectively truncate num features but still perform exact inference

Blocked Gibbs sampling on the other variables Forward-backward dynamic programming on

each actor's feature chain Metropolis-Hastings updates for W's since non-

conjugate

Page 20: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Group DRIFT

Clustering to reduce the number of chains Each actor has hidden class variable c < C <

N The chains of infinite binary feature vectors

are associated with classes rather than actors Allows us to scale up to large numbers of

actors Clustering may be interpretable

Page 21: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

i=1:C

Cn

n=1:N

β=1/C

Group DRIFT

Inference for a, b, exactly the same

Inference for z’s similar:• Slightly different “emission” probability• Run forward-backward sampler on M*C chains rather than M*N chains

Inference for c’s (actor’s assignment to specific chain) is easy too

Inference for W is similar (slightly different likelihood). Note we must now assume that the diagonal of W can be non-zero.

Page 22: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Preliminary Experimental Results (Synthetic Data)

Page 23: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Preliminary Experimental Results (Synthetic Data)

Page 24: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Preliminary Experimental Results (Synthetic Data)

Page 25: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Future work

Extension to Continuous Time It's easy to use IBP latent factor model as a

covariate in Relational Event Model (Butts 2008)

How to model the Zs changing over time for continuous data?

Page 26: Latent Feature Models for Network Data over Time Jimmy Foulds Advisor: Padhraic Smyth

Thanks for Listening!