feature driven and point process approaches for...

30
Feature Driven and Point Process Approaches for Popularity Prediction Swapnil Mishra, Marian-Andrei Rizoiu, Lexing Xie The Australian National University and Data61, Australia 1

Upload: others

Post on 29-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Feature Driven and Point Process Approaches for Popularity Prediction

Swapnil Mishra, Marian-Andrei Rizoiu, Lexing XieThe Australian National University and Data61, Australia

1

Page 3: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Which cascade will gather more attention?

A. tw1B. tw2C. same(∓5%)

3

http://www.nytimes.com/2015/02/27/...

Leonard Nimoy, Spock of ‘Star Trek,’ Dies at 83

Page 4: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Which cascade will gather more attention?

A. tw1B. tw2C. same(∓5%)

4

http://www.nytimes.com/2015/02/27/...

Leonard Nimoy, Spock of ‘Star Trek,’ Dies at 83

Page 5: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Which cascade will gather more attention?

A. tw1B. tw2C. same(∓5%)

5

Prediction is difficult!

How to use event times and user features?

http://www.nytimes.com/2015/02/27/...

Leonard Nimoy, Spock of ‘Star Trek,’ Dies at 83

Page 6: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Problem: predict the number of retweets on Twitter

Why do we care about it??

● Monetisation ● Personalization

Our Contributions

● Bridge gap:○ Approaches: feature-driven vs generative○ Problem setting: regression vs classification

● One new benchmark dataset:○ Features○ Event times

Popularity

6

Page 7: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Outline

● Popularity prediction problem

● Self-exciting point process

● Feature-driven approaches

● Results

7

Page 8: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Existing SolutionsApproach:

Problem Setting:

Open Questions:

● Can we have best of both generative and feature-driven models?

● How useful are features computed over data available through Public APIs?

8

Feature Driven: Cheng(2014), Martin(2016), Pinto(2013) Generative: Zhao(2015), Ding(2015), Shen(2014)

Regression: Zhao(2015), Shen(2014) Classification: Cheng(2014), Shamma(2011)

Page 9: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Self-Exciting Point Processes

9

Page 10: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Homogeneous Poisson Process

Self-Exciting Point Processes

10

Page 11: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Homogeneous Poisson Process

Self-Exciting Point Processes

11

Marked Hawkes Process

Page 12: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Kernel for Marked Hawkes

12

content virality

user influence

memory

Page 13: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Estimating Marked Hawkes Proc.

To have < 1, we use IPOPT(Wächter2006)

13

Page 14: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Estimating Marked Hawkes Proc.

To have < 1, we use IPOPT(Wächter2006)

14

Page 15: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

From Event Series to Predictions

15

T

Page 16: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

From Event Series to Predictions

16

T

Page 17: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Further Improving PredictionLimitations for predictions:

● #followers approximates influence● Assumes fixed parametric kernel● Local minima in parameter

estimation

17

Page 18: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Further Improving PredictionLimitations for predictions:

● #followers approximates influence● Assumes fixed parametric kernel● Local minima in parameter

estimation

18

Predictive Layer:

Page 19: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Outline

● Popularity prediction problem

● Self-exciting point process

● Feature-driven approaches

● Results

19

Page 20: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

FeaturesCriterion for selection:

● can be computed on data from Public API’s● shown to perform well

20

User Features(Cheng2014, Martin 2016)

Temporal Features(Cheng2014)

Early volume(Szabo2010, Pinto2013)

Past Success(Bakshy2011, Martin2016)

Page 21: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

DatasetsTweet-1Mo

● Provided by Zhao et.al(SEISMIC)

● One month tweets● Random Sample of 30.5K cascades > 50

NEWS

● April’15 to July’15 english news● 49.7 million tweets ● 110K cascades > 50

21

Page 22: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

DatasetsTweet-1Mo

● Provided by Zhao et.al(SEISMIC)

● One month tweets● Random Sample of 30.5K cascades > 50

NEWS

● April’15 to July’15 english news● 49.7 million tweets ● 110K cascades > 50

22

Feature-Driven

Generative

NEWS Available Now

Page 23: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Evaluation Setup

● 10-fold cross-validation for training

● Absolute Relative Error(ARE):

23

Page 24: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Results on Tweet-1Mo

24

Mean(std.dev)ARE

10 Mins 1 Hour

SEISMIC 0.70(15.58) 0.51(10.81)

HAWKES 0.33(0.41) 0.30(0.38)

Page 25: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Results on NEWS

25

Mean(std.dev)ARE

10 Mins 1 Hour

SEISMIC 0.71(4.89) 0.32(0.40)

Feature-Driven 0.22(0.17) 0.17(0.14)

HAWKES 0.22(0.80) 0.17(0.36)

Hybrid 0.15(0.14) 0.11(0.12)

Page 26: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

26

Example Prediction

Page 27: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Conclusion● Established common benchmark to compare feature driven

and generative approaches

27

Page 28: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Conclusion● Established common benchmark to compare feature driven

and generative approaches

28

● Generative explanatory model with a predictive layer outperforms current state of the art

Page 29: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Conclusion● Established common benchmark to compare feature driven

and generative approaches

29

● Generative explanatory model with a predictive layer outperforms current state of the art

● Small set of features with only message content and basic user features generalizes over problem space

Page 30: Feature Driven and Point Process Approaches for …cm.cecs.anu.edu.au/documents/smishra_cikm16_presentation.pdfFeature Driven and Point Process Approaches for Popularity Prediction

Conclusion● Established common benchmark to compare feature driven

and generative approaches

30

● Generative explanatory model with a predictive layer outperforms current state of the art

● Small set of features with only message content and basic user features generalizes over problem space

● Future: interplay between related cascades and RNN based popularity models

Data+Code:https://github.com/s-mishra/featuredriven-hawkes