Estimating Contagion Rates in Kickstarter Twitter Cliques DAVON WOODARD/MD SALMAN AHMED CS 6604 FALL 2017


Page 1: Estimating Contagion Rates in Kickstarter Twitter Cliquespeople.cs.vt.edu/ahmedms/cs6604/cs.6604.pdfMotivation Initial: Can the prediction power of Twitter data be extended by complementing

Estimating Contagion Rates in Kickstarter Twitter Cliques

DAVON WOODARD/MD SALMAN AHMED CS 6604 FALL 2017


Outline

1. Introduction and Initial Motivation
2. Data Collection
3. Assumptions
4. Simple Contagion
   1. Model
   2. Results
5. Complex Contagion
   1. Model
   2. Results
6. Challenges
7. Conclusion


Introduction

▪ Crowdfunding has emerged as a popular community-based micro-financing model that lets entrepreneurs, artists, and activists bring their projects to fruition.

▪ Successful campaigns, those that meet their funding goals, bring not only financial utility for the creator but also social utility for the backers.


Motivation

Initial: Can the predictive power of Twitter data be extended by complementing static data with the model of social media exposure curves (stickiness and persistence) presented by Romero et al. [1], coupled with the use of censored data presented by Li et al. [2]?

Secondary: Given a set of assumptions, can the rate of spread of Kickstarter campaigns in a Twitter network be estimated using simple contagion and complex contagion models?


Data Collection

▪ Crowdfunding dataset

▪ From Chandan K. Reddy’s team

▪ Twitter dataset

▪ Collected using the Twitter public API and GetOldTweets-python (1)

▪ We modified the GetOldTweets-python API to meet our needs

1. https://github.com/Jefferson-Henrique/GetOldTweets-python


Dataset Characteristics

▪ Crowdfunding dataset (1): 18k total records

▪ Each record corresponds to a project

▪ Contains project id, name, URL, duration, goal amount, pledged amount, …

▪ Twitter dataset (2): 162k total records

▪ Each record corresponds to a tweet

▪ Contains the text, user, date of tweet, tweet link, retweet, etc.

1 & 2. Both datasets are available at http://people.cs.vt.edu/ahmedms/cs6604.html


Relevancy [Crowdfunding dataset x Twitter dataset]

▪ Of the 18k projects, 10k have associated tweets

▪ Of those 10k, 4k have enough tweets to be used in model development


Assumptions

Our crawling program retrieves only information about tweets, not the structure of the user network, so we assume:

▪ The Twitter user network is a clique

▪ The total number of nodes in the Twitter user network is twice the number of unique Twitter users


Model – Simple Contagion

[4]


Model – Simple Contagion

β(successful) = 0.0092

β(failed) = 0.0120
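To give a feel for what these two rates imply, here is a minimal sketch that integrates the SI rate equation used in this deck, dI/dt = β S(t) I(t), with a forward-Euler step. The two β values come from the slide; everything else is an assumption made for illustration: S and I are treated as fractions of the clique (so S = 1 − I), the initial infected fraction `i0`, the number of steps, and the step size `dt` are hypothetical, and the function name `simulate_si` is ours rather than the authors'.

```python
def simulate_si(beta, i0, steps, dt=1.0):
    """Forward-Euler integration of dI/dt = beta * S * I, with S = 1 - I.

    S and I are fractions of the population (an assumption; the slides
    do not state the normalization).
    """
    infected = [i0]
    for _ in range(steps):
        i = infected[-1]
        s = 1.0 - i                      # everyone not infected is susceptible
        i_next = i + dt * beta * s * i   # one Euler step of the SI equation
        infected.append(min(1.0, i_next))
    return infected


# Rates reported on the slide.
BETA_SUCCESSFUL = 0.0092
BETA_FAILED = 0.0120

# Hypothetical starting point: 1% of the clique has tweeted, 60 time steps.
curve_successful = simulate_si(BETA_SUCCESSFUL, i0=0.01, steps=60)
curve_failed = simulate_si(BETA_FAILED, i0=0.01, steps=60)
```

Under these assumptions, the larger β of the failed campaigns produces a slightly faster-growing curve, which is one way to see why the authors flag the result as sensitive to their assumptions.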


Results – Simple Contagion

▪ Variance:

▪ Assumptions

▪ Network Structure


Model - Complex Contagion [Concept of Exposure Curve]

[Figure: a five-node network (nodes 1 to 5), shown on Day 1 and on Day 2]

After one day, for $k = 1$:

$E_k = \{2, 3, 4, 5\}$, $I_k = \{4, 5\}$, $P(k) = 2/4$
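The arithmetic in this example can be sketched in a few lines of Python. The sketch assumes the usual exposure-curve reading from Romero et al. [1]: E_k is the set of nodes that have been exposed k times, I_k is the subset of those that adopted, and P(k) is the ratio of their sizes. The function name `exposure_probability` is hypothetical.

```python
def exposure_probability(exposed, adopted):
    """Return P(k) = |I_k| / |E_k| for one value of k.

    `exposed` is E_k and `adopted` is I_k. Adopters outside the exposed
    set are ignored, and None is returned when nobody was exposed.
    """
    exposed = set(exposed)
    adopted = set(adopted) & exposed
    if not exposed:
        return None
    return len(adopted) / len(exposed)


# The slide's example for k = 1.
E_k = {2, 3, 4, 5}
I_k = {4, 5}
p_k = exposure_probability(E_k, I_k)  # 2/4 = 0.5
```

Repeating this for each k observed in the data gives the empirical points of the exposure curve.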


Exposure Curve

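[Figure: exposure curve, $P(k)$ plotted against $k$]

$$P(k) = \delta e^{-\lambda k} + c$$

$$\frac{dI(t)}{dt} = \beta S(t) I(t) = P(k)\, S(t) I(t)$$

$$\frac{dI(t)}{dt} = \left(\delta e^{-\lambda k} + c\right) S(t) I(t)$$

$$\frac{dI(t)}{dt} = \left(\delta e^{-\lambda I(t-1)} + c\right) S(t) I(t)$$

$\lambda = 0.11360768$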
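A rough sketch of how the final equation could be stepped forward in time follows. Only λ is taken from the slide. The values of δ and c are not given in the deck, so the ones used here are hypothetical placeholders, as are the network size, the initial count, and the function names. The sketch also assumes that the exponent uses the infected count from the previous step (in a clique, every susceptible node is exposed to every infected node) while the S·I term is scaled by the network size; the slides do not state either normalization.

```python
import math

LAMBDA = 0.11360768  # decay constant reported on the slide


def exposure_rate(k, delta, c, lam=LAMBDA):
    """P(k) = delta * exp(-lam * k) + c."""
    return delta * math.exp(-lam * k) + c


def simulate_complex(n_nodes, i0, steps, delta, c, lam=LAMBDA):
    """Euler steps of dI/dt = (delta * exp(-lam * I(t-1)) + c) * S(t) * I(t).

    I is a node count; the S term is divided by n_nodes (an assumption).
    For the first step, I(t-1) is taken to be the initial count.
    """
    infected = [float(i0)]
    previous = float(i0)
    for _ in range(steps):
        current = infected[-1]
        susceptible = n_nodes - current
        rate = exposure_rate(previous, delta, c, lam)
        growth = rate * (susceptible / n_nodes) * current
        previous = current
        infected.append(min(float(n_nodes), current + growth))
    return infected


# Hypothetical parameters: delta, c, network size, and seed count are NOT from the slides.
trajectory = simulate_complex(n_nodes=200, i0=2, steps=50, delta=0.5, c=0.01)
```

Because the rate decays as the infected count rises, growth in this sketch is fastest early on and then settles toward the constant floor c.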


Results - Complex Contagion [Actual vs. Prediction]

▪ Large approximation error

▪ Inadequate Twitter data

▪ Poor assumptions


Challenges

▪ Heading in the wrong direction until the last moment

▪ Treating the output as the input to our models

▪ Inadequate associated Twitter data

▪ Poor assumptions

[Figure: tweets per week]


Conclusion

▪ Assumptions are critical

▪ The model should fit the available dataset

▪ With additional network information, further research on the information diffusion of Kickstarter campaigns could help guide marketing efforts


References

1. Romero et al. “Differences in the Mechanics of Information Diffusion Across Topics: Idioms, Political Hashtags, and Complex Contagion on Twitter.”

2. Yan Li, Vineeth Rakesh, and Chandan K. Reddy. 2016. “Project Success Prediction in Crowdfunding Environments.”

3. Aditya Prakash. “Epidemics: Probabilistic Models.” Lecture, 9/25/17, VT.

4. https://institutefordiseasemodeling.github.io/Documentation/general/generic-tutorial7SI.html