panagiotis t. metaxas cs, wellesley college & crcs, harvard university joint work with eni...

15
Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

Upload: samson-francis

Post on 17-Dec-2015

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

Panagiotis T. MetaxasCS, Wellesley College &

CRCS, Harvard University

Joint work with Eni Mustafaraj and Dani Gayo-Avello

Page 2: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

OverviewSocial Media can “predict the present/future”

Success storiesElectoral Predictions with Social Media

Successful Predictions

Can Twitter predict US Elections?Elsewhere: Can Google Trends predict US

Elections?

How To Predict Elections using Social Media

Page 3: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

Social Media can “predict the future” (?!)

Page 4: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

To Predict (according to Merriam-Webster)

Page 5: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

Successful Electoral Predictions

This slide was intentionally left blank

Page 6: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

Successful Would-be PredictionsMethods Used:

Raw volume of TweetsSentiment Analysis of Tweets

Data Used:2009 German Elections Tweets2008 US Presidential Elections Tweets

Our Data:2010 Special Senatorial Elections (MA) Tweets2010 Senatorial Elections (CO, NV, CA, KY,

DE) Tweets

Page 7: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

Can Twitter Volume Predict?

Twitter volume (would have) predicted correctly 50% of the high-visibility US Congressional races

Page 8: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

Can Tweet Sentiment Predict?

Sentiment analysis (would have) predicted correctly 50% of the high-visibility US Congressional races

But was it sensing the sentiment correctly?

Page 9: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

“What if you had used some other methods?”We did not invent the methods we used,

we just evaluated them on data other than those they were designed for

We started off expecting to predict successfully(In fact, we were “predicting” at times ;-)

But we learned a lot from these experiments:

How to Predict Using Social Media

Page 10: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

How To PredictA method of prediction should be an algorithm

finalized before the elections: (input) how Social Media data are to be

collected, including the dates of data collection, (filter) the way in which the cleanup of the data

is to be performed (e.g., the selection of keywords relevant to the election),

(method) the algorithms to be applied on the data along with their input parameters, and

(output) the semantics under which the results are to be interpreted.

Page 11: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

How To PredictA method of prediction should be an

algorithm.Data from Social Media

are fundamentally different than Data from Natural Phenomena.People will change their behavior the next time

aroundSpammers & Activists will try to take

advantagee.g. “Twitter Bomb” and “Prefab Tweeter Factory”

Page 12: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

How To PredictA method of prediction should be an

algorithm.Data from Social Media

are fundamentally different than Data from Natural Phenomena.

Form a testable theory on why and when it predicts.Go past the OMG! FactorAvoid the self-deception syndrome

Page 13: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

How To PredictA method of prediction should be an algorithm.Data from Social Media

are fundamentally different than Data from Natural Phenomena.

Form a testable theory on why and when it predicts.

(maybe) Learn from the professional pollsters.Tweet ≠ UserUser ≠ Eligible VoterEligible Voter ≠ Voter

Page 14: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello
Page 15: Panagiotis T. Metaxas CS, Wellesley College & CRCS, Harvard University Joint work with Eni Mustafaraj and Dani Gayo-Avello

Thank you!