insight dataengineering henok_yelpdemo

Where is my tweet? Henok Mengistu Insight Data Engineering Fellow Silicon Valley, Summer 2016

Upload: university-of-wyoming

Post on 11-Apr-2017


TRANSCRIPT

Page 1: Insight dataengineering henok_yelpdemo

Where is my tweet?

Henok Mengistu

Insight Data Engineering Fellow

Silicon Valley, Summer 2016

Page 2: Insight dataengineering henok_yelpdemo

Motivation

Page 3: Insight dataengineering henok_yelpdemo

Motivation

But this number doesn't show how the tweet spreads out.

Page 4: Insight dataengineering henok_yelpdemo

But a re-tweet graph could show how it spreads

Page 5: Insight dataengineering henok_yelpdemo

A Demo

http://52.33.140.25/

http://www.whereismytweet.online/

Page 6: Insight dataengineering henok_yelpdemo

Under the hood

Page 7: Insight dataengineering henok_yelpdemo

Engineering Challenges

● Re-tweets could arrive out of order

– Spark can't sort across a data stream

– Apache Flink
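
The out-of-order problem above can be illustrated with a small reorder buffer. This is a minimal plain-Python sketch of the general idea behind event-time watermarks, not the pipeline from the talk and not Flink's API. The `(event_time, user)` record shape, the `reorder` function and the `max_delay` bound are all assumptions made for illustration.

```python
import heapq
from typing import Iterable, Iterator, List, Tuple

# Hypothetical record shape: (event_time_in_seconds, retweeting_user)
Event = Tuple[int, str]


def reorder(events: Iterable[Event], max_delay: int) -> Iterator[Event]:
    """Yield events in event-time order.

    Assumes no event arrives more than `max_delay` seconds behind the
    newest timestamp seen so far. Events that break this assumption
    would still be emitted, but possibly out of order.
    """
    buffer: List[Event] = []  # min-heap keyed on event time
    newest = None             # largest event time observed so far
    for event in events:
        heapq.heappush(buffer, event)
        newest = event[0] if newest is None else max(newest, event[0])
        watermark = newest - max_delay
        # Anything at or below the watermark can no longer be overtaken
        # by a later arrival, so it is safe to emit.
        while buffer and buffer[0][0] <= watermark:
            yield heapq.heappop(buffer)
    # End of stream: flush the remaining events, still in order.
    while buffer:
        yield heapq.heappop(buffer)


# Re-tweets listed in arrival order; timestamps are out of order.
arrivals = [(1, "a"), (3, "c"), (2, "b"), (6, "e"), (5, "d"), (9, "f")]
ordered = list(reorder(arrivals, max_delay=2))
print(ordered)
```

The trade-off is latency against completeness: a larger `max_delay` tolerates later arrivals but holds every re-tweet in the buffer longer before it is emitted.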

Page 8: Insight dataengineering henok_yelpdemo

● I am Henok

– Originally from Ethiopia

– Currently, a PhD student at the University of Wyoming

● Working on Evolutionary Computation

– I like playing and watching Soccer

– But skiing, not so much

Page 9: Insight dataengineering henok_yelpdemo

Thank you!

Page 10: Insight dataengineering henok_yelpdemo

Queries

● On the re-tweet graph

– Who is my audience?

● Geographically, by social group

– Betweenness centrality

● Who is relevant for spreading my tweet?

● Identify influential followers
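
The betweenness-centrality query can be sketched on a toy re-tweet graph. The code below is an illustrative plain-Python version of Brandes' algorithm for unweighted directed graphs, not the implementation used in the demo. The edge direction (original poster to re-tweeter), the user names and the helper names `build_graph` and `betweenness` are assumptions.

```python
from collections import deque
from typing import Dict, Iterable, Set, Tuple


def build_graph(retweets: Iterable[Tuple[str, str]]) -> Dict[str, Set[str]]:
    """Build adjacency sets with one edge source -> retweeter per pair."""
    graph: Dict[str, Set[str]] = {}
    for source, retweeter in retweets:
        graph.setdefault(source, set()).add(retweeter)
        graph.setdefault(retweeter, set())  # keep leaf users as nodes
    return graph


def betweenness(graph: Dict[str, Set[str]]) -> Dict[str, float]:
    """Unnormalized betweenness centrality (Brandes, unweighted, directed)."""
    score = {v: 0.0 for v in graph}
    for s in graph:
        order = []                         # nodes in BFS visiting order
        preds = {v: [] for v in graph}     # shortest-path predecessors
        sigma = {v: 0 for v in graph}      # number of shortest paths from s
        sigma[s] = 1
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if w not in dist:          # first time w is reached
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:  # v lies on a shortest path to w
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Accumulate dependencies from the farthest nodes back toward s.
        delta = {v: 0.0 for v in graph}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    return score


# Hypothetical cascade: alice's tweet reaches carol and dave only via bob.
retweets = [("alice", "bob"), ("bob", "carol"), ("bob", "dave"), ("alice", "erin")]
scores = betweenness(build_graph(retweets))
print(max(scores, key=scores.get), scores["bob"])  # bob 2.0
```

Here `bob` scores highest because both shortest paths from `alice` to `carol` and to `dave` pass through him, which is the sense in which a follower is "relevant" for spreading the tweet.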