École d'été: web science and the mind :uqam

57
The opportunity for Social Data Scientists

Upload: claude-theoret

Post on 24-Apr-2015

118 views

Category:

Internet


3 download

DESCRIPTION

Presentation to the Web Science summer school at UQAM, on the rise of the data scientist in the new economy

TRANSCRIPT

Page 1: École d'été: Web Science and the Mind :UQAM

The opportunity for Social Data Scientists

Page 2: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Part 1 The Explosion

Page 3: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 4: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 5: École d'été: Web Science and the Mind :UQAM

Every minute 8-10 months ago:

• 48 hours of video are downloaded on Youtube• 320 new accounts and 98,000 tweets appear

on Twitter• 168,000,000 million emails are sent • 20,000 new posts on Tumblr• 6,600 photos appear on Flickr• Over 20% of all websites are

CMS/wordpress/etc…

Page 6: École d'été: Web Science and the Mind :UQAM

Every minute today:

• 100 hours of video are downloaded on Youtube

• ??? new accounts and 236,000 tweets appear on Twitter

• 204,000,000 million emails are sent • 28,000 new posts on Tumblr• 1,600 photos appear on Flickr !!! No shit!

Page 7: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 8: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 9: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 10: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 11: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 12: École d'été: Web Science and the Mind :UQAM

@cgtheoret

But…• Facebook has lost 1.5 million users in Canada

and 6 million in the United States • Yahoo study: 50% of the content that is read

and shared by humans is produced by only 20, 000 accounts 0.05%

Page 13: École d'été: Web Science and the Mind :UQAM
Page 14: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 15: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Gartner is predicting an explosion in Social Media Analytics It spending

Page 16: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 17: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 18: École d'été: Web Science and the Mind :UQAM

@cgtheoret

In a lot of ways Social “Big Data” is like Oil…• Difficult and expensive to extract

Page 19: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Difficult and expensive to extract

Page 20: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Difficult and expensive to store and distribute

Page 21: École d'été: Web Science and the Mind :UQAM

Cheapest (and least useful) when its unrefined

@cgtheoret

Page 22: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 23: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 24: École d'été: Web Science and the Mind :UQAM

In a lot of ways “Big Data” is like Oil…• Can’t be used by consumers unless refined• More expensive at every step of refinement

@cgtheoret

Page 25: École d'été: Web Science and the Mind :UQAM

The Market is Producing a plethora of derived higher value data products

@cgtheoret

Page 26: École d'été: Web Science and the Mind :UQAM

@cgtheoret

In a lot of ways “Big Data” is like Oil…

• Difficult and expensive to extract• Difficult and expensive to store and distribute• Cheapest in its unrefined form• More expensive at every step of refinement• Produces a plethora of derived products• and it’s actually quite “dirty”!!!!

Page 27: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Part 2

Page 28: École d'été: Web Science and the Mind :UQAM

Social Data is one of the reasons why IBM added a 4th V to the Big Data Definition

VERACITY

@cgtheoret

Page 29: École d'été: Web Science and the Mind :UQAM

Social Data Analytics = Oil Refineries

@cgtheoret

Page 30: École d'été: Web Science and the Mind :UQAM

@cgtheoret

6 factors affect Data Veracity …

1. Accuracy: Is it true?2. Precision: If true, error margin?3. Reliability: Is it there all the time?4. Provenance: Can you trace the source?5. Fidelity: Did it change from the

source?6. Permission: Can you use it for the

context?

Page 31: École d'été: Web Science and the Mind :UQAM

Black Hat SEO : Blogs

Page 32: École d'été: Web Science and the Mind :UQAM

Twitter: 46% of brand followers are bots

Page 33: École d'été: Web Science and the Mind :UQAM

Black Hat Social Marketing : Twitter

Page 34: École d'été: Web Science and the Mind :UQAM

Or in some cases over 90 %…

Page 35: École d'été: Web Science and the Mind :UQAM

Dissapearing Romney: FB as well…

Page 36: École d'été: Web Science and the Mind :UQAM

And it is getting worse …

Page 37: École d'été: Web Science and the Mind :UQAM

Trying to solve the Veracity problem …

Page 38: École d'été: Web Science and the Mind :UQAM

Trying to solve the Veracity problem …

Page 39: École d'été: Web Science and the Mind :UQAM

The Big Guys are now doing Veracity …

Murali Krishnam <[email protected]>Murali Krishnam <[email protected]>

Page 40: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Part 3The Opportunity for Social Data Scientists

Page 41: École d'été: Web Science and the Mind :UQAM

@cgtheoret

Page 42: École d'été: Web Science and the Mind :UQAM

@cgtheoret

“McKinsey Global Institute estimated that by 2018 there will be 4 million big data related positions in the U.S. that require quantitative and analytical skills. However, there will be a potential shortfall of 1.5 million data-savvy managers and analysts to fill these positions”

Page 43: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Zeitgeist

Page 44: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 45: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 46: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 47: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 48: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 49: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 50: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 51: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 52: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 53: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 54: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 55: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 56: École d'été: Web Science and the Mind :UQAM

@cgtheoret @fffady

Page 57: École d'été: Web Science and the Mind :UQAM

@cgtheoret

[email protected]

@cgtheoret

Merci!