13:30 vision: parsons data analytics and society

46
REBECCA PARSONS [email protected] http://thoughtworks.com CTO Big Data Data Wednesday, November 7, 12

Upload: graphconnect

Post on 30-Oct-2014

517 views

Category:

Documents


2 download

DESCRIPTION

Data is accumulating rapidly. Our approaches to capturing and analyzing data, as well as in making connections between and drawing conclusions from this data are evolving as well. In this talk, we'll review some of the changes in how data is used and analyzed, highlighting several examples of the power of data analysis outside of the usual suspects: customer needs, production dashboards, etc. We'll look at how data is used to track election violence, movement of people after a natural disaster, and attempts to predict famine and other humanitarian crises before they happen. Rebecca Parsons (CTO at ThoughtWorks) Dr. Rebecca Parsons is ThoughtWorks' Chief Technology Officer. She has more than 20 years' application development experience, in industries ranging from telecommunications to emergent internet services. She has extensive experience leading in the creation of large-scale distributed object applications, services based applications, and the integration of disparate systems. Before coming to ThoughtWorks she worked as an assistant professor of computer science at the University of Central Florida. She also worked as Director's Post Doctoral Fellow at the Los Alamos National Laboratory researching issues in parallel and distributed computation, genetic algorithms, computational biology and non-linear dynamical systems. She spent her sabbatical from ThoughtWorks working with UNICEF's Innovation Lab in Kampala, Uganda in 2010. Rebecca received a Bachelor of Science degree in Computer Science and Economics from Bradley University, a Masters of Science in Computer Science from Rice University and her Ph.D. in Computer Science from Rice University.

TRANSCRIPT

REBECCA PARSONS

[email protected]://thoughtworks.com

CTO

BigDataData

Wednesday, November 7, 12

REBECCA PARSONS

[email protected]://thoughtworks.com

CTO

The Evolving Panorama of Data

Wednesday, November 7, 12

Changing Nature of Data

Response

How we use data nowWednesday, November 7, 12

Data is: Growing

Wednesday, November 7, 12

Walmart: 1 million transactions per hour

Facebook: 40 billion photos

The Economist: Feb 25th 2010

Data is: Growing

Wednesday, November 7, 12

640K ought to be enough for anybody

Wednesday, November 7, 12

2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012

1,482,824

1,287,537

1,080,872

853,698

616,308

356,191

127,942

40,2238,6401,990442

Monthly Contributors to Wikipedia souce: wikipedia

Data is: Distributed

Wednesday, November 7, 12

Data is: Distributed

Wednesday, November 7, 12

Data is: Distributed

98% of internet access points in Africa are mobile

30 million networked sensor nodes

growing 30% per year

McKinsey Global Institute: Big data: The next frontier for innovation, competition, and productivity

Wednesday, November 7, 12

Data is: Valuable

$300 billion / year for US health care

60% increase in retail margins

McKinsey Global Institute: Big data: The next frontier for innovation, competition, and productivity

Wednesday, November 7, 12

Data is: Urgent

Wednesday, November 7, 12

Data is: Connected

Wednesday, November 7, 12

Changing Nature of Data

Response

How we use data nowWednesday, November 7, 12

"NoSQL"

Wednesday, November 7, 12

Document

Graph

Graph

Key-value

Column-family

Wednesday, November 7, 12

GraphWednesday, November 7, 12

Graph

Polyglot Persistence

Wednesday, November 7, 12

Event Sourcing

Wednesday, November 7, 12

Log

Application State

Wednesday, November 7, 12

Data Sourceswere will be

textimagevideo

connections

Wednesday, November 7, 12

Analyticswill be

pattern recognition

data mining

chasing connections

were

roll-ups

trends

variance

Wednesday, November 7, 12

mapmapmapmap reduce

mapmapmapmap reduce

mapmapmapmap reduce

reduce

per order per month

Wednesday, November 7, 12

Wednesday, November 7, 12

10,000 ft view (literally)

CodeCity by Richard Wettel

http://www.inf.unisi.ch/phd/wettel/codecity.html

Wednesday, November 7, 12

Wednesday, November 7, 12

Changing Nature of Data

Response

How we use data nowWednesday, November 7, 12

Data Scientist Journalist

Wednesday, November 7, 12

Wednesday, November 7, 12

Wednesday, November 7, 12

Data Warehousing

Wednesday, November 7, 12

Wednesday, November 7, 12

http://ureport.ug/Wednesday, November 7, 12

http://ushahidi.com/

Wednesday, November 7, 12

http://libyacrisismap.net/Wednesday, November 7, 12

http://opendata.go.ke/Wednesday, November 7, 12

Wednesday, November 7, 12

http://datawithoutborders.cc/Wednesday, November 7, 12

Wednesday, November 7, 12

Wednesday, November 7, 12

http://unglobalpulse.orgWednesday, November 7, 12

0

100

200

300

400

500

Supply Demand

Deep Analytical Talent in 2018

50% of supply

McKinsey Global Institute: Big data: The next frontier for innovation, competition, and productivityWednesday, November 7, 12

Wednesday, November 7, 12

What about us?

Wednesday, November 7, 12

Wednesday, November 7, 12

Order-Taker Syndrome

Wednesday, November 7, 12

REBECCA PARSONS

[email protected]://thoughtworks.com

CTO

clip art from http://openclipart.org

Wednesday, November 7, 12