infochimps cloudcon 2012

Big Data & Cloud Infinite Monkey Theorem CloudCon Expo & Conference October, 2012

Upload: jim-kaskade

Post on 16-May-2015

Category:

Technology


DESCRIPTION

A keynote on the "Infinite Monkey Theorem"

TRANSCRIPT

Page 1: Infochimps Cloudcon 2012

Big Data & Cloud

Infinite Monkey Theorem

CloudCon Expo & Conference, October 2012

Page 2: Infochimps Cloudcon 2012

04/12/2023 Infochimps Confidential 2

First: What is Big Data?

“data sets so large and complex that it becomes difficult to process using on-hand database management tools.”

Page 3: Infochimps Cloudcon 2012

Source: 2011 IDC Digital Universe Study

2010 = 1.2 Zettabytes/yr

2020 = 35.2 Zettabytes/yr

Data Volume Growing 44x
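
The arithmetic behind these figures can be checked with a short sketch. The two volumes are the ones printed on the slide; the function name `growth_stats` and the 0.8 ZB baseline are assumptions introduced here for illustration. The slide's two data points alone give a factor of about 29x, while a 44x factor matches a smaller starting point of roughly 0.8 ZB, which may be the baseline the headline refers to.

```python
def growth_stats(start_zb: float, end_zb: float, years: int) -> tuple[float, float]:
    """Return (overall growth factor, implied compound annual growth rate)."""
    factor = end_zb / start_zb
    cagr = factor ** (1 / years) - 1
    return factor, cagr


# Figures as printed on the slide (zettabytes per year).
factor, cagr = growth_stats(1.2, 35.2, 2020 - 2010)
print(f"2010 -> 2020: {factor:.1f}x overall, {cagr:.1%} per year")

# ASSUMPTION: a 0.8 ZB starting point (not on the slide) reproduces the 44x headline.
assumed_factor, _ = growth_stats(0.8, 35.2, 11)
print(f"from an assumed 0.8 ZB baseline: {assumed_factor:.0f}x")
```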

Page 4: Infochimps Cloudcon 2012

Enterprise Data Warehouse

[Diagram labels: Request; Parsing Engines; BYNET Interconnect; AMP Node (three shown, ". . . ." for more); Answer ("???")]

10 TB = $400K 3-year TCO

Page 5: Infochimps Cloudcon 2012

Big Data Warehouse

[Diagram labels: Analytic Request (Search, Recommend, Rank, Next-Best-Action, Score); Master: Name Node, Job Tracker; Ethernet Interconnect; Slave: Task Tracker, Data Node (three shown, ". . . ." for more); Semi-Structured Data; Answer]

10 TB = $80K 3-year TCO
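
The two TCO figures (10 TB for $400K on the enterprise data warehouse slide, 10 TB for $80K on this one) can be put on a per-terabyte basis with a small sketch. The helper name `cost_per_tb` is hypothetical; the dollar amounts and capacities are the ones shown on the slides.

```python
def cost_per_tb(total_cost_usd: float, capacity_tb: float) -> float:
    """3-year total cost of ownership divided by usable capacity."""
    return total_cost_usd / capacity_tb


# Slide figures: both systems are quoted at 10 TB over 3 years.
edw_per_tb = cost_per_tb(400_000, 10)        # enterprise data warehouse
big_data_per_tb = cost_per_tb(80_000, 10)    # big data warehouse
ratio = edw_per_tb / big_data_per_tb

print(f"EDW: ${edw_per_tb:,.0f}/TB, big data: ${big_data_per_tb:,.0f}/TB, ratio {ratio:.0f}x")
```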

Page 6: Infochimps Cloudcon 2012

Traditional Operational

Traditional Decision Support

Analytic Appliances

Real Time

Batch

Large Enterprise

Small Enterprise

Application Ecosystem

Deployment in Public/Private Cloud

Toolset Integration

Hardened

Page 7: Infochimps Cloudcon 2012

Next: Infinite Monkey Theorem (2):

an infinite number of monkeys hitting keys on a typewriter for a period of time will almost surely type a given text, such as Shakespeare's Hamlet.
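
To make "almost surely" concrete, here is a minimal sketch of the underlying probability. It assumes every key is equally likely, keystrokes are independent, and each attempt is a separate block of keystrokes as long as the target text. The function name, the 26-key typewriter, and the six-letter target are illustrative assumptions, not part of the talk. Under those assumptions the chance of at least one match in n attempts is 1 - (1 - m^-k)^n, which approaches 1 as n grows.

```python
import math


def prob_at_least_one_hit(keys: int, text_len: int, attempts: int) -> float:
    """Chance that at least one of `attempts` independent random strings of
    length `text_len`, typed on `keys` equally likely keys, equals the target."""
    # Per-attempt success probability m^-k; exp() underflows to 0.0 for long texts.
    q = math.exp(-text_len * math.log(keys))
    if q >= 1.0:
        return 1.0 if attempts > 0 else 0.0
    # 1 - (1 - q)^n, computed with log1p/expm1 so tiny q is not rounded away.
    return -math.expm1(attempts * math.log1p(-q))


# Hypothetical example: the six letters of "hamlet" on a 26-key typewriter.
for attempts in (10**6, 10**8, 10**9, 10**10):
    p = prob_at_least_one_hit(keys=26, text_len=6, attempts=attempts)
    print(f"{attempts:>14,} attempts -> probability {p:.4f}")
```

The same function returns a probability indistinguishable from zero for a text the length of a full play at any realistic number of attempts, which is why the theorem needs an unbounded supply of monkeys or time.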

Page 8: Infochimps Cloudcon 2012

“unexperienced and unobservable” based on

“real experiences and real observations”

Page 9: Infochimps Cloudcon 2012

Infinite Monkey Theorem (2):

an infinite number of monkeys hitting keys on a typewriter for a period of time will almost surely type a given text, such as Shakespeare's Hamlet.

Page 10: Infochimps Cloudcon 2012

infinite number of monkeys → unlimited computational power

keys on a typewriter → processing data

almost surely → statistically significant

Shakespeare's Hamlet → insights

Page 11: Infochimps Cloudcon 2012

#thisischimpy

Page 12: Infochimps Cloudcon 2012

“Little Data For Business Users”

Problem

Page 13: Infochimps Cloudcon 2012

[Diagram labels: Business Analysts; Application Developers; Executive; Data Scientist; IT Staff; Finance, DBAs, etc.; Data; several "?" marks]

Page 14: Infochimps Cloudcon 2012

#thisreallysucks

Page 15: Infochimps Cloudcon 2012

“Big Data For Business Users”

Page 16: Infochimps Cloudcon 2012

[Diagram labels: Executive; Data; "?"; several "$" marks]

Reduce Friction

Page 17: Infochimps Cloudcon 2012

#thisisreallygood

Page 18: Infochimps Cloudcon 2012

unlimited computational power

Public

Private

Virtual Private

Page 19: Infochimps Cloudcon 2012

Analysts use these images to count shipping containers coming off ships in California and get a sense of overall US import activity.

Page 20: Infochimps Cloudcon 2012

data processing

Public

Private

Virtual Private

Page 21: Infochimps Cloudcon 2012

Walmart

Page 22: Infochimps Cloudcon 2012

Target

Page 23: Infochimps Cloudcon 2012

Images

Docs, Text

Web Logs

Social

Sensors

GPS

Business Transactions & Interactions

Business Intelligence & Analytics

SQL, NoSQL, NewSQL

EDW, MPP, NewSQL

Dashboards, Reports, Visualization…

Web, Mobile, CRM, ERP, SCM…

Page 24: Infochimps Cloudcon 2012

statistically significant

Public

Private

Virtual Private

Page 25: Infochimps Cloudcon 2012

#lotsofdata + #simplealgorithms

Page 26: Infochimps Cloudcon 2012

Cars In Lot

News Text

Web Pricing

Social Sentiment

Weather Sensors

Local Employment

Quarterly Revenue Prediction

Page 27: Infochimps Cloudcon 2012

insights

Public

Private

Virtual Private

Page 28: Infochimps Cloudcon 2012

Gnip PowerTrack

Gnip EDC

Moreover Metabase

TV Transcription

Radio Transcription

Print Transcription

In-Motion Data Delivery Service

NoSQL

Listening Application

New Media

Traditional Media

APIs

Sources

Sentiment

Business Users

App Developer

Data Scientist

IT Staff

Page 29: Infochimps Cloudcon 2012

unlimited computational power

processing data

statistically significant

insights

Page 30: Infochimps Cloudcon 2012

#1BigDataCloudService

Page 31: Infochimps Cloudcon 2012

#inspiredbyAvinashKaushik