Directions for Hadoop Innovation, Yahoo

Directions for Hadoop Innovation Apr 2013 Eric Bax

Upload: innovation-enterprise

Post on 01-Jul-2015


DESCRIPTION

BD Hadoop SF 2013

TRANSCRIPT

Page 1: Directions for Hadoop Innovation, Yahoo

Directions for Hadoop Innovation

Apr 2013

Eric Bax

Page 2: Directions for Hadoop Innovation, Yahoo

Hadoop in Online Advertising at Yahoo!

Response Prediction – Clicks and Conversions

Allocation and Pricing – Guaranteed

Analytics – Marketplace Monitoring

Science – Value of Advertising


Page 3: Directions for Hadoop Innovation, Yahoo

Marketplace Operations


Model Construction

Auction

Reconciliation

Analytics and Billing

Ad Calls

Auction Log Ad Served

Clicks and Conversions

Response Frequencies

Predict Model

Ad + Response

ROI Evaluation

Online/Offline Sales $

Page 4: Directions for Hadoop Innovation, Yahoo

Desiderata


Faster Answers

Fewer Computations per Datum

From Analytics to Active Monitoring

From Batch Cycles to Sense and Respond

Page 5: Directions for Hadoop Innovation, Yahoo

Faster Turnaround

9/18/2013

Act on the 80% of Data That Arrives Quickly

Then Correct as Late-Landing Data Arrive

Pull for Initial Result; Push for Updates?
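One way to read this slide: serve a provisional aggregate from the records that have already landed, then fold late arrivals in as corrections and notify whoever is listening. The following is a minimal sketch under that reading, not a description of any Yahoo system; `ProvisionalCounter`, its `pull` and `push` methods, and the sample hour key and counts are all hypothetical.

```python
from collections import defaultdict


class ProvisionalCounter:
    """Per-key running totals that can be read early and corrected later."""

    def __init__(self):
        self.totals = defaultdict(int)

    def pull(self, key):
        # Initial result: whatever has landed so far for this key.
        return self.totals[key]

    def push(self, key, count, subscribers=()):
        # New or late-landing data: update the total, then push the
        # corrected value to any subscribers.
        self.totals[key] += count
        for callback in subscribers:
            callback(key, self.totals[key])
        return self.totals[key]


counter = ProvisionalCounter()
updates = []

# Early data: most of the hour's clicks arrive promptly (illustrative numbers).
counter.push("2013-04-01T10", 80)
provisional = counter.pull("2013-04-01T10")

# Late data: the remainder lands afterwards and corrects the total.
corrected = counter.push(
    "2013-04-01T10", 20, subscribers=[lambda k, v: updates.append((k, v))]
)
```

The consumer acts on `provisional` immediately and receives the corrected total through the callback, which mirrors the pull-then-push question the slide raises.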

Page 6: Directions for Hadoop Innovation, Yahoo

Online Updates to Models


Each day produces Big Data.

Whole history: HUMONGOUS DATA.

Update models based on new data only.

And perhaps exceptions / borderline cases from history.
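Updating from new data only can be illustrated with an online learner. The sketch below assumes a logistic-regression response model trained by stochastic gradient steps; the slide does not name a model, so that choice, the `update` function, the learning rate, and the toy batches are assumptions for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def update(weights, batch, learning_rate=0.1):
    """Apply one pass of gradient steps over new examples only.

    Each example is (features, label) with label 1 for a response
    (for instance a click) and 0 otherwise. Earlier data is never revisited.
    """
    w = list(weights)
    for features, label in batch:
        predicted = sigmoid(sum(wi * xi for wi, xi in zip(w, features)))
        error = label - predicted
        for i, xi in enumerate(features):
            w[i] += learning_rate * error * xi
    return w


weights = [0.0, 0.0]  # [bias, single feature]; toy dimensions

# Each day's batch is folded into the existing weights, then discarded.
day_1 = [([1.0, 1.0], 1), ([1.0, 0.0], 0)]
day_2 = [([1.0, 1.0], 1), ([1.0, 0.0], 0)]

weights = update(weights, day_1)
weights = update(weights, day_2)

score_with_feature = sigmoid(weights[0] + weights[1])
score_without_feature = sigmoid(weights[0])
```

Exceptions or borderline cases from history, as the slide suggests, could be appended to a day's batch before calling `update`.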

Page 7: Directions for Hadoop Innovation, Yahoo

“Embedded” Computation


Move Computation Closer to Where Data are Generated

Monitor for Anomalies Where they Occur

(Sometimes) Compress into Sketches before Transmitting Data

Hadoop as Part of Serving vs Isolated Clusters?
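As an illustration of compressing into a sketch before transmitting, here is a small count-min sketch: each ingest point keeps a fixed-size table of counters, and tables from different points can be added together centrally. The slide does not say which kind of sketch is meant, so the count-min choice, the class and method names, the table dimensions, and the ad identifiers are assumptions.

```python
import hashlib


class CountMinSketch:
    """Fixed-size approximate counter; estimates never undercount."""

    def __init__(self, width=256, depth=4):
        self.width = width
        self.depth = depth
        self.rows = [[0] * width for _ in range(depth)]

    def _index(self, item, row):
        # One hash per row, derived by salting the item with the row number.
        digest = hashlib.sha256(f"{row}:{item}".encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") % self.width

    def add(self, item, count=1):
        for row in range(self.depth):
            self.rows[row][self._index(item, row)] += count

    def estimate(self, item):
        # Collisions can only inflate a cell, so the minimum is the best bound.
        return min(
            self.rows[row][self._index(item, row)] for row in range(self.depth)
        )

    def merge(self, other):
        if (self.width, self.depth) != (other.width, other.depth):
            raise ValueError("sketches must have the same dimensions")
        for row in range(self.depth):
            for col in range(self.width):
                self.rows[row][col] += other.rows[row][col]


# Two ingest points each summarize their own traffic locally.
server_a = CountMinSketch()
server_b = CountMinSketch()
for _ in range(3):
    server_a.add("ad-42")
for _ in range(2):
    server_b.add("ad-42")
server_b.add("ad-7")

# Only the fixed-size tables are transmitted and combined.
central = CountMinSketch()
central.merge(server_a)
central.merge(server_b)
estimate = central.estimate("ad-42")
```

The transmitted size depends on `width` and `depth` rather than on the number of events, at the cost of possible overestimates when items collide.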

Page 8: Directions for Hadoop Innovation, Yahoo

Localized / Contextual Computation

Propagate Data Among Logical Neighbors Quickly

Multi-Resolution Approach at Different Time Scales

Challenge: Clustering into Logical Neighborhoods to Fit Problem
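The multi-resolution idea can be sketched as one event stream rolled up into buckets at several time scales, so that fine detail is available locally and coarse aggregates are available for wider sharing. This is only an illustration: the three resolutions, the `MultiResolutionCounts` class, and the integer-second timestamps are assumptions, and the sketch does not address the clustering challenge the slide names.

```python
from collections import Counter

# Bucket widths in seconds; the choice of scales is illustrative.
RESOLUTIONS = {"minute": 60, "hour": 3600, "day": 86400}


def bucket(timestamp, seconds):
    # Start of the bucket that contains this timestamp.
    return timestamp - (timestamp % seconds)


class MultiResolutionCounts:
    """Event counts maintained at every resolution simultaneously."""

    def __init__(self):
        self.counts = {name: Counter() for name in RESOLUTIONS}

    def add(self, timestamp, count=1):
        for name, seconds in RESOLUTIONS.items():
            self.counts[name][bucket(timestamp, seconds)] += count

    def get(self, resolution, timestamp):
        seconds = RESOLUTIONS[resolution]
        return self.counts[resolution][bucket(timestamp, seconds)]


events = MultiResolutionCounts()
for ts in (0, 30, 90, 3700):  # seconds since an arbitrary start
    events.add(ts)

first_minute = events.get("minute", 0)
first_hour = events.get("hour", 0)
first_day = events.get("day", 0)
```

A neighbor that needs detail would read the minute buckets, while a global consumer could be sent only the hour or day totals.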

Page 9: Directions for Hadoop Innovation, Yahoo

Search Clusters


Page 10: Directions for Hadoop Innovation, Yahoo

Who Clicks?


Page 11: Directions for Hadoop Innovation, Yahoo

Who Doesn’t?


Page 12: Directions for Hadoop Innovation, Yahoo

Hadoop in Five Years


Will Hadoop grow by adding features / options?

Will it branch: faster, lighter, approximate, embedded versions?

Truly huge version? With approximation / sampling / multi-resolution?
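To make the approximation and sampling option concrete, the sketch below uses reservoir sampling (Algorithm R) to keep a fixed-size uniform sample of a stream whose length is not known in advance, then estimates a rate from the sample. The slide does not specify a sampling method; this technique, the function name, and the synthetic stream are assumptions.

```python
import random


def reservoir_sample(stream, k, rng=None):
    """Return a uniform random sample of k items from a single pass."""
    rng = rng or random.Random()
    sample = []
    for n, item in enumerate(stream, start=1):
        if n <= k:
            # Fill the reservoir with the first k items.
            sample.append(item)
        else:
            # Keep the n-th item with probability k / n.
            j = rng.randrange(n)
            if j < k:
                sample[j] = item
    return sample


# Synthetic stream: 1 marks a response, 0 marks none; one in ten responds.
stream = (1 if i % 10 == 0 else 0 for i in range(10000))

sample = reservoir_sample(stream, 1000, random.Random(0))
estimated_rate = sum(sample) / len(sample)
```

Memory use is bounded by `k` regardless of stream length, and the estimate tightens as `k` grows.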

Page 13: Directions for Hadoop Innovation, Yahoo

The Right Fit


Multi-resolution sense and respond.

Details to neighbors, sketches and aggregates globally.

Migrate processes and storage to ingest points or logical neighbors.

Tune system-wide performance through human-machine dialog.

Page 14: Directions for Hadoop Innovation, Yahoo

Thank You


Eric Bax

[email protected]