
Drizzle: Fast and Adaptable Stream Processing at Scale

Shivaram Venkataraman, Aurojit Panda, Kay Ousterhout, Michael Armbrust, Ali Ghodsi, Michael Franklin, Benjamin Recht, Ion Stoica

STREAMING WORKLOADS

Streaming Trends: Low Latency

Results increasingly power decisions made by machines:
- Credit card fraud -> disable the stolen account
- Suspicious user login -> ask security questions
- Slow video load -> direct the user to a new CDN

Applications detect these conditions and dynamically adjust their behavior.

Streaming Requirements: High Throughput

Workloads can reach tens of millions of updates per second, so a distributed system is needed.

Distributed Execution Models

Execution model: Continuous Operators

Example: group records by user and run anomaly detection. Each operator keeps mutable local state and produces low-latency output.

Systems: Google MillWheel, Naiad, streaming databases (Borealis, Flux, etc.)

Execution model: Micro-batch

Example: group records by user and run anomaly detection. Computation is split into batches of tasks; tasks output their state on completion, so output is produced at task granularity.

Dynamic task scheduling gives adaptability: straggler mitigation, elasticity, and fault tolerance.

Systems: Microsoft Dryad, Google FlumeJava
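The micro-batch model above can be sketched as a simple driver loop: for every batch, a central scheduler partitions the input, launches a fresh set of tasks, and gathers each task's output state on completion. All names here (run_task, run_micro_batch) are illustrative, not from any real system's API.

```python
# Illustrative sketch of the micro-batch execution model: a central
# scheduler launches a fresh set of tasks for every batch, and tasks
# emit their state only on completion (output at task granularity).
from concurrent.futures import ThreadPoolExecutor

def run_task(partition):
    # e.g. "group by user, run anomaly detection" on one partition
    state = {}
    for user, value in partition:
        state[user] = state.get(user, 0) + value
    return state  # task outputs its state on completion

def run_micro_batch(batch, n_workers=4):
    # the scheduler is re-invoked for every batch: it partitions the
    # input, assigns tasks to workers, and gathers their outputs
    partitions = [batch[i::n_workers] for i in range(n_workers)]
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        outputs = list(pool.map(run_task, partitions))
    merged = {}
    for state in outputs:
        for user, value in state.items():
            merged[user] = merged.get(user, 0) + value
    return merged

events = [("alice", 1), ("bob", 2), ("alice", 3)]
print(run_micro_batch(events))  # {'alice': 4, 'bob': 2}
```

The per-batch scheduler invocation in this loop is exactly the overhead Drizzle targets.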

Failure Recovery

Failure recovery: Continuous Operators

Asynchronous checkpoints (Chandy-Lamport). On failure, all machines roll back and replay from the checkpointed state.

Failure recovery: Micro-batch

Task output is periodically checkpointed, and task boundaries capture task interactions. On failure, only tasks from the failed machine are replayed, and replay is parallelized across the cluster.
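The parallel-replay idea can be sketched in a few lines: because task outputs are checkpointed at task boundaries, only the failed machine's tasks need re-running, and they can be spread across all survivors. The data structure and function names here are hypothetical.

```python
# Illustrative sketch of parallel recovery in the micro-batch model:
# only the failed machine's tasks are replayed, redistributed
# round-robin across the surviving machines so replay runs in parallel.
def recover(task_assignment, failed_machine):
    # task_assignment: machine -> list of task ids it was running
    lost = task_assignment.pop(failed_machine)
    survivors = sorted(task_assignment)
    for i, task in enumerate(lost):
        task_assignment[survivors[i % len(survivors)]].append(task)
    return task_assignment

assignment = {"m1": ["t1", "t2"], "m2": ["t3"], "m3": ["t4"]}
print(recover(assignment, "m1"))
# {'m2': ['t3', 't1'], 'm3': ['t4', 't2']}
```

Contrast this with the continuous-operator model above, where every machine must roll back and replay from the checkpoint.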

Execution Models Compared

Continuous operators: static scheduling (inflexible, slow failover) but low latency (fine-grained processing).
Micro-batch: dynamic scheduling (adaptable: parallel recovery, straggler mitigation) but higher latency (coarse-grained processing and scheduling granularity).
Drizzle: low latency (fine-grained processing) with dynamic scheduling (coarse granularity).

Inside the Scheduler

For each micro-batch the scheduler must:
(1) Decide how to assign tasks to machines (data locality, fair sharing)
(2) Serialize and send the tasks

Scheduling Overheads

Cluster: 4-core r3.xlarge machines. Workload: sum of 10k numbers per core.

[Chart: median task-time breakdown (ms) for 4-128 machines, split into Compute + Data Transfer, Task Fetch, and Scheduler Delay]

Key idea: reuse scheduling decisions across micro-batches!

Drizzle

(1) Pre-schedule reduce tasks
(2) Group-schedule micro-batches

Goal: remove frequent scheduler interaction.

… … …

(1) Pre-schedule reduce tasks

Goal: Remove scheduler involvement for reduce

tasks

… … …

(1) Pre-schedule reduce tasks

? Goal: Remove scheduler involvement for reduce

tasks

Coordinating Shuffles: Existing Systems

Metadata describes the shuffle data location; data is fetched from remote machines.

Coordinating Shuffles: Pre-scheduling

(1) Pre-schedule reducers
(2) Mappers get metadata
(3) Mappers trigger reducers
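The three steps above can be sketched as follows: reducers are launched before the map stage runs, mappers receive the reducer locations as metadata, and each mapper pushes its output to (and thereby triggers) the reducers directly, with no scheduler round trip between stages. All class and function names here are hypothetical, not Drizzle's actual API.

```python
# Illustrative sketch of pre-scheduled shuffles: the last mapper push
# triggers each reducer, so the scheduler never mediates the shuffle.
class Reducer:
    def __init__(self, expected_mappers):
        self.expected = expected_mappers
        self.received = []

    def push(self, partition):          # called by mappers, not the scheduler
        self.received.append(partition)
        if len(self.received) == self.expected:
            return self.run()           # last push triggers the reduce
        return None

    def run(self):
        return sum(self.received)

def map_task(data, reducers):
    # the mapper partitions its data by reducer and pushes each part
    results = []
    for r, part in zip(reducers, data):
        out = r.push(part)
        if out is not None:
            results.append(out)
    return results

# (1) pre-schedule two reducers; (2) hand their handles to the mappers
reducers = [Reducer(expected_mappers=2) for _ in range(2)]
map_task([1, 2], reducers)              # first mapper: reducers keep waiting
print(map_task([10, 20], reducers))     # second mapper triggers both
# [11, 22]
```

The design choice: inverting the usual pull-based shuffle (reducers fetching via scheduler metadata) into a push that the mappers drive themselves.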

(2) Group-schedule micro-batches

Goal: defer returning to the scheduler until a group boundary.

Group Scheduling

Schedule a group of micro-batches at once (e.g., groups of 2). Fault tolerance and scheduling decisions happen at group boundaries.
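Group scheduling amounts to amortizing one scheduling decision over many micro-batches. A minimal sketch, with hypothetical schedule/execute callbacks standing in for the real scheduler and workers:

```python
# Illustrative sketch of group scheduling: instead of contacting the
# scheduler once per micro-batch, schedule a whole group at once and
# only return to the scheduler (for fault tolerance / re-planning)
# at group boundaries.
def run_stream(batches, group_size, schedule, execute):
    scheduler_invocations = 0
    for start in range(0, len(batches), group_size):
        group = batches[start:start + group_size]
        plan = schedule(group)          # one scheduling decision per group
        scheduler_invocations += 1
        for batch in group:             # workers run the group unattended
            execute(plan, batch)
    return scheduler_invocations

# with 100 batches and groups of 10, the scheduler is invoked 10 times
n = run_stream(list(range(100)), group_size=10,
               schedule=lambda g: "plan", execute=lambda p, b: None)
print(n)  # 10
```

Larger groups cut scheduler overhead but delay adaptation, since failures and stragglers are only handled at group boundaries; this is the trade-off the group-size tuning below addresses.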

[Chart: time per iteration (ms) for 4-128 machines: Baseline, Only Pre-Scheduling, Drizzle-10, Drizzle-100]

Micro-benchmark: 2 stages, 100 iterations; breakdown of the benefits of pre-scheduling and group-scheduling.

In the paper: group-size auto-tuning.
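The paper describes an adaptive algorithm for choosing the group size; the sketch below is only an illustrative AIMD-style (additive-increase, multiplicative-decrease) approximation of that idea, with made-up thresholds, not the exact algorithm from the paper.

```python
# Illustrative AIMD-style group-size tuning: grow the group additively
# while scheduling overhead is a large fraction of group runtime, and
# shrink it multiplicatively once the overhead becomes negligible
# (restoring adaptability). Thresholds are illustrative assumptions.
def tune_group_size(size, sched_overhead, group_runtime,
                    hi=0.2, lo=0.05):
    ratio = sched_overhead / group_runtime
    if ratio > hi:          # overhead too high: amortize over more batches
        return size + 2
    if ratio < lo:          # overhead negligible: shrink for adaptability
        return max(1, size // 2)
    return size

size = tune_group_size(4, sched_overhead=50, group_runtime=100)
print(size)  # 6 (overhead ratio 0.5 exceeds the high threshold)
```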

Evaluation

Recap: continuous operators give low latency with static scheduling; micro-batch gives dynamic scheduling (coarse granularity) at higher latency (coarse-grained processing); Drizzle targets low latency (fine-grained processing) with dynamic scheduling (coarse granularity).

Questions:
1. Latency?
2. Adaptability?

Evaluation: Latency

Yahoo! Streaming Benchmark. Input: JSON ad-click events. Compute: number of clicks per campaign. Window: update every 10 s. Comparing Spark 2.0, Flink 1.1.1, and Drizzle on 128 Amazon EC2 r3.xlarge instances.

[Chart: CDF of event latency (0-3000 ms) for Spark, Flink, and Drizzle]

Streaming Benchmark: Performance

Yahoo Streaming Benchmark: 20M JSON ad-events per second, 128 machines. Event latency: the difference between window end and processing end.

[Chart: event latency (ms) over time (190-290 s) for Spark, Flink, and Drizzle]

Adaptability: Fault Tolerance

Inject a machine failure at 240 seconds. Yahoo Streaming Benchmark: 20M JSON ad-events per second, 128 machines.

[Chart: latency (ms) over time around the failure for Spark, Flink, and Drizzle]

Recap: continuous operators offer static scheduling and low latency; micro-batch offers dynamic scheduling at higher latency; Drizzle offers low latency (fine-grained processing) with dynamic scheduling (coarse granularity).

Optimization of Batches

Optimize the execution of each micro-batch by pushing down aggregation.
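Pushing aggregation down means each map task partially aggregates (combines) its own records before the shuffle, so far less data crosses the network and the final reduce only merges small partial results. A minimal sketch using the benchmark's clicks-per-campaign computation; function names are illustrative:

```python
# Illustrative sketch of push-down (partial) aggregation inside a
# micro-batch: combine on the mapper before the shuffle, merge the
# small partial counts on the reducer afterwards.
from collections import Counter

def partial_aggregate(partition):
    # runs on the mapper, before the shuffle
    return Counter(campaign for campaign, _click in partition)

def final_merge(partials):
    # runs on the reducer, after the shuffle
    total = Counter()
    for p in partials:
        total += p
    return total

p1 = [("ads-1", "click"), ("ads-2", "click"), ("ads-1", "click")]
p2 = [("ads-1", "click")]
print(final_merge([partial_aggregate(p1), partial_aggregate(p2)]))
# Counter({'ads-1': 3, 'ads-2': 1})
```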

Intra-Batch Query Optimization

Yahoo Streaming Benchmark: 20M JSON ad-events per second, 128 machines.

[Chart: CDF of event latency (0-3000 ms) for Spark, Flink, Drizzle, and Drizzle-Optimized]

Evaluation Summary

End-to-end latency, fault tolerance, query optimization, and synthetic micro-benchmarks; throughput, elasticity, and group-size tuning (Yahoo Streaming Benchmark). Other workloads: video analytics; iterative ML algorithms (Shivaram's thesis).

Conclusion

Continuous operators: static scheduling, low latency. Micro-batch: dynamic scheduling (coarse granularity), higher latency (coarse-grained processing). Drizzle: low latency (fine-grained processing), dynamic scheduling (coarse granularity), and optimization of batches.

Source code: https://github.com/amplab/drizzle-spark

Shivaram is answering questions on sli.do
