benchmarking interactive social networking actions

30
Benchmarking Interactive Benchmarking Interactive Social Networking Actions Social Networking Actions Shahram Ghandeharizadeh Shahram Ghandeharizadeh Director of Database Lab Director of Database Lab Computer Science Department Computer Science Department University of Southern California University of Southern California

Upload: gates

Post on 14-Jan-2016

45 views

Category:

Documents


0 download

DESCRIPTION

Benchmarking Interactive Social Networking Actions. Shahram Ghandeharizadeh Director of Database Lab Computer Science Department University of Southern California. Outline. Motivation Research questions Survey use cases BG Benchmark FORSEE Future research. Motivation. Data Stores - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Benchmarking Interactive Social Networking Actions

Benchmarking Interactive Social Benchmarking Interactive Social Networking ActionsNetworking Actions

Shahram GhandeharizadehShahram GhandeharizadehDirector of Database LabDirector of Database LabComputer Science DepartmentComputer Science DepartmentUniversity of Southern CaliforniaUniversity of Southern California

Page 2: Benchmarking Interactive Social Networking Actions

Outline

Motivation Research questions

Survey use cases BG Benchmark FORSEE Future research

Page 3: Benchmarking Interactive Social Networking Actions

Motivation Data Stores

Cloud Services

Person-to-person cloud services

Page 4: Benchmarking Interactive Social Networking Actions

Research Questions

What is the tradeoff between alternative data models? E.g., Is JSON superior to the relational

data model?

How do alternative architectures compare with one another? E.g., Is cache augmented SQL as good as

a document/extensible store?

Do NewSQL data stores scale as well as NoSQL data stores?

Page 5: Benchmarking Interactive Social Networking Actions

Survey Use Case

S. Barahmand and S. Ghandeharizadeh. BG: A Benchmark to Evaluate Interactive Social Networking Actions. CIDR ‘13, Asilomar, CA.

Page 6: Benchmarking Interactive Social Networking Actions

Data Model

Accounts

Friend

MembersPages

Follow

Resources Own

Share

Share

News Feed Displays Ownd

Page 7: Benchmarking Interactive Social Networking Actions

BG Architecture

Scalable

Emulates User Behavior

Service Level AgreementQuick and Efficient Rating

Visualization Tool

S. Barahmand and S. Ghandeharizadeh. Expedited Benchmarking of Social Networking Actions with Agile Data Loading Techniques. CIKM ‘13, SF, CA.

Page 8: Benchmarking Interactive Social Networking Actions

http://bgbenchmark.org

Page 9: Benchmarking Interactive Social Networking Actions

Good Benchmark = FORSEE

Focus on an important debate & provide relevant metrics to facilitate progress.

One number to describe alternative designs/solution.

Runs in a reasonable amount of time.

Scalable.

Effective abstraction with meaningful requests.

Extendible.

Page 10: Benchmarking Interactive Social Networking Actions

Good Benchmark = FORSEE

F

One number to describe alternative designs/solution.

Runs in a reasonable amount of time.

Scalable.

Effective abstraction with meaningful requests.

Extendible.

+ Unpredictable data

Page 11: Benchmarking Interactive Social Networking Actions

Good Benchmark = FORSEE

F

O

Runs in a reasonable amount of time.

Scalable.

Effective abstraction with meaningful requests.

Extendible.

+ Unpredictable data

SoAR

Page 12: Benchmarking Interactive Social Networking Actions

Good Benchmark = FORSEE

F

O

R

Scalable.

Effective abstraction with meaningful requests.

Extendible.

+ Unpredictable data

SoAR

4 months to rate =1 Week to rate =

Page 13: Benchmarking Interactive Social Networking Actions

Good Benchmark = FORSEE

F

O

R

S

Effective abstraction with meaningful requests.

Extendible.

+ Unpredictable data

SoAR

4 months to rate =1 Week to rate =

Page 14: Benchmarking Interactive Social Networking Actions

Good Benchmark = FORSEE

F

O

R

S

E

Extendible.

+ Unpredictable data

SoAR

4 months to rate =1 Week to rate =

Only when two members are NOT friends!

Page 15: Benchmarking Interactive Social Networking Actions

Good Benchmark = FORSEE

F

O

R

S

E

E

+ Unpredictable data

SoAR

4 months to rate =1 Week to rate =

Only when two members are NOT friends!

FORSEE = PREDICT

Page 16: Benchmarking Interactive Social Networking Actions

Good Benchmark = FORSEE

F

O

R

S

E

E

+ Unpredictable data

SoAR

4 months to rate =1 Week to rate =

Only when two members are NOT friends!

A good benchmark helps settle debates

quickly to enable its discipline to make rapid progress.

Page 17: Benchmarking Interactive Social Networking Actions

Future Research: Data Sciences

Challenge: Wide variety of science applications with diverse debates.

Hypothesis: A benchmark generator.

BenchmarkGenerator

ER diagram

Actions & their dependencies

Key Metrics

Application (data science)

SpecificBenchmark

Page 18: Benchmarking Interactive Social Networking Actions

Future Reseach Evaluate the hypothesis using BG.

Extend to other data science applications.

BenchmarkGenerator

Unpredictable data

Page 19: Benchmarking Interactive Social Networking Actions
Page 20: Benchmarking Interactive Social Networking Actions
Page 21: Benchmarking Interactive Social Networking Actions

Big Data: Operations

Simple Complex

Off-line

Interactive

Ad-hoc

Pre-specified

Page 22: Benchmarking Interactive Social Networking Actions

Big Data: Google Analytics

Simple Complex

Off-line

Interactive

Ad-hoc

Pre-specified

1. Gather click stream data: Optimized for writes, 2. Compute aggregated data: MapReduce/Hadoop

Objective:1. Advertising ROI2. Frequency of access to pages

Page 23: Benchmarking Interactive Social Networking Actions

Big Data: Google Analytics

Simple Complex

Off-line

Interactive

Ad-hoc

Pre-specified

1. Gather click stream data: Optimized for writes, 2. Compute aggregated data: MapReduce/Hadoop3. Enable users to view aggregated data.

Objective:1. Advertising ROI2. Frequency of access to pages

Page 24: Benchmarking Interactive Social Networking Actions

Big Data: Facebook

Show profile page of Farah FawcettFollow Barak ObamaFriend Lady Gaga

Simple Complex

Off-line

Interactive

Ad-hoc

Pre-specified

Page 25: Benchmarking Interactive Social Networking Actions

3 Vs: Facebook High Volume:

1.2 billion user profiles, 150 billion friend connections, 1.13 trillion likes, 17 billion tagged locations, 240 billion photos, ….

High Velocity: 700 million active users daily, 4.5 billion likes daily, 350

million photos uploaded daily, …

High Variety: Mix of data types: Structured records, multimedia content,

text.

Source: http://expandedramblings.com/index.php/by-the-numbers-17-amazing-facebook-stats/ posted on Oct 6, 2013.

Page 26: Benchmarking Interactive Social Networking Actions

Expertise/Contributions

BG Benchmark to evaluate performance of alternative data stores: SQL, NoSQL, NewSQL.

http://bgbenchmark.org

A high performance CASQL solution that minimizes software development life cycle.

KOSAR, a prototype of a CASQL solution.

Simple ComplexOff-line

Interactive

Ad-hoc

Pre-specified

Page 27: Benchmarking Interactive Social Networking Actions

BG, http://bgbenchmark.org

Joint work with Sumita Barahmand Benchmark for interactive social

networking actions. Consists of 11 actions:

Page 28: Benchmarking Interactive Social Networking Actions

CASQL

Joint work with Jason Yap. Key insight: Query result look up is

faster than query processing. Contribution is physical data

independence in CASQL systems: Transparent caching Serial schedules Detection of race conditions and

prevention of inconsistent states.

Page 29: Benchmarking Interactive Social Networking Actions

KOSAR

Joint work with Reihane Boghrati, Lakshmy Mohanan and Neeraj Narang.

A software prototype of CASQL Scalable Highly available Elastic

Boosts performance of a leading industrial strength RDBMS vendor from 2 actions per second to more than 300,000 actions per second.

Page 30: Benchmarking Interactive Social Networking Actions

BG Coordinator

Delta Analyzer

BGClient 2 BGClient NBGClient 1

Experiment

Load

Agile Data Loading Techniques

Experiment

Data Store Server