Brief Announcement: Practical Summation via Gossip

Wesley W. Terpstra, Christof Leng, Alejandro P. Buchmann
Databases and Distributed Systems Group
Technische Universität Darmstadt, Germany
www.dvs1.informatik.tu-darmstadt.de

Post on 05-Jan-2016

TRANSCRIPT

Page 1: Brief Announcement: Practical Summation via Gossip

www.dvs1.informatik.tu-darmstadt.de

Brief Announcement: Practical Summation via Gossip

Wesley W. Terpstra, Christof Leng, Alejandro P. Buchmann

Databases and Distributed Systems Group

Technische Universität Darmstadt

Germany

Page 2: Brief Announcement: Practical Summation via Gossip

Sum calculation in peer-to-peer

Input: every peer $p$ has a value $x_p$

Output: (at least) one peer knows $\sum_{p \in P} x_p$

Useful in computing many global statistics:
- Network size
- Average utilization
- Load balance (standard deviation)
- Churn (rate of peer replacement)
- Size of stored data

For our system, BubbleStorm, we compute $\deg^i(p)$
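Several of the statistics listed above reduce to a few plain sums. The sketch below shows the arithmetic only, assuming the three sums (of 1, of the values, and of the squared values) have already been obtained by some summation protocol; the function name `stats_from_sums` and the example loads are hypothetical.

```python
import math

def stats_from_sums(s0, s1, s2):
    """Derive global statistics from three sums taken over all peers.

    s0 = sum of 1 (network size), s1 = sum of x_p, s2 = sum of x_p**2.
    """
    n = s0
    mean = s1 / s0
    # Clamp at zero to absorb small negative values from rounding.
    variance = max(s2 / s0 - mean * mean, 0.0)
    return n, mean, math.sqrt(variance)

# Hypothetical per-peer loads; here the sums are computed centrally
# purely to illustrate what the gossip protocol would deliver.
loads = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
s0 = float(len(loads))
s1 = sum(loads)
s2 = sum(x * x for x in loads)

n, mean, std = stats_from_sums(s0, s1, s2)
print(n, mean, std)
```

Each additional statistic costs one more value carried per gossip message, not a separate protocol run.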

Page 3: Brief Announcement: Practical Summation via Gossip

Build on an existing solution

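Approaches can be compared by
- Message rounds (latency)
- Total messages (bandwidth)

Parameters: system size ($n$), accuracy ($\varepsilon$)

We improve the Push-Sum algorithm for practical use

Algorithm                Rounds                                      Messages
Push-Sum (2003, FOCS)    $\log n + \log\frac{1}{\varepsilon}$        $n\left(\log n + \log\frac{1}{\varepsilon}\right)$
Sample&Collide (2006)    $\log n + \frac{1}{\varepsilon}\sqrt{n}$    $\frac{1}{\varepsilon}\sqrt{n}\,\log n$
Random Tour (2006)       $n + \frac{1}{\varepsilon^2}$               $\frac{1}{\varepsilon^2}\,n$
Comp&Spread (2006)       $\frac{1}{\varepsilon^2}\log^2 n$           $\frac{1}{\varepsilon^2}\,n\log^2 n$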

Page 4: Brief Announcement: Practical Summation via Gossip

Analogy: Measuring a lake’s volume

Page 5: Brief Announcement: Practical Summation via Gossip

Push-Sum visualized
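A minimal sketch of one way to read the lake picture in code, assuming a fully connected network and synchronous rounds: each peer's value is its water, exactly one peer starts with a single fish, and in every round each peer keeps half of its water and fish and pushes the other half to a uniformly random peer. The water/fish ratio at any peer then serves as its estimate of the sum. The name `push_sum`, the seed, and the example values are illustrative assumptions, not taken from the slides.

```python
import random

def push_sum(values, rounds, seed=0):
    """Synchronous Push-Sum on a fully connected network (illustrative)."""
    rng = random.Random(seed)
    n = len(values)
    water = list(values)              # water starts as each peer's own value
    fish = [1.0] + [0.0] * (n - 1)    # exactly one fish in the whole lake
    for _ in range(rounds):
        # Every peer keeps half of what it holds ...
        new_water = [w / 2 for w in water]
        new_fish = [f / 2 for f in fish]
        # ... and pushes the other half to a random peer (possibly itself).
        for p in range(n):
            target = rng.randrange(n)
            new_water[target] += water[p] / 2
            new_fish[target] += fish[p] / 2
        water, fish = new_water, new_fish
    return water, fish

values = [float(v) for v in range(1, 21)]    # true sum is 210
water, fish = push_sum(values, rounds=60)

# Each peer that holds some fish estimates the sum as water per fish.
estimates = [w / f for w, f in zip(water, fish) if f > 0]
print(min(estimates), max(estimates))
```

Total water and total fish are conserved in every round, which is why the ratio can only converge to the true sum.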

Page 6: Brief Announcement: Practical Summation via Gossip

Stationary Distribution (Steady State)

Perturbations of equilibrium do not affect water/fish ratio

Equilibrium:
- edges carry the same water and fish in both directions
- peers have water and fish proportional to degree and clock
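The equilibrium can be checked on a small example. The sketch below is a deterministic (expected-value) version of the exchange, not the randomized protocol itself: each peer keeps half of its mass and splits the other half evenly over its edges. It assumes all peers run at the same clock rate, so only the degree matters; the 4-peer topology, the values, and the name `diffuse` are hypothetical.

```python
def diffuse(adj, mass, steps):
    """Each peer keeps half its mass and splits the rest evenly over its edges."""
    for _ in range(steps):
        new = [m / 2 for m in mass]
        for p, neighbours in enumerate(adj):
            share = mass[p] / (2 * len(neighbours))
            for q in neighbours:
                new[q] += share
        mass = new
    return mass

# Hypothetical topology: a triangle 0-1-2 with peer 3 attached to peer 2.
adj = [[1, 2], [0, 2], [0, 1, 3], [2]]
values = [5.0, 1.0, 3.0, 7.0]                    # true sum is 16

water = diffuse(adj, values, 200)
fish = diffuse(adj, [1.0, 0.0, 0.0, 0.0], 200)   # a single fish at peer 0

degree = [len(nb) for nb in adj]
total_degree = sum(degree)

# Water sent per step over one edge, seen from each endpoint.
flow = [water[p] / (2 * degree[p]) for p in range(len(adj))]

for p in range(len(adj)):
    print(p, degree[p], water[p], fish[p], water[p] / fish[p], flow[p])
```

In this run each peer ends up with water and fish proportional to its degree, every edge carries the same amount in both directions, and the water/fish ratio is the same at every peer.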

Page 7: Brief Announcement: Practical Summation via Gossip

Improvement: Big Fish eat smaller fish

Page 8: Brief Announcement: Practical Summation via Gossip

Fish eating in the Network

Page 9: Brief Announcement: Practical Summation via Gossip

Stationary Distribution (Steady State)

Page 10: Brief Announcement: Practical Summation via Gossip

Other improvements

Round switching
- Once the result is accurate "enough", restart
- Provides a running estimate of network statistics

Compensate for message loss

Prevent adding two of the most aggressive fish

Save bandwidth for multiple measurements

Page 11: Brief Announcement: Practical Summation via Gossip

Synchrony

Kempe et al. prove correctness in a synchronous model, but conjecture that it also works asynchronously

We validate this claim by simulation

1 million peers, 5s gossip interval, find network size:

[Figure: logarithmic size estimate over time (mm:ss), 27:00 to 35:00; curves for maximum, standard deviation, and minimum]
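A toy version of such an experiment, far smaller than the 1-million-peer simulation and not the authors' simulator, can be sketched as follows. It assumes a simple asynchronous model in which one randomly chosen peer acts at a time, keeping half of its water and fish and sending the other half to a random other peer. Every peer contributes a value of 1, so the estimated sum is the network size. The function name, seed, and event count are assumptions made for illustration.

```python
import random

def async_size_estimate(n, events, seed=1):
    """Asynchronous gossip: one randomly chosen peer acts per event."""
    rng = random.Random(seed)
    water = [1.0] * n                 # every peer contributes 1, so the sum is n
    fish = [1.0] + [0.0] * (n - 1)    # a single fish in the network
    for _ in range(events):
        p = rng.randrange(n)          # peer whose timer fires
        q = rng.randrange(n - 1)      # pick a partner other than p
        if q >= p:
            q += 1
        water[p] /= 2                 # p keeps half ...
        fish[p] /= 2
        water[q] += water[p]          # ... and q receives the other half
        fish[q] += fish[p]
    return [w / f for w, f in zip(water, fish) if f > 0]

n = 200
estimates = async_size_estimate(n, events=200 * n)
print(min(estimates), max(estimates))
```

No peer waits for a global round boundary here, yet the per-peer estimates still settle near the true size in this small run.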

Page 12: Brief Announcement: Practical Summation via Gossip

Open Problem

Push-Sum is very vulnerable to attack
- Any peer can completely change the result
- This is largely due to the problem statement (sum!)

Simplistic prevention (bounds) is easily defeated
- Introduce too few of the largest fish type: result too large
- Switch rounds prematurely: result too small and unstable

What is a useful adversary model for summation?

Page 13: Brief Announcement: Practical Summation via Gossip

www.dvs1.informatik.tu-darmstadt.de

Questions?

Thanks for listening!