boston university computer science on the marginal utility of network topology measurements john...

25
Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros, and Mark Crovella

Post on 19-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Boston University Computer Science

On The Marginal Utility of

Network Topology Measurements

John Byers

with

Paul Barford (now at Wisconsin),

Azer Bestavros, and Mark Crovella

Page 2: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Measurement Philosophy

Current Dogma: When conducting a wide-area measurement study: “more is better.” More = measurements More = measurement sites

True, but taking more measurements and deploying more infrastructure is expensive!

Our focus: How much better is more? Even harder: When can we stop measuring?

Not much work on this topic in our community.

Page 3: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Problem Instance:Discovering Internet Topology

Typical goal: discover the router-level Internet graph

Typical approach: merge lists of known nodes and edges

Traceroute reports the IP path from A to B i.e., how IP paths are overlaid on the router

graph

Page 4: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Traceroute studies

Yield overlays of projections from S’s to D’s Sources: active, expensive Destinations: passive, cheap

SS

D

D DD

D

Page 5: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Motivating Questions

How should we use traceroute and what can it discover? Physical topology (nodes, links)? IP routing topology?

What’s a good way to organize a collection-of-traceroutes study? Many sources? Many destinations? How much is enough?

Page 6: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Theoretical Inroads

Take a graph G = (V, E) and a routing algorithm R. Choose j sources and k destinations at random. Consider the subgraph G’ = (V’, E’) induced by

routes from R between all (S, D) pairs. How do expected values of |V’| and |E’| scale as a

function of j and k ? Chuang-Sirbu scaling law is special case for j = 1.

Marginal utility of adding k+1 ’st source or destination is expected contribution to |V’| or |E’|.

Page 7: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

What might we expect?

Two extremal cases: Clique: each new (S, D) discovers a new path Star: each new S or D discovers only a small

neighborhood

D

D D

D

D

D

D D

D

DS

S SS

Clique Star

Page 8: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Skitter to the Rescue

Two datasets from CAIDASmall dataset: May 2000

8 sources, 1277 destinations, 20K paths Sources in: New Zealand, Japan, Singapore, San

Jose (2), Ottawa, London, Washington All sources traced to all destinations

Large dataset: October 2000, 30 times bigger 12 sources, 313709 destinations, 600K paths No destination common to all sources, or vice versa

Page 9: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Interface Disambiguation

Traceroutes report only on interfaces used Routers often have multiple interfaces But merging traceroutes requires matching routers

Solution: probe each interface from some site X Routers are supposed to respond on the interface

used for routing to X

Results in set of (probe interface, response interface) pairs Each connected component is taken to be a router

Page 10: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Classifying Nodes

Core, border, stub, leafSolely from traceroute information

Leaf Border Core Stub

Page 11: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Classification depends on msmts

Core

Stub

Border

Page 12: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Limitations and Caveats

Interface disambiguation 13% of interfaces never responded

Node classification Identifying a border node requires two paths to it

Representativeness Datasets are small, may not be representative Skitter sources not selected at random

Unknown coverage of true network Diminishing returns may not signify good coverage

Page 13: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Diminishing Returns (Small Dataset)

Page 14: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Diminishing Returns (Large Dataset)

Page 15: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Diminishing returns by Classification (Small Dataset)

Core

Stub

Border

Page 16: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

What Does This Suggest?

D

D D

D

D

DS

S

Page 17: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Adding Destinations: Nodes

Slope isabout 3

Page 18: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Adding Destinations: Links

Slope isabout 4

Page 19: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Add Sources or Destinations?

Isolines represent constant node discovery, varying S’s or D’s

Page 20: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Node Degree Distribution

8 Sources

1 Source

Page 21: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Node Degree Distribution: Tail

1 Source

8 Sources

Page 22: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Degree distribution convergence: RMSE

Page 23: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Information Theory Plug

Link Discovery

Can compare marginal utility of different processes.

Node Discovery

Page 24: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Related Work

Pansiot & Grad ’98 First multi-traceroute study Similar methodology, incl. interface disambiguation

Chuang & Sirbu ’98Phillips, Shenker & Tangmunarunkit ’99 single-source case, found sublinear growth of multicast

tree with added destinations Govindan & Tangmunarunkit ’00

Extensive node discovery, overcoming limitations of traceroute

Broido & Claffy ’01 Larger datasets; more detailed look at graph structure

Page 25: Boston University Computer Science On The Marginal Utility of Network Topology Measurements John Byers with Paul Barford (now at Wisconsin), Azer Bestavros,

Conclusions

Rigorous quantification of marginal utility of additional measurements.

To discover all physical nodes, traceroute is inefficient Diminishing returns: many S’s and D’s needed

Trading off S’s and D’s Adding destinations seems more cost-effective

To discover how “typical” routes pass through network, traceroute is informative Routing core and feeders Much of routing core is visible from few S’s

(given enough D’s)