hitesh ballani, paul francis, tuan cao and jia wang cornell …ballani.azurewebsites.net › talks...

77
Making Routers Last Longer with ViAggre Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell University and AT&T Labs–Research NSDI 2009

Upload: others

Post on 28-Jun-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Making Routers Last Longer with ViAggre

Hitesh Ballani, Paul Francis, Tuan Cao and JiaWang

Cornell University and AT&T Labs–Research

NSDI 2009

Page 2: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Motivation: Rapid Routing Table Growth

0

50000

100000

150000

200000

250000

300000

0806040200989694929088

Inte

rnet

Rou

ting

Tab

le S

ize

Year

282,000 prefixes (Sep’08)

0

50000

100000

150000

200000

250000

300000

0806040200989694929088

Inte

rnet

Rou

ting

Tab

le S

ize

Year

282,000 prefixes (Sep’08)

[Data Credit: Geoff Huston]

Page 3: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Motivation: Rapid Routing Table Growth

0

100000

200000

300000

400000

500000

140806040200989694929088

Inte

rnet

Rou

ting

Tab

le S

ize

Year

0

100000

200000

300000

400000

500000

140806040200989694929088

Inte

rnet

Rou

ting

Tab

le S

ize

Year

??

Rapid future growth

I IPv4 exhaustionI IPv6 deployment

Routing Table stored in Forwarding InformationBase (FIB) on Routers

Large Routing Table ⇒ More FIB space on Routers

Page 4: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Does FIB Size Matter?

The problem is Scaling Properties of FIB memory(low volume, off-chip SRAM)

Technical concernsI Power and Heat dissipation problems

Business concerns

I Low-volume, off-chip SRAM does not trackMoore’s law

I Larger routing table ⇒ Less cost-effectivenetworks

I Price per byte forwarded increases

I Cost of router memory upgrades

Page 5: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Does FIB Size Matter?

Anecdotal evidence shows ISPs are willing toundergo some pain to extend the lifetime of their

routers

Page 6: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Virtual Aggregation (ViAggre)

A “configuration-only” approach to shrinking routerFIBs

I Applies to legacy routersI Can be adopted independently by any ISP

Real World Impact

I IETF Standards effortI Huawei implementing ViAggre into routers

Key Insight: Divide the routing burden

A router only needs to keep routes for a fraction ofthe address space

Page 7: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Talk Outline

I Motivation[]y

I Router Innards[]y

I Big Picture[]y

I ViAggre Design[]y

I Design Concerns[]y

I Evaluation[]y

I Deployment[]y

Page 8: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Router Innards

Route Processor

Line Card

ASIC FIB

Line Card

Line Card

Line Card

RIB

Router

Routing Protocol

(RP)

Switch Fabric

RP

Router

RP

Router

Page 9: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Router Innards

Route Processor

Line Card

ASIC FIB

Line Card

Line Card

Line Card

RIB

Router

Routing Protocol

(RP)

Switch Fabric

RP

Router

RP

Router

Control PlaneParticipates in routing protocol

Page 10: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Router Innards

Route Processor

Line Card

ASIC FIB

Line Card

Line Card

Line Card

RIB

Router

Routing Protocol

(RP)

Switch Fabric

RP

Router

RP

Router

RoutingInformation

(DRAM $)Base

Control PlaneRIB is a table of routes and is stored on slow

memory

Page 11: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Router Innards

Route Processor

Line Card

ASIC FIB

Line Card

Line Card

Line Card

RIB

Router

Routing Protocol

(RP)

Switch Fabric

RP

Router

RP

Router

Forwarding Information Base(SRAM $$$)

Data PlaneResponsible for sending packets based on FIB (stored

in fast memory)

Page 12: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Routing Scalability Problem Space

[MapEncap’96]

[GSE, ID’97]

[Atoms, ’04]

[CRIO, ICNP’06]

[LISP, ID’07]

[SIRA, ID’07]

[TRRP, ’07]

[APT, ID’07]

[Six/One,

MobiArch’08]

[Francis, CNIS’94]

[Deering, ID’00]

[Hain, ID’02]

[Krioukov, Arxiv’05]

[Shim6, ID’07]

[Multipath, ’08]

A few problems afflict Internet routing scalabilityLots of work to address these problems

FIB growthRIB growth

Routing Convergence,Update Churn, ....

Page 13: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Routing Scalability Problem Space

[MapEncap’96]

[GSE, ID’97]

[Atoms, ’04]

[CRIO, ICNP’06]

[LISP, ID’07]

[SIRA, ID’07]

[TRRP, ’07]

[APT, ID’07]

[Six/One,

MobiArch’08]

[Francis, CNIS’94]

[Deering, ID’00]

[Hain, ID’02]

[Krioukov, Arxiv’05]

[Shim6, ID’07]

[Multipath, ’08]

Separate edge from the core

FIB growthRIB growth

Routing Convergence,Update Churn, ....

Page 14: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Routing Scalability Problem Space

[MapEncap’96]

[GSE, ID’97]

[Atoms, ’04]

[CRIO, ICNP’06]

[LISP, ID’07]

[SIRA, ID’07]

[TRRP, ’07]

[APT, ID’07]

[Six/One,

MobiArch’08]

[Francis, CNIS’94]

[Deering, ID’00]

[Hain, ID’02]

[Krioukov, Arxiv’05]

[Shim6, ID’07]

[Multipath, ’08]

Geographical routing

FIB growthRIB growth

Routing Convergence,Update Churn, ....

Page 15: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Routing Scalability Problem Space

[MapEncap’96]

[GSE, ID’97]

[Atoms, ’04]

[CRIO, ICNP’06]

[LISP, ID’07]

[SIRA, ID’07]

[TRRP, ’07]

[APT, ID’07]

[Six/One,

MobiArch’08]

[Francis, CNIS’94]

[Deering, ID’00]

[Hain, ID’02]

[Krioukov, Arxiv’05]

[Shim6, ID’07]

[Multipath, ’08]

Compact routing

FIB growthRIB growth

Routing Convergence,Update Churn, ....

Page 16: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Routing Scalability Problem Space

[MapEncap’96]

[GSE, ID’97]

[Atoms, ’04]

[CRIO, ICNP’06]

[LISP, ID’07]

[SIRA, ID’07]

[TRRP, ’07]

[APT, ID’07]

[Six/One,

MobiArch’08]

[Francis, CNIS’94]

[Deering, ID’00]

[Hain, ID’02]

[Krioukov, Arxiv’05]

[Shim6, ID’07]

[Multipath, ’08]

Elimination Approaches

FIB growthRIB growth

Routing Convergence,Update Churn, ....

Page 17: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Routing Scalability Problem Space

[MapEncap’96]

[GSE, ID’97]

[Atoms, ’04]

[CRIO, ICNP’06]

[LISP, ID’07]

[SIRA, ID’07]

[TRRP, ’07]

[APT, ID’07]

[Six/One,

MobiArch’08]

[Francis, CNIS’94]

[Deering, ID’00]

[Hain, ID’02]

[Krioukov, Arxiv’05]

[Shim6, ID’07]

[Multipath, ’08]

All require architectural changeSo many good ideas, so little impact!

FIB growthRIB growth

Routing Convergence,Update Churn, ....

Page 18: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Routing Scalability Problem Space

[MapEncap’96]

[GSE, ID’97]

[Atoms, ’04]

[CRIO, ICNP’06]

[LISP, ID’07]

[SIRA, ID’07]

[TRRP, ’07]

[APT, ID’07]

[Six/One,

MobiArch’08]

[Francis, CNIS’94]

[Deering, ID’00]

[Hain, ID’02]

[Krioukov, Arxiv’05]

[Shim6, ID’07]

[Multipath, ’08]

Can we devise an incremental solution by focusing ona subset of the problem space?

FIB growthRIB growth

Routing Convergence,Update Churn, ....

Page 19: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Routing Scalability Problem Space

[MapEncap’96]

[GSE, ID’97]

[Atoms, ’04]

[CRIO, ICNP’06]

[LISP, ID’07]

[SIRA, ID’07]

[TRRP, ’07]

[APT, ID’07]

[Six/One,

MobiArch’08]

[Francis, CNIS’94]

[Deering, ID’00]

[Hain, ID’02]

[Krioukov, Arxiv’05]

[Shim6, ID’07]

[Multipath, ’08]

This Talk: Focuses on reducing FIB size

FIB growthRIB growth

Routing Convergence,Update Churn, ....

Page 20: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Talk Outline

I Motivation[]y

I Router Innards[]y

I Big Picture[]y

I ViAggre Design[]y

I Design Concerns[]y

I Evaluation[]y

I Deployment[]y

Page 21: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre: Basic Idea

PoP A PoP C

PoP B

ISP

IPv4AddressSpace

0.0.0.0

255.255.255.255

External Router External

Router

Today: All routers have routes to all destinations

Page 22: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre: Basic Idea

PoP A PoP C

PoP B

ISP

0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2

VirtualPrefixes

Divide address space into Virtual Prefixes (VPs)Notation: “/2” implies that the first two bits are used to group IP

addresses. “0/2” represents addresses starting with 00.

i.e. 0/2 ⇒ 0.0.0.0/2 ⇒ [0.0.0.0 to 63.255.255.255]

Page 23: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre: Basic Idea0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2

VirtualPrefixes

Aggregation Pointsfor Green VP

Assign Virtual Prefixes to the routersGreen Aggregation Points maintain routes to green

prefixes

Page 24: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre: Basic Idea0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2

VirtualPrefixes

Aggregation Pointsfor Green VP

Routers only have routes to a fraction of the addressspace

Page 25: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre: Basic Idea0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2

VirtualPrefixes

Aggregation Pointsfor Green VP

1. How to achieve such division of the routing tablewithout changes to routers and external cooperation?

2. How do packets traverse even though routers havepartial routing tables?

Page 26: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Control-Plane0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2

Only Blue Routesshould go into FIB

Control-plane needs to ensure that a router’s FIBonly contains routes that the router is aggregating

Page 27: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Control-Plane0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 Full Routing Table

External BGP Peers may advertise full routing table

Page 28: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Control-Plane0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 Full Routing Table

RIB

FIB

Load full routingtable into RIB

Supress all butblue routes from FIB

Simple Approach: FIB SuppressionRouters can load a subset of the RIB into their FIB

High Performance Overhead

Page 29: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Control-Plane0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2

Full RoutingTable

Route Reflector

RIB

FIB

Practical Approach: Route-reflector SuppressionExternal router peers with a route-reflector

Blue router receives only blue routes

Page 30: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Control-Plane0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2

Full RoutingTable

Route Reflector

Practical Approach: Route-reflector SuppressionRoute-reflectors exchange routes with each other

Page 31: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Data-Plane paths0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 Virtual Prefix

Packets destined to a prefix in Red

Consider packets destined to a prefix in the red VP

Page 32: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Data-Plane paths0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2

12

ViAggre pathIngress (I) → Aggregation Pt (A) → Egress (E)

Page 33: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Ingress → Aggregation Point0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2

1

Router I doesn’t have a route for destination prefix

Page 34: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Ingress → Aggregation Point0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2

1

Advertise Red VP

Aggregation Points advertise corresponding VirtualPrefixes

Page 35: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Ingress → Aggregation Point0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2

1

Advertise Red VP

Prefix Next-HopP1P2

........

0/2 A128/2192/2 ....

....

Blue router has a route for the red Virtual Prefix

Page 36: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Aggregation Point → Egress0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2 2

Prefix Next-HopP3P4

X....

64/2 ....

128/2 192/2 ....

....

Aggregation Pt. A has a route for destination prefix

Page 37: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Aggregation Point → Egress0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2 2

Non-red routers don’t have a route

for dst. prefix

Router A tunnels packet to external router asintermediate routers don’t have route to dst. prefixOriginal packet is encapsulated in tunnel header with X as dst.

Page 38: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Aggregation Point → Egress0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2 2

......

XATunnel Header

Dst

Original Packet

Src

Router A tunnels packet to external router asintermediate routers don’t have route to dst. prefixOriginal packet is encapsulated in tunnel header with X as dst.

Page 39: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Aggregation Point → Egress0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2

1

Strip tunnel headerfrom outgoing pkts

Egress Router strips the tunnel header off outgoingpackets

Page 40: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Talk Outline

I Motivation[]y

I Router Innards[]y

I Big Picture[]y

I ViAggre Design[]y

I Design Concerns[]y

I Evaluation[]y

I Deployment[]y

Page 41: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Failure of Aggregation Point0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2

1

What if Aggregation Pt. A fails?

Page 42: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Failure of Aggregation Point0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2

Prefix Next-HopP1P2

........

0/2 A2128/2192/2 ....

....

Router I installs the route advertised by A2

Page 43: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Failure of Aggregation Point0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2

Prefix Next-HopP1P2

........

0/2 A2128/2192/2 ....

....

Packets are re-routed appropriately

Page 44: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre’s impact on ISP’s traffic0.0.0.0

255.255.255.255

External Router External

Router

0/2

64/2

128/2

192/2 I

A

E X

A2

1

ViAggre paths can be longer than native pathsTraffic stretch, increased router and link load, etc.

Page 45: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Popular Prefixes

Traffic volume follows power-law distribution

I 95% of the traffic goes to 5% of prefixesI Has held up for years

Install “Popular Prefixes” in routers

I Stable over weeksI Mitigates ViAggre’s impact on the ISP’s traffic

Page 46: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Talk Outline

I Motivation[]y

I Router Innards[]y

I Big Picture[]y

I ViAggre Design[]y

I Design Concerns[]y

I Evaluation[]y

I Deployment[]y

Page 47: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre’s impact on adopting ISP

Positive Negative

Increase in path length(Stretch in msec)

Reduction in FIB Size

(% of global Load Increase

routing table) (Increase in trafficcarried by routers)

Page 48: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre’s impact on adopting ISP

Positive Negative

Increase in path length(Stretch in msec)

Reduction in FIB Size

(% of global Load Increase

routing table) (Increase in trafficcarried by routers)

ViAggre deployment optionsI Choosing Virtual Prefixes

I Choosing Aggregation Points

I Choosing Popular Prefixes

ISP can make these choices to tune +ves Vs -ves

Page 49: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre’s impact on adopting ISP

Positive Negative

Increase in path length(Stretch in msec)

Reduction in FIB Size

(% of global Load Increase

routing table) (Increase in trafficcarried by routers)

ViAggre deployment optionsI Choosing Virtual Prefixes

I Choosing Aggregation Points

I Choosing Popular Prefixes

ISP can make these choices to tune +ves Vs -ves

Page 50: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Choosing Aggregation Points

Assigning more routers to aggregate a virtual prefix

I Reduces Stretch imposed on Traffic (as there is a

close-by aggregation point to send traffic to)

I Increases FIB size (as more cumulative FIB space isused)

ISP can choose aggregation points to trade-off

FIB Size Vs Stretch

Page 51: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Aggregation Point Assignment Problem

min Worst FIB Sizes.t. Worst Stretch ≤ Constraint

Constraint on Worst Stretch ensuresI ISP’s Service Level Agreements not breached

I Latency-sensitive traffic not hurt too much

Worst FIB SizeI Important for provisioning routers

Aforementioned Constraint ProblemI Can be mapped to MultiCommodity Facility Location

I NP-hard problem

I Logarithmic approximation algorithm [Ravi, Sinha, SODA’04]

Page 52: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Tier-1 ISP Study

We implemented a greedy approximation algorithm

Algorithm Input: Data from tier-1 ISP

I Topology, Routing tables, Traffic matrix

Used our algorithm with varying stretch constraints

Page 53: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

FIB Size Vs Stretch

0

5

10

15

20

25

30

0 2 4 6 8 10

FIB

Siz

e (

% o

f Int

erne

t rou

ting

tabl

e)

Worst-case Stretch Constraint (msec)

Worst-case FIB SizeAverage FIB Size

0

5

10

15

20

25

30

0 2 4 6 8 10

FIB

Siz

e (

% o

f Int

erne

t rou

ting

tabl

e)

Worst-case Stretch Constraint (msec)

Worst-case FIB SizeAverage FIB Size

FIB Size reduces as Stretch constraint is relaxed

Page 54: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

FIB Size Vs Stretch

0

5

10

15

20

25

30

0 2 4 6 8 10

FIB

Siz

e (

% o

f Int

erne

t rou

ting

tabl

e)

Worst-case Stretch Constraint (msec)

Worst-case FIB SizeAverage FIB Size

0

5

10

15

20

25

30

0 2 4 6 8 10

FIB

Siz

e (

% o

f Int

erne

t rou

ting

tabl

e)

Worst-case Stretch Constraint (msec)

Worst-case FIB SizeAverage FIB Size

Worst-case FIB Size10K prefixes

(4% of global routing table)

FIB Size reduces as Stretch constraint is relaxed

Page 55: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

FIB Size Vs Stretch

0

5

10

15

20

25

30

0 2 4 6 8 10 0

1

2

3

4

5

6

7F

IB S

ize

(%

of g

loba

l rou

ting

tabl

e)

Act

ual S

tret

ch (

mse

c)

Worst-case Stretch Constraint (msec)

Worst-case FIB SizeAverage FIB Size

Worst StretchAverage Stretch

0

5

10

15

20

25

30

0 2 4 6 8 10 0

1

2

3

4

5

6

7F

IB S

ize

(%

of g

loba

l rou

ting

tabl

e)

Act

ual S

tret

ch (

mse

c)

Worst-case Stretch Constraint (msec)

Worst-case FIB SizeAverage FIB Size

Worst StretchAverage Stretch

Average Stretch is negligible

Page 56: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

FIB Size Vs Stretch

0

5

10

15

20

25

30

0 2 4 6 8 10 0

1

2

3

4

5

6

7F

IB S

ize

(%

of g

loba

l rou

ting

tabl

e)

Act

ual S

tret

ch (

mse

c)

Worst-case Stretch Constraint (msec)

Worst-case FIB SizeAverage FIB Size

Worst StretchAverage Stretch

0

5

10

15

20

25

30

0 2 4 6 8 10 0

1

2

3

4

5

6

7F

IB S

ize

(%

of g

loba

l rou

ting

tabl

e)

Act

ual S

tret

ch (

mse

c)

Worst-case Stretch Constraint (msec)

Worst-case FIB SizeAverage FIB Size

Worst StretchAverage Stretch

Average Stretch0.2 msec

Average Stretch is negligible

Page 57: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

FIB Size Vs Stretch

0

5

10

15

20

25

30

0 2 4 6 8 10 0

1

2

3

4

5

6

7F

IB S

ize

(%

of g

loba

l rou

ting

tabl

e)

Act

ual S

tret

ch (

mse

c)

Worst-case Stretch Constraint (msec)

Worst-case FIB SizeAverage FIB Size

Worst StretchAverage Stretch

0

5

10

15

20

25

30

0 2 4 6 8 10 0

1

2

3

4

5

6

7F

IB S

ize

(%

of g

loba

l rou

ting

tabl

e)

Act

ual S

tret

ch (

mse

c)

Worst-case Stretch Constraint (msec)

Worst-case FIB SizeAverage FIB Size

Worst StretchAverage Stretch

ViAggre can extend lifetime of outdatedrouters by 7-10 years while imposing no stretch

(Worst-case Stretch Constraint = 0ms)

Page 58: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Router Load

Naıve ViAggre deployment

I Traffic routed through aggregation points

I Can lead to substantial load increase acrossrouters

I Alleviative: Use of Popular Prefixes

Page 59: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Router Load

Naıve ViAggre deployment

I Traffic routed through aggregation points

I Can lead to substantial load increase acrossrouters

I Alleviative: Use of Popular Prefixes

A lot of traffic destined to popular prefixes

60

70

80

90

100

2015105321% o

f Tra

ffic

Car

ried

% of Popular Prefixes

60

70

80

90

100

2015105321% o

f Tra

ffic

Car

ried

% of Popular Prefixes

Page 60: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Router Load

100

39

10

1

0.120151053210

24.519.514.59.56.54.5Lo

ad In

crea

se(%

of n

ativ

e lo

ad)

% of Popular Prefixes

Worst FIB size (% of DFZ routing table)

100

39

10

1

0.120151053210

24.519.514.59.56.54.5Lo

ad In

crea

se(%

of n

ativ

e lo

ad)

% of Popular Prefixes

Worst FIB size (% of DFZ routing table)

Popular prefixes populated in all routers5% Popular prefixes ⇒ Max. Load Increase= 1.38%

Page 61: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Pros

10x reduction in FIB Size

I Negligible Traffic Stretch (<0.2 msec)I Negligible Increase in Load (<1.5%)

Advantages

I Can be incrementally deployed

I Can be deployed on a limited-scale

I Incentive for deploymentI No change to ISP’s routing setup

I Does not affect routes advertised toneighbors

I Does not restrict routing policies

Page 62: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Cons

Control-plane hacks can impactI Installation Time

I Convergence Time

I Failover Time

Planning OverheadI Choosing virtual prefixes

I Assigning aggregation pointsI Assuring network robustness

Configuration overhead of a configuration-onlysolution

Page 63: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Deployment on WAIL

ISP with ViAggre

PoP1 PoP2

AS2 AS3

Routes propagated using

I Status Quo

I ViAggre (prefix lists for selective advertisement)

Page 64: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Deployment on WAIL

ISP with ViAggre

PoP1 PoP2

AS2 AS3

Routes propagated using

I Status Quo

I ViAggre (prefix lists for selective advertisement)

Routes propagated using mesh of internal BGP peerings

Page 65: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Deployment on WAIL

ISP with ViAggre

PoP1 PoP2

AS2 AS3

RR1 RR2

Routes propagated using

I Status Quo

I ViAggre (prefix lists for selective advertisement)

Prefix List size depends on # of popular prefixes

Page 66: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Deployment on WAIL

ISP with ViAggre

PoP1 PoP2

AS2 AS3

X

Measuring Control-Plane Overhead

Restart external peeringMeasure Installation Time

Page 67: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Installation Time on WAIL

50

100

150

200

250

300

0 50 100 150 200 250

Inst

alla

tion

Tim

e (s

ec)

Number of Prefixes Advertised (thousands)

Status QuoViAggre, 2% PopularPrefixesViAggre, 5% PopularPrefixes

ViAggre reduces Installation Time

Full Routing Table Installation TimeStatus Quo=273sec, ViAggre (2% Popular Prefixes)=124sec

Page 68: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre management overhead

Developed Configuration Tool

I ∼330 line python script

I Extracts information from existing configuration files

I Generates ViAggre configuration files

I Planning component in the works

Working with a router vendor (Huawei)

I Implement ViAggre natively

I IETF Draft

Page 69: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

ViAggre Conclusion

ViAggre shrinks the FIB on routers

I Can be used by ISPs today!

I 10x reduction in FIB Size

I Negligible traffic stretchI Negligible load increase

ISPs can extend lifetime of their routersI Outdated routers can be used for 7-10 years

Is this a “complete” solution? No

I A simple and effective first step

Page 70: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Thank You!

Page 71: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Does FIB Size Matter?

YesTony Li [IAB Workshop’06]

Vince Fuller [APRICOT’07]IAB Workshop [RFC 4984]

. . .

NoDefaultOff [HotNets’05]

AIP [SIGCOMM’08]

. . .

Page 72: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Does FIB Size Matter?

YesTony Li [IAB Workshop’06]

Vince Fuller [APRICOT’07]IAB Workshop [RFC 4984]

. . .

MaybeMe

NoDefaultOff [HotNets’05]

AIP [SIGCOMM’08]

. . .

Page 73: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Does FIB Size Matter?

YesTony Li [IAB Workshop’06]

Vince Fuller [APRICOT’07]IAB Workshop [RFC 4984]

. . .

MaybeMe

NoDefaultOff [HotNets’05]

AIP [SIGCOMM’08]

. . .

Other reasons to reduce FIB Size

I Rapid future multihomingI To facilitate commodification of ISP business

Anecdotal evidence shows ISPs are willing toundergo some pain to extend the lifetime of their

routers

Page 74: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Rapid Routing Table Growth

BigISPs

LittleISPs

Sites

A B C

a1 a2 b1 b2 c1 c2

a11 a12 a21 a22 b11 b12 b21 b22 c11 c12 c21 c22

Internet Routing Scalability is based on hierarchyRequires addressing to be aligned with topology

Page 75: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Rapid Routing Table Growth

BigISPs

LittleISPs

Sites

A B C

a1 a2 b1 b2 c1 c2

a11 a12 a21 a22 b11 b12 b21 b22 c11 c12 c21 c22

Address � Topology MatchSites a11 and a12 are addressed from the address block of a1

which is addressed from the address block of A{a11, a12} ⊂ a1 ⊂ A

Page 76: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Rapid Routing Table Growth

BigISPs

LittleISPs

Sites

A B C

a1 a2 b1 b2 c1 c2

a11 a12 a21 a22 b11 b12 b21 b22 c11 c12 c21 c22

A

Routing should scale by:Number of top-level ISPs and Fan-out

Routing state on A: {B, C, a1, a2}

Page 77: Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell …ballani.azurewebsites.net › talks › talk-nsdi09-va.pdf · Hitesh Ballani, Paul Francis, Tuan Cao and Jia Wang Cornell

Rapid Routing Table Growth

BigISPs

LittleISPs

Sites

A B C

a1 a2 b1 b2 c1 c2

a11 a12 a21 a22 b11 b12 b21 b22 c11 c12 c21 c22

Address � Topology MismatchMultihoming, Load Balancing, Address

Fragmentation, Bad Operational Practices