universitÀ degli studi roma tre dipartimento di informatica e automazione monitoring the status of...

Post on 18-Dec-2015

220 Views

Category:

Documents

1 Downloads

Preview:

Click to see full reader

TRANSCRIPT

UNIVERSITÀ DEGLI STUDI ROMA TREDipartimento di Informatica e Automazione

Monitoring the Status of MPLS VPN and VPLS Based on BGP

Signaling Information

Giuseppe Di BattistaMassimo Rimondini

Giorgio Sadolfo

IEEE/IFIP NOMS 201218/04/2012

UNIVERSITÀ DEGLI STUDI ROMA TREDipartimento di Informatica e Automazione

Monitoring the Status of MPLS VPN and VPLS Based on BGP

Signaling Information

Giuseppe Di BattistaMassimo Rimondini

Giorgio Sadolfo

IEEE/IFIP NOMS 201218/04/2012

About MPLS VPNs/VPLS

VPN VPLSMPLS

NOMS 2012 - 18/04/2012

ISP BB

About MPLS VPNs/VPLS

Customersite

Customersite

Customersite

Customersite

Customersite

Customer

Customer

’s EtherSphere™

About MPLS VPNs/VPLS

192.168.0.4

State of the Art(in MPLS/VPLS monitoring)

NOMS 2012 - 18/04/2012

rese

arc

h

ind

ust

ryre

searc

hte

chn

olo

gy

monitoring

control planeMPLS and VPLS

indust

ry

State of the Art(in MPLS/VPLS monitoring)

IP Solution Center

Service Aware Manager

Service Activator Solution for VPN Services

Tivoli Network Manager

VPN Explorer

indu

stry

rese

arc

hte

chn

olo

gy

State of the Art(in MPLS/VPLS monitoring)

Routing convergenceD. Pei, J. Van der Merwe. BGPConvergence in Virtual Private Networks.Proc. IMC, 2006.

ScalabilityC. Kim, A. Gerber, C. Lund, D. Pei, S. Sen. Scalable VPN Routing via Relaying. Proc. SIGMETRICS, 2008.

MonitoringM. K. Thottan, G. K. Swanson, M. Cancone, T. K. Ho, J. Ren, S. Paul. SEQUIN: An SNMP-based MPLS Network Monitoring System. Bell Labs Technical Journal 8(1), 95–111, 2003.

NOMS 2012 - 18/04/2012

ind

ust

ryre

searc

hte

chn

olo

gy

State of the Art(in MPLS/VPLS monitoring)

SNMPTIBCO Rendezvous Message TransportOracle DBMSRCP, RSHTelnet, SSHTFTP, FTP

NOMS 2012 - 18/04/2012

ind

ust

ryte

chn

olo

gy

rese

arc

h

MPLS VPN/VPLS monitoringmethodology

Focus on monitoringObservation of effects of network events• Reconfigurations• Failures

Additional technologies requiredRequires access to devicesGraphical visualization of VPN states

Extensive discussion on scalability vs visibility of (the effects of) network eventsArchitecture, prototype, experimentation in Junosphere NOMS 2012 - 18/04/2012

+ provisioningObservation of the network status

Instant snapshot of device states

+ history(Almost)Standard technologies (BGP)Unobtrusive

Exhaustive analysis of observable effects

Discovery of a subtle anomaly in the routing software, confirmed by Juniper

Our Contributions

Methodology

NOMS 2012 - 18/04/2012

3 Visualize VPN states

2 Reconstruct visibility of VPNs at PEs

1 Collect signaling messages

Methodology1. Collection

Approach Drawback(s)

Monitor network trafficUndetermined in absence of traffic

Inject network traffic Intrusive; hard to tune

Watch router configurations

Intrusive; access restrictions may apply

Watch router statesSame as above + untimely

Notifications (e.g., SNMP)Additional technologies required

*Limited visibility of the effect of a configuration

Monitor signaling messages

N/A

• Actual propagation of information

• Routing decisions @ PEs

BGPLDP

Methodology1. Collection

VPN signalingMPLS: BGPVPLS:

NOMS 2012 - 18/04/2012

Autodiscovery

Signaling

Vendor

RFC 4762 (Kompella) N/A LDP Cisco

RFC 4761 BGP BGP Juniper

BGP-based VPLS Autodiscovery

LDP-BGP VPLS Interworking

BGP is also...easy to set upscalablepolicy-aware

NOMS 2012 - 18/04/2012

Methodology1. Collection

Customersite

Customersite

Customersite

Customersite

Customersite

BGP peerings

Mmmh... I’m a reflector-client

Methodology 2. Reconstruction of VPN

state

Exhaustive comparison of information from different BGP updates

NOMS 2012 - 18/04/2012

timesta

mpRD

prefix

+

CE ID

RT

Extended communities

Extended communities

Extended communities

NLRI

NLRI

NLRI

type (A/W)

NOMS 2012 - 18/04/2012

Methodology 2. Reconstruction of VPN

stateExample

RD1

pfx1

+ RT1

RD1

pfx1

+

A

RT2

RD1

pfx1

+ RT2

Changed VPN?Reconfiguration?

Policy change?Moved pfx1 to a different VPN?

NOMS 2012 - 18/04/2012

Methodology 2. Reconstruction of VPN

stateApply the method to a sequence of BGP updates

Reconstruct history of VPNvisibility at each PE

...

NOMS 2012 - 18/04/2012

Methodology 2. Reconstruction of VPN

stateA few difficulties:

Investigation of the PE where the effect was first observedDealing with missing attributes in withdrawalsInadmissible announcements [rfc4761]ReannouncementsSynchronization with actual VPNstatesMonitoring RC peering states

Methodology 3. Visualization

Query: visibility at each PE ofRD 12345:10011prefix 172.16.110.0/30RT 12345:111

time

PE

visibleoriginatednot visible

BGP updates

Query: visibility at each PE ofRD 12345:10011prefix 172.16.110.0/30

...with RT12345:111

...with RT12345:222

Methodology 3. Visualization

QueriesCheck information propagation• Input: RD+{prefix,CE ID}, RT• Output: Visibility from all PEs

Check a PE’s visibility of a specific VPN• Input: RT, PE• Output: Visibility of all RD+{prefix,CE ID} with that RT at that

PE

Highlight belonging of a prefix to a VPN• Input: RD+{prefix,CE ID}• Output: Visibility of that RD+{prefix,CE ID} from all PEs, with

each seen RT

Highlight participation of PEs in VPNs• Input: RT• Output: Visibility of that RT at each PE

*

*

*

#

#

#

#

* VPN≡RT# over time

Scalability

Routing table size >> #Internet prefixes: ~ k 105

[Ben-Houidi et al. 07] Only routing updates count Same scalability of [ORV], [BGPlay], [iBGPlay]

Amount of routing updates Lots of customers, prefixes, VPNs, etc. Bursts (due to, e.g., configurations changes,

faults) are unlikely 2-3 orders of magnitude less than VPN routes

[Ben-Houidi et al. 07] Our prototype works even for M/L ISPs[Ben-Houidi et al. 07] Z. Ben-Houidi, R. Teixeira, and M. Capelle, “Origin of route explosion in virtual

private networks,” in Proc. CoNEXT, 2007.

Scalability vs Visibility

Customersite

Customersite

Scalability vs Visibility

Customersite

Customersite

Scalability vs Visibility

Customersite

Customersite

Scalability vs Visibility

layer higherlower

scalability higherlower

visibility worsebetterbeware of matching updates

libbgpdumpbash

Experimental Scenario

visualization client

local storage

ROUTE COLLECTOR

routingdaemon

route retriever

databaseJFreeChart

• advertise MP extensions for L2VPN

• dump relevant fields to MRT

+ process L2VPN MP from MRTs+max lag: 3mins

preliminary testson Cisco routers

SEA

DEN

CHI

NYC

WAS

ATL

HOU

LAX

SEA

DEN

CHI

NYC

WAS

ATL

HOU

LAX VPLS

MPLS

SEA

DEN

CHI

NYC

WAS

ATL

HOU

LAX

SEA

DEN

CHI

NYC

WAS

ATL

HOU

LAX

Experiments

Injected events:(De+re)activation of customer sitesRT change(De+re)activation of multihomingLocal preference change in amultihoming configuration

TimingRandom orderVarying rate ( [1/hr...100/min] )

> 150,000 collected BGP updatesProcessing time: < 20s,without optimizations

NOMS 2012 - 18/04/2012

SEA

DEN

CHI

NYC

WAS

ATL

HOU

LAX

VPLS only!

The Oscillation Problem

Did not affect forwardingInvestigation with JuniperBest route selection in VPLS only considered

VPLS control flagssite preferencePE router IDties were broken on most recent announcement (could carry updated labels)

DISAGREE [Griffin et al. 02]Fix (being) released

[Griffin et al. 02] T. Griffin, F. B. Shepherd, and G. Wilfong, “The stable paths problem and interdomain routing,” IEEE/ACM Transactions on Networking, vol. 10, no. 2, pp. 232–243, 2002.

Wrapping Up

A monitoring methodology

Discussion on scalability vs visibilityArchitecture & prototype implementationExperimentation revealing routing anomaly

NOMS 2012 - 18/04/2012

Effects Signaling MPLS+VPLS Visualization

Operation Reconfiguration Troubleshooting

NOMS 2012 - 18/04/2012

Future Work/Open Problems

Monitor otherprotocols/kinds ofinformationCollect non-best routesImprove the visualizationTrigger alarmsImprove inference of event causes

Acknowledgments to

Thank you

top related