building a highly adaptive resilient and scalable mpls1352
TRANSCRIPT
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
1/29
2006 Verizon. All Rights Reserved. PT00000. 00/00/06
Building a Highly Adaptive, Resilient,
and Scalable MPLS Backbone
Building a Highly Adaptive, Resilient,
and Scalable MPLS Backbone
Ning So, Verizon Business
Hao-Hsin Huang, WANDL, Inc.
Feb. 9, 2007MPLS World Congress 2007Paris, France
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
2/29
2
AgendaAgenda
About Verizon Business and WANDL
Verizon Business MPLS core network
MPLS TE Fast Reroute (FRR) design on VzB MPLS core
Design challenges and solutions
Solving LSP meshing scalability Q&A
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
3/29
3
Verizon BusinessVerizon BusinessVerizon Business is the premier communications solutions
provider for global businesses, government agencies, andeducational institutions.
Data and IP services
Security
IT solutions
Managed networks
Premises equipment
Contact centers
ConferencingVoice
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
4/294
WANDLWANDLWANDL is the leading supplier of software solutions for advanced
network planning, management, design, and optimization
20 years expertise in software development for network optimization,planning, design, OSS/automation
Work with Cisco, Juniper, Tellabs, Alcatel, Nortel, Lucent, Huawei, etc.
Manage technologies such as IP, MPLS, VoIP, transport(SONET/SDH), FR/ATM ,TDM
Customers include carriers, ISPs, telcos, PTTs, service providers,enterprise, and government organizations
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
5/295
MPLS Core Network OverviewMPLS Core Network Overview
Verizon Business Private MPLS Core Network
U.S.-based
Backbone trunks are OC48 and OC192 packet-over-SONET
Provide Layer 2 (MPLS-TE) transport services for Verizon Businessprivate data networks including IP, Ethernet, and Frame Relay
networks
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
6/296
Drawing of the MPLS Core NetworkDrawing of the MPLS Core Network
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
7/297
Key Objectives of the NetworkKey Objectives of the Network
Network Capacity
Provide sufficient backbone capacity to carry traffic from all feedernetworks
Network Resiliency
100% traffic re-route during single hardware and/or transmission facility
failure
Network Performance
Provide the minimal latency routes available between any two network
elements Network Efficiency
Best practices in network architecture and traffic engineering techniques to
optimize network routing and resource usage
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
8/298
Why Implement MPLS TE FRR?Why Implement MPLS TE FRR?
Underlying transmission facilities supporting the MPLS core network aregradually shifting from a 1+1 SONET ring-based infrastructure to 1+0linear-based infrastructure.
Transmission equipment moving from conventional DWDM to Ultra LongHaul (ULH). ULH favors linear over ring infrastructures.
ULH has fewer optical regions compared with conventional systems, resulting in
a less efficient ring design (larger rings) and no impact on linear designULH has a much higher availability rate, reducing the need for the ring system for
transmission facility maintenance and repair work
The core network with linear transmission facility relies more on the Layer
2 reroute for service restoration.
MPLS TE FRR is the only proven technology that provides servicerestoration at a speed comparable to ring-based restoration.
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
9/29
9
FRR Implementation Design ChallengesFRR Implementation Design Challenges
FRR design can be highly complicated with implementation andmanagement presenting bigger challenges to service providers.
FRR design challenges
1. Bundled FRR design for ease of implementation and management
2. FRR bandwidth and pre-emption design
3. FRR design with built-in facility failures knowledge base
4. Do not re-route LSP on a FRR-enabled network environment
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
10/29
10
FRR Design Challenge #1FRR Design Challenge #1
Description
When Juniper FRR is engineered and signaled on a per LSPbasis, service providers lose the control of FRR route selection
Design and management of individual FRR quickly becomes tootime consuming for service providers with a large MPLS core
network (such as the Verizon Business MPLS Core network).
Solution
Utilize the node/link protection scheme
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
11/29
11
Challenge #1 Solution Illustration NLP Discovery and Design ParametersChallenge #1 Solution Illustration NLP Discovery and Design Parameters
NLP flag on LSP
Auto FRR designparameters
Primary tunnel path in yellowNLP bypass tunnel in green
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
12/29
12
Challenge #1 Solution Bypass Tunnel andIts Protected Primary Tunnel PathsChallenge #1 Solution Bypass Tunnel andIts Protected Primary Tunnel Paths
Bypass tunnel path Protected path
All tunnels being protectedby this bypass tunnel
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
13/29
13
FRR Design Challenge #2FRR Design Challenge #2
Description
How to set up the bypass tunnel bandwidth and preemption?
Problems occur if bypass tunnel bandwidth is set up accordingto the primary LSP bandwidth Additional bandwidth on the backbone has to be reserved for FRR, causing
inefficient use of valuable network resources
FRR bandwidth and preemption design quickly becomes too complicatedwhen multiple FRR paths are set up to account for multiple network failures
Multiple network element failure can cause domino effect on FRR reroutedue to preemption which magnifies the problem and causes networkinstability
Service provider loses performance predictability due to the massiveamount of combinations and permutations of the re-route scenarios
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
14/29
14
FRR Design Challenge #2FRR Design Challenge #2
Solution
Set bypass tunnel bandwidth to zero, effectively turningoff the preemption
All traffic on the FRR route share the pain during theoutages
Rely on the LSP re-signal to restore the LSP primaryroute, based on the new network topology
Allow queuing at the physical interface level handle thetraffic discard during congestion
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
15/29
15
Challenge #2 SolutionOption 1: Both Design and RSVP BW=0Challenge #2 SolutionOption 1: Both Design and RSVP BW=0
All bypass tunnels are routed based on 0 bw requirement forprotection.
Shortest path results. An efficient network resource usage results due to 0 rsvp bw. Queuing delay could occur during congestion.
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
16/29
16
Challenge #2 SolutionOption 2: Full Design BW and 0 RSVP BWChallenge #2 SolutionOption 2: Full Design BW and 0 RSVP BW
All bypass tunnels are routed based on 100% primary tunnel bwrequirement for protection.
Best paths for full protection will result. An efficient network resource usage results due to 0 rsvp bw.
Propagation delay could be increased for some bypass tunnels.
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
17/29
17
FRR Design Challenge #2FRR Design Challenge #2
Alternative Solution
Create multiple bypass tunnels Set maximum bw and subscription ratio for bypass
tunnels.
Load balance among all available network resources Minimize congestion
Prevent losing all tunnels during multiple failure
scenarios
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
18/29
18
Challenge #2 Alternative Solution Multiple Bypass TunnelsChallenge #2 Alternative Solution Multiple Bypass Tunnels
Full BW for bypasstunnel design
Low RSVP BW toconserve network
resource Set maximum protectionBW for bypass tunnels
Set maximum number ofbypass tunnels allowed
A possible scenario is to use multiple (4) OC48 trunks to protect asingle OC192 trunk
Load balancing is desired among 4 OC48 trunks
This scheme could preserve 75% of the traffic even when the OC192and one OC48 are down at the same time
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
19/29
19
FRR Design Challenge #3FRR Design Challenge #3
Description
Current MPLS network does not have any knowledge of thephysical layer topology, so the auto-configured FRR maybe put on the same physical transmission facilities as theprimary route of the LSP
Manual configuration of individual FRR, even using node/linkprotection scheme, can be error prone
Admin Group function is the ideal and logical choice formarking the facilities. However, existing RSVP TE extensiondoes not include Admin Group
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
20/29
20
FRR Design Challenge #3FRR Design Challenge #3
Solution
Use fate sharing function to group trunks based on
physical topology Fate sharing allows for manual categorization of trunks
When constructing bypass tunnels, router will avoid links
in the same group as the protected link Juniper fate-sharing and Cisco Share Risk Link Group
(SRLG) features are vendor-specific implementation
Create explicit bypass paths using WANDL tool to ensurefacility diversity
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
21/29
21
Challenge #3 Solution Facility GroupingChallenge #3 Solution Facility Grouping
Automatically constructed facility groupingduring parsing of the configuration files
User may add new facility grouping to studyits impact on bypass tunnel design
Specify facility diversity for FRR auto design
Three OC48 ports on
the same PIC
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
22/29
22
Challenge #3 Solution FRR Design with Facility DiversityChallenge #3 Solution FRR Design with Facility Diversity
Primary tunnel path in yellow
and NLP bypass tunnel in green With facility diversity Bypass tunnel avoids link
in the same facility due toadded constraint
W/o facility diversity
Bypass tunnel useslink in the same facilitydue to CSPF
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
23/29
23
FRR Design Challenge #4FRR Design Challenge #4
Description
How to design LSP that resists re-routing in a FRR-enabled
network environment Customers with ultra-delay sensitive applications who do not
want to be placed on sub-optimal route
Customers prefer to be notified of the failure and switch to
alternative network/provider
Solution
Dedicate a set of do not re-route LSPs Disable the FRR at the LSP head end while maintaining the
node/link protection scheme network wide
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
24/29
24
Challenge #4 Solution LSP Tunnels Configlet GenerationChallenge #4 Solution LSP Tunnels Configlet Generation
Link-protection onrsvp interfaces
Mix of LSPs with NLPand without NLP
LSP with and
without NLP
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
25/29
25
LSP Scalability ChallengeLSP Scalability Challenge
Designing a large MPLS core network that convergesmultiple feeder networks with thousands of trunkingLSPs for each feeder network
Designing and managing a network with tens ofthousands of LSPs is not practical for serviceproviders
Network performance and operator ability totroubleshoot are negatively impacted as the number ofLSPs increases
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
26/29
26
A Practical Solution for LSP ConsolidationA Practical Solution for LSP Consolidation
Establish a full mesh of LSPs as the base LSP set inthe MPLS core network.
Set up feeder network logical trunking mesh aspseudowires within the base LSPs
Establish each set of PW with its own Class of Servicequeue for traffic management and policing
Build more than one set of base LSPs depending on
special requirements, e.g., do not re-route LSPs
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
27/29
27
LSP Scalability illustrationLSP Scalability illustration
Tunnels routedthru a core link
Pseudowires riding on
a LSP tunnel
One fully meshed set of LSP tunnels Many PWs riding on a LSP tunnel All PWs passing thru a link are protected
by FRR NLP
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
28/29
28
ConclusionsConclusions
FRR is a necessary and powerful tool in the MPLS TE-enabled network; however, it is also highly
complicated to implement MPLS TE is still a rapidly developing and evolvingtechnology with many gaps in the current standards
Although most major service providers have deployedMPLS TE-enabled networks, the learning curve is stillsteep, and there are many challenges ahead
A comprehensive FRR design and simulation tool likeMPLSView is needed to ensure proper BW protectionand efficient use of network resources
-
8/2/2019 Building a Highly Adaptive Resilient and Scalable Mpls1352
29/29
29
Q&AQ&A
Ning So
Verizon Business
2400 N. Glenville
Richardson, Texas
Hao-Hsin Huang
WANDL, Inc
88 Centennial Ave.
Piscataway, NJ
Thank You!