perfsonar going forward

20
perfSONAR Going Forward Eric Boyd, Internet2 http://www.perfsonar.net Internet2 Technology Exchange September 27 th 2016

Upload: dinhtu

Post on 13-Feb-2017

233 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: perfSONAR Going Forward

perfSONAR GoingForward

EricBoyd,Internet2http://www.perfsonar.net

Internet2TechnologyExchangeSeptember27th 2016

Page 2: perfSONAR Going Forward

ProblemStatement• TheglobalResearch&Educationnetworkecosystemiscomprisedofhundredsofinternational,national,regionalandlocal-scalenetworks.

September27,2016 2©2016,http://www.perfsonar.net

Page 3: perfSONAR Going Forward

ProblemStatement• Whilethesenetworksallinterconnect,eachnetworkisowned

andoperatedbyseparateorganizations(called“domains”)withdifferentpolicies,customers,fundingmodels,hardware,bandwidthandconfigurations.

September27,2016 3©2016,http://www.perfsonar.net

Page 4: perfSONAR Going Forward

ProblemStatement• Thiscomplex,heterogeneoussetofnetworksmustoperateseamlesslyfrom“endtoend”tosupportscienceandresearchcollaborationsthataredistributedglobally.

September27,2016 4©2016,http://www.perfsonar.net

Page 5: perfSONAR Going Forward

ProblemStatement• Inpractice,performanceissuesareprevalentanddistributed.• Whenanetworkisunderperformingorerrorsoccur,itis

difficulttoidentifythesource,asproblemscanhappenanywhere,inanydomain.

September27,2016 5

• Local-areanetworktestingisnotsufficient,aserrorscanoccurbetweennetworks.

©2016,http://www.perfsonar.net

Page 6: perfSONAR Going Forward

ProblemStatement:Hardvs.SoftFailures

• “Hardfailures”arethekindofproblemseveryorganizationunderstands– Fibercut– Powerfailuretakesdownrouters– Hardwareceasestofunction

• Classicmonitoringsystemsaregoodatalertinghardfailures– i.e.,NOCseessomethingturnredontheirscreen– Engineerspagedbymonitoringsystems

September27,2016 6©2016,http://www.perfsonar.net

Page 7: perfSONAR Going Forward

ProblemStatement:Hardvs.SoftFailures

• “Softfailures”aredifferentandoftengoundetected– Basicconnectivity(ping,traceroute,webpages,email)works

– Performanceisjustpoor

• Howmuchshouldwecareaboutsoftfailures?

September27,2016 7©2016,http://www.perfsonar.net

Page 8: perfSONAR Going Forward

September27,2016 8©2016,http://www.perfsonar.net

Page 9: perfSONAR Going Forward

Physicalpipethatleakswateratrateof.0046%byvolume.

è è

Network‘pipe’thatdropspacketsatrateof.0046%.

è è

Result100%ofdatatransferred,slowly,at<<5%optimalspeed.

Elephant Flows Place Great Demands on NetworksResult99.9954%ofwatertransferred,at“linerate.”

essentiallyfixed

determinedbyspeedoflight

Throughcarefulengineering,wecanminimizepacketloss.

September27,2016 9©2016,http://www.perfsonar.net

Page 10: perfSONAR Going Forward

SoftFailuresCausePacketLossandDegradedTCPPerformance

MetroArea

Local(LAN)

RegionalContinental

International

Measured (TCP Reno) Measured (HTCP) Theoretical (TCP Reno) Measured (no loss)

With loss, high performance beyond metro distances is essentially impossible

September27,2016 10©2016,http://www.perfsonar.net

Page 11: perfSONAR Going Forward

publicperfSONARServers(Sept2016)• Over2000publiclyregisteredservers

– Equalnumberofnon-registeredservers?• ESnet:50

– mostly10G,includesa40GhostinBoston• GEANT:22• Internet2:3• Someothertopdeployments:

– Onenet (24),AMPATH(8),bc.net (10),RNP(8),Canarie (13),kreonet(14),NERO(12),AARnet (19),JGN(17),CENIC(5),KANREN(5)

September27,2016©2016,http://www.perfsonar.net 11

Page 12: perfSONAR Going Forward

MoreperfSONARStatistics• TotalHosts:2001• TotalDomains:413• 75%arerunninglatestversion(auto-update)• 7%arerunningaversion<3.5(end-of-life,orphanhosts?)

• 27%ofthehostshaveanIPV6Address• 38%are.edu hosts• 75totaltop-leveldomains,940domains

September27,2016©2016,http://www.perfsonar.net 12

Page 13: perfSONAR Going Forward

MoreperfSONARStats• 95%RHEL/CentOS;5%Debian/Ubuntu• 10%areVMs• 40%usingjumboframes• NICSpeed:

– 1Gbps49.06%– 10Gbps44.17%– 40Gbps2.88%– 100Mbps 2.32%

September27,2016©2016,http://www.perfsonar.net 13

Page 14: perfSONAR Going Forward

perfSONAR 4.0• perfSONAR 4.0FeatureTour

– AndyLake– 9:00AM

• pScheduler DeepDive– MarkFeit– 9:30AM

September27,2016©2016,http://www.perfsonar.net 14

Page 15: perfSONAR Going Forward

Whatcomesnext?• perfSONAR SteeringGroupisengagedinaprojecttodefinestrategicplanforpost-perfSONAR 4.0efforts

• Fourbasicthemesarebeingconsidered:– Operationsefficiency– Automationofconfiguration,execution,&analysis– Performanceofthecloud– Careandfeedingoftheopensourceeffort

September27,2016©2016,http://www.perfsonar.net 15

Page 16: perfSONAR Going Forward

OperationsEfficiency• Reduceoperationseffortrequiredofnetworkoperatorsanddataintensivesciencecollaborations– EasydeploymentofephemeralpS Nodes:perfSONARcontainerization(e.g.Docker)

– Automateloganalysistoidentifykeysignatures– Improvedashboardvisualization– Enhancemeshconfigurations

Page 17: perfSONAR Going Forward

AutomationofConfiguration,Execution,&Analysis

• Configuration– Automatictestconfiguration

• SignificantprogressinperfSONAR 4.0– Automaticnodediscovery/insertionintotestmessages

• Execution– Auto-detectionofproblemsalongtheend-to-endpath– Automaticallyrunadditionaltests

• Analysis– Anomalydetectiontools– Intelligentalarming(e.g.adaptiveviamachinelearning)– Archivesforresearchdata

Page 18: perfSONAR Going Forward

PerformanceoftheCloud• Cloudservicesandcloudbasedworkflowsareimportantgoingforward.

– Toourresearchers– Toourlabsanduniversities

• Performancemonitoringneedstobeautomatedand‘ephemeral’tillthecloudworkflowisactive

• Supportatargetedpopularusecase– e.g.:perfSONAR AmazonEC2deployment– Showcasewhatispossible– Researchhowtosolvetheproblemmoregenerally

• Buildtowardsamoregeneralsolutioninfollow-onreleases• Observation:Increasingneed/useofperfSONAR onVMs

– Informtheuserofthelimitationsofwhatcanbeconcluded

Page 19: perfSONAR Going Forward

CareandFeeding• Expandnewofferings

– pScheduler isanewoffering(1.0release)whichwillpromptanewiterationofcommunityideas• Securitychallenges,vulnerabilities,andinefficiencies

– e.g.Expandunittesting– Review,refine,andreduceinefficientcodecomponents

• Supportevolvingdefaultoperatingsystems– e.g.CentOS6->Centos7,Debian 7->Debian 8->Debian 9

• PreferredAPIinterfaces,middlewarekeepevolving– e.g.SOAP/XML->REST/JSONAPIs– messagepassingarchitectures– e.g.RabbitMQ

• CustomCodeReplacement– RemoveperfSONAR customcodewithwell-supportedpackagesastheybecomeavailable

• ELK(Elasticsearch,Logstash,Kibana)• ESnet’s ReactTimeseries Charts• IU’sTimeSeriesDataService(TSDS)

• SmallNodes– Emergingtechnologiesarerapidlyevolving– Maintainacurrentsnapshotofbestofferings

Page 20: perfSONAR Going Forward

CommunityMilestones&Feedback• Milestones

– perfSONAR 4.0– RC1isoutthisweek– perfSONAR 4.0– RC2expectedinOctober– perfSONAR 4.0outinNovember

• Feedback– WhatdoyouneedfromperfSONAR post4.0?

September27,2016©2016,http://www.perfsonar.net 20