
Pacific Research Platform Version 2 (PRPv2) Workshop

Summary of Workshop Presentations Richard Moore and Tom DeFanti

Workshop held February 21, 2017

Hosted by Calit2’s Qualcomm Institute (Calit2/QI) at University of California San Diego (UCSD)

This workshop was supported by NSF award ACI-1541349.

Additional support is from the University of California Office of the President (UCOP), the Corporation for Education Network Initiatives in California (CENIC), and UC San Diego Calit2


Table of Contents

Preface
Agenda
Workshop Context and Objectives
Session Notes
  Welcome and PRP Two Years In – Larry Smarr (UCSD)
  Session 1: Fast and Secure Data Transfer Panel – Moderator: Tom DeFanti (UCSD)
    1.1 Data Placement with HTCondor – Phil Papadopoulos (UCSD/SDSC)
    1.2 IPv6 and CENIC/ESnet/I2 Goals – John Hess (CENIC) and Tom Hutton (UCSD/SDSC)
    1.3 Fast Data Transfer: SC16 results, etc. – Azher Mughal (Caltech)
    1.4 Trusted Platform Modules – John Graham (UCSD/Calit2/QI)
    1.5 Session 1 Panel Q&A
  Session 2: Science Engagement Future Panel – Moderator: Frank Würthwein (UCSD)
    2.1 PRP Weather Data Transfer – Scott Sellars (UCSD/SIO)
    2.2 PRP Engagement Foci – Camille Crittenden (UCB/CITRIS) & Tom DeFanti (UCSD/Calit2/QI)
    2.3 PRP from the UCSC Point of View – Shawfeng Dong (UCSC)
    2.4 PRP from the UC Merced Point of View – Jeff Weekley (UCM)
    2.5 Session 2 Panel Q&A
  Session 3: Network Architecture for Globalization of PRPv2 Panel – Moderator: Cees de Laat (UVa)
    3.1 Cooperative Work Groups in PRPv2 – Leon Gommans (Air France/KLM)
    3.2 Science DMZs at KISTI – Jeonghoon Moon (KISTI)
    3.3 Fast Data Transfer: Brazil – Michael Stanton (RNP, Brazil)
    3.4 Observations on Expanding the PRP – Harvey Newman (Caltech)
    3.5 Session 3 Panel Q&A
  Session 4: SDN/SDX in PRPv2 Panel – Moderator: Tom Hutton (UCSD/SDSC)
    4.1 SDN in PRPv2 for secure flows – Phil Papadopoulos (UCSD/SDSC)
    4.2 100Gb Services for Data-Intensive Science Enabled by SDN/SDX – Joe Mambretti (Northwestern Univ)
    4.3 SDN/SDX in CENIC/Pacific Wave – John Hess (CENIC)
    4.4 Session 4 Panel Q&A


Preface

This Summary of Workshop Presentations provides a brief overview of the workshop, including notes from the talks and panel question-and-answer sessions. It is not intended to reproduce or summarize the information in the speakers’ presentations, which are available at pacificresearchplatform.org. The workshop video recording is also available there.

Recommendations and findings from the workshop will be incorporated into the planning for the remainder of the PRP project, and may be added to this report at a later time.

Agenda

10:00  Welcome and PRP Two Years In – Larry Smarr (UCSD)

Session 1: Fast and Secure Data Transfer Panel – Moderator: Tom DeFanti (UCSD)
10:30  1.1  Data Placement with HTCondor – Phil Papadopoulos (UCSD/SDSC)
10:42  1.2  IPv6 and CENIC/ESnet/I2 Goals – John Hess (CENIC) and Tom Hutton (UCSD/SDSC)
10:54  1.3  Fast Data Transfer: SC16 results, etc. – Azher Mughal (Caltech)
11:06  1.4  Trusted Platform Modules – John Graham (UCSD/Calit2/QI)
11:18  1.5  Q&A
11:45  Lunch, with optional walk to see Library CAVE kiosk

Session 2: Science Engagement Future Panel – Moderator: Frank Würthwein (UCSD)
1:45  2.1  PRP Weather Data Transfer – Scott Sellars (UCSD/SIO)
1:55  2.2  PRP Engagement Foci – Camille Crittenden (UCB) and Tom DeFanti (UCSD/Calit2/QI)
2:05  2.3  PRP from the UCSC Point of View – Shawfeng Dong (UCSC)
2:15  2.4  PRP from the UC Merced Point of View – Jeff Weekley (UCM)
2:25  2.5  Q&A

Session 3: Network Architecture for Globalization of PRPv2 Panel – Moderator: Cees de Laat (UVa)
3:00  3.1  Cooperative Work Groups in PRPv2 – Leon Gommans (Air France/KLM)
3:12  3.2  DMZs at KISTI – Jeonghoon Moon (KISTI)
3:24  3.3  Fast Data Transfer: Brazil – Michael Stanton (RNP)
3:36  3.4  Observations on Expanding the PRP – Harvey Newman (Caltech)
3:48  3.5  Q&A
4:00  Break

Session 4: SDN/SDX in PRPv2 Panel – Moderator: Tom Hutton (UCSD)
4:30  4.1  SDN in PRPv2 for secure flows – Phil Papadopoulos (UCSD/SDSC)
4:45  4.2  100Gb Services for Data-Intensive Science Enabled by SDN/SDX – Joe Mambretti (Northwestern U)
5:00  4.3  SDN/SDX in CENIC/Pacific Wave – John Hess (CENIC)
5:15  4.4  Q&A
5:30  Reception


Workshop Context and Objectives

The Pacific Research Platform (PRP) is an ambitious project driven by the high-speed networking needs of collaborative, big-data science. Many research disciplines are increasingly multi-investigator and multi-institutional and need ever more rapid access to their ultra-large heterogeneous and widely distributed datasets. In response to this challenge, the Department of Energy’s ESnet developed the Science DMZ model, a network system optimized for high-performance scientific applications rather than for general-purpose or enterprise computing. The PRP extends the campus Science DMZ model, which is widely funded on individual campuses by NSF’s CC-NIE and CC-NII programs, to a regional model for data-intensive networking. It enables researchers to quickly and easily move data between collaborator labs, supercomputer centers, and data repositories, creating a big-data freeway that allows the data to traverse multiple, heterogeneous networks without performance degradation. The PRP’s data sharing architecture, with end-to-end 10–100 Gb/s connections, also enables region-wide virtual co-location of data with computing.

The main focus of the PRP project is to build a researcher-defined and data-focused network whose requirements are driven by direct engagements with sophisticated, cyberinfrastructure-knowledgeable Science Teams chosen from the fields of particle physics, astronomy, biomedical sciences, earth sciences, and scalable data visualization. It is a partnership of more than 20 institutions, including four National Science Foundation, Department of Energy, and NASA supercomputer centers.

A PRP workshop was held in October 2015, shortly after the NSF award was initiated, to bring together representatives from all PRP partner institutions, participating Science Team members, technical staff, and network architects and implementers to discuss deployment of the PRP and address interoperability of Science DMZs at a regional level (http://prp.ucsd.edu/presentations/2015-prp-workshop-report). The primary objectives of that workshop were:

• To engage the PRP team and collaborators and facilitate in-person interactions.
• Gain a common understanding of the science engagement process.
• Explore and capture data storage, computing, and networking requirements across five scientific domains identified as initial applications for the PRP.
• Identify common science-driven technical requirements for the PRP.
• Exchange technical ideas for the PRP’s technical implementation.

The technical design and build-out of the PRP project is being conducted in two phases – PRP Version 1 in years 1-2 and PRP Version 2 in years 3-5 – with the aim of achieving the following goals:

• PRPv1: Create a scalable network design for optimizing data transfer.
• PRPv2: Evolve to IPv6 with Cooperating Research Groups.
• PRPv2: Create OpenFlow as a firewall and explore other trust and security features.


Considerable progress has been made during the first 17 months of the project in deploying PRPv1 and successfully applying that system to a number of scientific applications. The current workshop is focused on the transition for the PRP project from Version 1 to Version 2. The PRPv1 capabilities will not be replaced by PRPv2, but rather PRPv2 is an exploration of means to improve PRPv1 and possibly address the increasing challenges of security from the campuses' point of view. The PRPv2 goals are not set in stone. This workshop was meant to offer an open discussion with alpha users, long-term national and international partners in networking and data transfer, and the implementers of PRPv1 and PRPv2. Its purpose is to debate and discuss the goals for PRPv2.

The PRPv2 workshop was held at UCSD/Calit2 on February 22, 2017, in conjunction with the ON*VECTOR International Photonics workshop February 23-24. The PRPv2 workshop was attended by 48 people, including 12 international attendees representing five countries (Brazil, Czech Republic, Korea, Netherlands and Japan).

Session Notes

The following section is based on contemporaneous notes from the talks and question-and-answer sessions. These notes are not intended to be comprehensive, nor to reproduce or summarize the speakers’ presentation materials, which are available as source material at pacificresearchplatform.org. The workshop video recording is also available there. These notes have not been reviewed by the participants/speakers, and as such they may inadvertently misrepresent the speakers’ statements or intents; please contact the participants directly if you have any questions.

Welcome and PRP Two Years In – Larry Smarr (UCSD)

The foundation for the PRP project started with ESnet developing the Science DMZ and Data Transfer Node (DTN) concepts.

PRP’s DTN boxes, the 10-100 Gb/s Flash I/O Network Appliances (FIONAs), are a critical enabler for the project. Recently the team has developed a less expensive version, FIONette, for ~1 Gbps sites.

The project monitors data transfer performance across the entire system four times per day. John Graham, PRP chief engineer, has played a critical role in the development/deployment of FIONAs and the monitoring capabilities.

The PRP project would not have been possible without the long-term substantial investments in CENIC. John Hess has played a critical role representing CENIC within the PRP project. CENIC hooks up campus gateways in a reliable fashion. PRP goes from campus network gateway to the researcher on campus. There has historically been a gap in this connectivity that the PRP is trying to fill – the campus CIO is responsible for the campus network, but often the department has responsibility for the researcher’s connectivity to that network.

There have been great strides already in PRPv1 to establish, use, and monitor the high-speed end-to-end connectivity between partner sites. (See the color matrices of end-point connectivity performance from January 2016 to December 2016 in the presentation, as well as real-time MaDDash monitoring results at http://prp-maddash.calit2.optiputer.net/maddash-webui/). More campuses/end-points have been added since the early demonstrations of the project, and there is improved bandwidth performance between those sites. These advances are a tribute to the collective, sustained activities – mostly on a volunteer basis with regard to this NSF grant – by the networking people at these sites.

There are already a number of science application successes during the first 1.5 years of the project. Existing teams include the high-energy physics group at Caltech (Harvey Newman), Shawfeng Dong from UCSC connecting their campus Hyades cluster to NERSC/LBNL and getting data from the Dark Energy Spectroscopic Instrument (located at Kitt Peak, ~800 GB/night), the virtual reality visualization capability at UC Merced (Jeff Weekley), and the PEER earthquake project led out of UC Berkeley. Some of these will be discussed in this workshop.
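To give a feel for what a ~800 GB/night data stream means for the network, the following is a minimal back-of-the-envelope sketch, not taken from the presentation. It assumes decimal units (1 GB = 8e9 bits) and an ideal link with no protocol or disk overhead; the function name and the `efficiency` parameter are illustrative assumptions.

```python
def transfer_time_seconds(size_gb: float, rate_gbps: float, efficiency: float = 1.0) -> float:
    """Return seconds needed to move size_gb (decimal gigabytes) at rate_gbps (gigabits/s).

    efficiency is an assumed fraction of the nominal link rate actually achieved.
    """
    if size_gb < 0 or rate_gbps <= 0 or not 0 < efficiency <= 1:
        raise ValueError("size must be >= 0, rate > 0, and efficiency in (0, 1]")
    bits = size_gb * 8e9  # 1 GB = 8e9 bits in decimal units
    return bits / (rate_gbps * 1e9 * efficiency)


NIGHTLY_GB = 800  # the ~800 GB/night figure quoted in the notes

# Compare the link classes mentioned in the notes (~1 Gbps FIONette sites, 10-100 Gb/s FIONAs).
for rate in (1, 10, 100):
    minutes = transfer_time_seconds(NIGHTLY_GB, rate) / 60
    print(f"{rate:>3} Gbps: {minutes:6.1f} minutes")
```

Under these idealized assumptions a night of data takes well over an hour at 1 Gbps but only minutes at 10 Gbps; real disk-to-disk transfers will be slower than the nominal link rate.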

And there are new science use cases that were not part of the original proposal, including connecting the microscope off Scripps Pier (Jules Jaffe, UCSD/SIO), regional downscaling of climate models (Dan Cayan/Mike Dettinger at UCSD/SIO), expansion of HPWREN (Frank Vernon, UCSD), and Scott Sellars and Marty Ralph from the Center for Western Weather and Water Extremes (UCSD/SIO) using the system (see later talk). There is a possible expansion to include the Ocean Observatories Initiative in 2017, connecting sensors on the sea floor off the state of Washington.

The work to date has also spurred new opportunities that could not previously have been pursued, with PRP providing high-speed access to emerging technologies. For example, there are new capabilities based on non-von Neumann architectures – TrueNorth by IBM, neural processors by KnuEdge, field-programmable gate array technologies, or ARM processors like Qualcomm’s Snapdragon – that PRP can facilitate access to. A proposal has been developed to support machine learning capabilities, using GPU-heavy FIONA boxes in a Condor-managed cloud. And the concept is expanding to international capabilities, led by some of the workshop’s international participants.

This talk has reviewed the past work – i.e. PRPv1. This workshop focuses on PRPv2 – looking ahead to SDN/SDX, IPv6, and other technologies.

We should thank the PRP partners – in Missouri terms, the team has been ‘raising a barn,’ based largely on partners’ volunteer labor contributing to these goals.

In response to a question about sustained funding, Larry indicated that he has talked with program officers at NSF and the funding prospects are uncertain both at an agency level and at the Office of Advanced CyberInfrastructure (ACI). There is some reorganization within the Office of ACI – NSF is looking for a new Office director, and there are priority debates within the office across supercomputers, data efforts, software and networking. He hopes there will be ongoing funding opportunities that would leverage the large investments NSF has already made, and notes that Internet2 has issued white papers citing networking’s role in major infrastructure projects for the country.

Session 1: Fast and Secure Data Transfer Panel – Moderator: Tom DeFanti (UCSD)

1.1 Data Placement with HTCondor – Phil Papadopoulos (UCSD/SDSC)

The talk discusses a project that is related to PRP with Dr. Miron Livny at U Wisconsin-Madison and other partners at Beihang University and the Computer Network Information Center (CNIC) in China.


PRP is often focused on raw performance (bandwidth). This project extends the metrics to reliability and access to data. More security issues are involved when touching disks, and it is an open question whether network measurement performance is a good proxy for disk-to-disk performance. This project measures end-to-end and disk-to-disk performance among a set of international endpoints using different data-movement protocols and IPv4 and IPv6.

There are a lot of known network traversals, but it’s tricky to know what the actual paths are.

A ‘test manifest’ is submitted as a routine cron job to a Condor pool, with HTCondor handling errors/recovery/reporting/iteration. A limited Condor pool is built across sites, with nodes needing to trust each other ‘enough.’ Servers are set up and torn down in user space for experiments.

See performance slides in the presentation. Many of the end-point/protocol results show substantial variability in performance, which is not necessarily understood at this point. The testing did indicate that iRODS performance is ~10% of the raw network performance.
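The notes do not describe the format of the ‘test manifest’, so the following is only a sketch of the idea under stated assumptions: one test per ordered endpoint pair, per data-movement protocol, per IP version. The endpoint names, the protocol list, and the dictionary layout are hypothetical placeholders, and nothing here is HTCondor's own submit syntax.

```python
from itertools import permutations, product


def build_manifest(endpoints, protocols, ip_versions=(4, 6)):
    """Return one test entry per ordered (source, destination) pair, protocol, and IP version."""
    manifest = []
    for (src, dst), proto, ipv in product(permutations(endpoints, 2), protocols, ip_versions):
        manifest.append({"src": src, "dst": dst, "protocol": proto, "ip_version": ipv})
    return manifest


def relative_performance(measured_gbps, raw_network_gbps):
    """Fraction of raw network throughput that a disk-to-disk transfer achieved."""
    if raw_network_gbps <= 0:
        raise ValueError("raw network throughput must be positive")
    return measured_gbps / raw_network_gbps


# Hypothetical endpoints and protocols, for illustration only.
manifest = build_manifest(["site-a", "site-b", "site-c"], ["gridftp", "irods"])
print(len(manifest), "tests per run")
print(manifest[0])

# A protocol reaching 1 Gbps on a path measured at 10 Gbps runs at 10% of raw network speed.
print(f"{relative_performance(1.0, 10.0):.0%}")
```

Each entry could then be handed to the scheduler as one job, leaving retries and reporting to HTCondor as described above.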

Commonly-used components include: HTCondor, Graphite, Carbon and Whisper database (open-sourced from Orbitz), Python > 2.6.x, Git; see slides for websites. All components have V4 and V6 implementations.

1.2 IPv6 and CENIC/ESnet/I2 Goals – John Hess (CENIC) and Tom Hutton (UCSD/SDSC)

PRPv1 is not unique in terms of networking architecture.

There is a general concern on campuses about Science DMZs bypassing campus firewalls. We don’t want to inadvertently create insecure paths. Also the project and campuses want to ensure that the PRP network is appropriately used for research purposes only.

What’s envisioned for PRPv2? PRPv2 will reflect a different strategy, using Border Gateway Protocol (BGP) communities for tagging classes of DMZ networks. The plan is to have a pilot implementation across six sites (UCSD, SDSC, Caltech, NCSA, UChicago … and one Northern California site). Pilot BGP peering will be native IPv6 only. A stretch goal is to incorporate SDN/SDX type signaling for paths or super-channels.

Take the ~20 sites in PRPv1 and take ~6 PRPv2 pilot sites. Use route servers located at Pacific Wave sites. Tag routes for this purpose. Dynamic signaling. Data plane will traverse existing paths.
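The tagging idea can be illustrated with a small sketch: routes carry community tags, and a route server passes along only the IPv6 routes tagged as belonging to the pilot class of DMZ networks. This is a toy model rather than router configuration; the community value `65000:100`, the `Route` type, and the prefixes (drawn from documentation address ranges) are assumptions made for illustration and do not come from the talk.

```python
import ipaddress
from dataclasses import dataclass

PRPV2_DMZ_COMMUNITY = "65000:100"  # hypothetical community value for pilot Science DMZ routes


@dataclass(frozen=True)
class Route:
    prefix: str             # e.g. "2001:db8:1::/48"
    communities: frozenset  # community strings attached to the route


def select_pilot_routes(routes, community=PRPV2_DMZ_COMMUNITY):
    """Keep only native IPv6 routes that carry the given community tag."""
    selected = []
    for route in routes:
        network = ipaddress.ip_network(route.prefix)
        if network.version == 6 and community in route.communities:
            selected.append(route)
    return selected


routes = [
    Route("2001:db8:1::/48", frozenset({PRPV2_DMZ_COMMUNITY})),  # tagged IPv6 DMZ prefix
    Route("2001:db8:2::/48", frozenset()),                       # untagged campus IPv6 prefix
    Route("192.0.2.0/24", frozenset({PRPV2_DMZ_COMMUNITY})),     # tagged, but IPv4
]

for route in select_pilot_routes(routes):
    print(route.prefix)
```

Filtering on a tag rather than on a hand-maintained prefix list means a site can add or withdraw a DMZ network by changing the tag it announces, which is one plausible reading of why communities were proposed.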

Proposing IPv6 as a mechanism to deal with establishing specific routes connecting groups – generally there is IPv6 address space available. And one of the objectives here is to accelerate adoption of IPv6.

ESnet has four public DTNs that are available for read-only transfers. PRP may want to consider a similar approach.

Tstat is a way to instrument data transfers (think of it as Netflow++). They would like to get some major centers to incorporate tstat, but it won’t be everyone. They would also like to be able to generate reports on the various connections via tstat.

Tom DeFanti clarifies that PRPv1 is not going away, but rather will persist through the next phase of the project, with PRPv2 being a parallel testbed.

1.3 Fast Data Transfer: SC16 results, etc. – Azher Mughal (Caltech)

Talk focuses on demonstrations conducted at SC’16 and plans for SC’17. See presentation materials for goals of demonstrations.

The team had a few 1 Tbps links across the show floor (Caltech, StarLight, SCinet), more 100 Gbps links.


SC16 demos across CENIC: PRP is built on top of the CENIC network backbone. One of the objectives is to ‘energize’ the science teams so that researchers can see what can be achieved.

1 Tbps booth-to-booth transfers:

• Looked at various NVMe drives for high-speed transfers. Built a low-cost DTN server that could drive ~100 Gbps. Maximum throughput at 14 drives (7 drives/processor).

Need processor with more PCIe lanes in order to get higher speeds. Intel Skylake?

Have achieved transmission across 4 Mellanox VPI NICs – close to 400 Gbps.
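The arithmetic behind the drive count and the PCIe remark can be sketched as follows. The ~100 Gbps target and the 14-drive, 7-per-processor layout come from the notes; the lane counts (x4 per NVMe drive, x16 for one 100G NIC) are common values assumed here for illustration, not figures from the talk.

```python
def per_drive_gbytes_per_s(target_gbps: float, n_drives: int) -> float:
    """Sustained GB/s each drive must deliver so that n_drives together reach target_gbps."""
    if n_drives <= 0:
        raise ValueError("need at least one drive")
    return target_gbps / 8 / n_drives  # 8 bits per byte


def pcie_lanes_needed(n_drives: int, lanes_per_drive: int = 4, nic_lanes: int = 16) -> int:
    """PCIe lanes consumed by the drives plus one NIC (lane counts are assumptions)."""
    return n_drives * lanes_per_drive + nic_lanes


print(f"per drive: {per_drive_gbytes_per_s(100, 14):.2f} GB/s")
print(f"lanes per processor (7 drives + 1 NIC): {pcie_lanes_needed(7)}")
```

Under these assumptions each drive has to sustain a little under 1 GB/s, and the lane budget per processor fills up quickly, which is consistent with the remark that more PCIe lanes are needed to go faster.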

SC17 goals

• East-West integration with other controllers along with state, recovery, provisioning, monitoring
• Demonstrating SENSE project for DTN auto-tuning (SENSE = SDN for end-to-end networked science at exascale)
• NVMe over Fabrics across the WAN
• DTN design using 200G NICs (Mellanox/Chelsio)

1.4 Trusted Platform Modules – John Graham (UCSD/Calit2/QI)

Trusted Platform Modules (TPMs) offer facilities for the secure generation of cryptographic keys, and limitation of their use, in addition to a random number generator. TPMs are low-cost devices, allowing systems to interact securely.

Have successfully implemented UC-Jupyter on SDSC’s Comet HPC system using TPMs. JupyterHub authenticates a user with CILogon and spawns kernels on Comet. Trusted Platform Modules (TPMs) on the JupyterHub FIONAs secure the keys from the CILogon member organization. These keys are used to connect jupyter.calit2.optiputer.net to comet.sdsc.edu.

Globus and XSEDE have CILogon authentication, so the approach can extend to many systems/users.

How do you scale access to Jupyter notebooks campus-wide? Berkeley has been doing it. Doing a pilot now with Prof Coleman at UCSD.

Developing next-generation GPU Jupyter box. Dual 8-core CPUs, 8 GPUs/box, 6 x 480 GB SSDs, 2 NVMe bays, ~$13K.

Distributed Trusted Computing. Using Kubernetes, a spinoff of Google infrastructure tools; see slides for list of features. Take container, sign container with TPM, then no one can tamper with anything in container.
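The sign-then-verify idea can be illustrated in software. The sketch below is not the project's TPM workflow: it uses a keyed hash (HMAC-SHA256) from the Python standard library as a stand-in for a signature made with a TPM-protected key, and the key and image bytes are made-up placeholders. With a real TPM the key would stay inside the device instead of living in process memory.

```python
import hashlib
import hmac


def sign_image(image_bytes: bytes, key: bytes) -> str:
    """Hash the container image, then authenticate the digest with the secret key."""
    digest = hashlib.sha256(image_bytes).digest()
    return hmac.new(key, digest, hashlib.sha256).hexdigest()


def verify_image(image_bytes: bytes, key: bytes, signature: str) -> bool:
    """Return True only if the image still matches the signature made at signing time."""
    return hmac.compare_digest(sign_image(image_bytes, key), signature)


key = b"placeholder-secret-key"          # stand-in for a key that a TPM would protect
image = b"container image layer bytes"   # stand-in for real image contents

signature = sign_image(image, key)
print(verify_image(image, key, signature))                # unmodified image
print(verify_image(image + b"tampered", key, signature))  # modified image
```

Strictly, signing makes tampering detectable rather than impossible: a modified image no longer verifies, so it can be refused before it runs.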

KubeSpawner (jupyterhub-kubernetes-spawner). Can spawn single-user notebook servers on a Kubernetes cluster – multiple entities resident on same hardware.

1.5 Session 1 Panel Q&A

There was a question for John Graham about the SSDs and cooling issues in the next-gen GPU box. John indicated there are 6 SSD drives and 2 NVMe bays. There are eight GPUs and four fans. John notes that the unit has been ordered but not yet run, so it will be interesting to see how cool and how loud the box is; they have had similar boxes and expect the cooling to work. Azher Mughal notes that they have had to look at high-speed fans for their boxes.

In response to a question on liquid-cooled options, John Graham replies that he has seen some liquid-cooled GPUs, but low-profile units are required to fit in the box.


There was a question for Azher Mughal about whether special configurations were required for high-speed transfers. Yes, the systems had to be configured and tuned to achieve high speeds.

There was a question for Phil Papadopoulos about why the setup for IPv6 was more painful than IPv4 in their testing. The primary issues were limitations on the UCSD campus. Because there is not that much experience with IPv6 on campus, there are not very many IPv6 services set up yet. For example, it took a while to get required network info, and routes are static. This should improve with more experience.

Cees de Laat notes that the IPv6 services available to him locally in the Netherlands are pretty robust. Phil was jealous.

Tom Hutton notes that because IPv6 is optional rather than required for many people, it keeps getting deferred. In contrast, Phil’s Chinese collaborators only had IPv6 address space, so they had to get up to speed. The tentative plan at UCSD is to set up a sub-domain within the DMZ Optiputer network that’s IPv6.

George Peek asked a general question – from a PRP perspective, what’s the biggest headache for IPv6? Tom Hutton and Phil Papadopoulos replied ‘Applications.’ John Hess noted that sysadmins may be reluctant to set up rule sets for IPv6.

Joe Mambretti asked John Hess about IPv4 and IPv6 issues for various institutions in PRPv2. John replies that in exercises with NCSA or UChicago, the collection of routes have great resources but we’re only interested in the subset of routes that map to addresses for Science DMZ and high-performance resources. Most large institutions may have routes that are both v4 and v6. Want to make the network IPv6-friendly enough that researchers don’t need to worry about IPv4 or IPv6 configurations – i.e. users initiate a request to move data and the network works. The technical end is often easier, while the science engagement piece can be more challenging.

Session 2: Science Engagement Future Panel – Moderator: Frank Würthwein (UCSD)

Frank Würthwein posed three questions for the speakers in this session – noting that the speakers have not been informed of these questions in advance of their talks!

• How can PRPv2 add value to your work?
• Is PRPv2 going to serve IT or science needs, or both?
  o Are you concerned that IT and science needs could be at odds with each other in PRPv2?
• How can we make change least disruptive to existing PRPv1 science use?

2.1 PRP Weather Data Transfer – Scott Sellars (UCSD/SIO)

His group is handling large climatology datasets (see presentation for list).

They had a recent workflow to identify features in water transfer models – took about 20 days for a 4-step process with much of that being network data transfer time.

They have worked with PRP and others to expedite this workflow and improve their scientific productivity. The download speed increased ~4X using PRP network and FIONAs. They have also brought in Kepler workflow software, got connected to SDSC’s Comet system, and are using Jupyter.

2.2 PRP Engagement Foci – Camille Crittenden (UCB/CITRIS) & Tom DeFanti (UCSD/Calit2/QI)

PRP will be participating in a number of workshops:


• Add-on to annual CENIC conference March 20, 2017.
• Big Data and Earth Sciences May 30-June 2 with Scott Sellars.
• In planning: Cryo-EM, machine learning, HPWREN.
• There is some additional participant support funding available in the grant.

There are a number of advantages of participating in these workshops, including exposure and potential experience for student participants.

The PRP also convenes regular PRP networking telecons – while some partners participate regularly, all partners have access to notices and minutes of these meetings.

The project would like to do better at an institutional level (compared to networking level) and in cracking the nut of science engagement across campuses. PRP wants more faculty, postdocs and staff on board.

Science engagement efforts do better when you can highlight science opportunities, e.g. in context of workshops. Engagements need person-to-person contact, and need ‘evangelists’ (beyond Larry Smarr) to talk with people. The project tried to recruit someone to fill this role (e.g. a postdoc) and ran into obstacles. Evangelism remains a challenge for the project.

Workshop participants are encouraged to reach out to scientists on their campuses. Networking problems are tough – and are getting attention of talented people; still need help on the social engineering side.

How does the project team know it’s being successful – when emails about problems stop? That’s an important measure, but there are broader issues beyond that metric. Generally IT people on a campus are in a different realm than professors and research staff. It’s important to introduce faculty to the IT staff that can help them.

2.3 PRP from the UCSC Point of View – Shawfeng Dong (UCSC)

He works in ‘Research and Faculty Partnerships (RFP),’ a division in the campus IT organization that is responsible for working with faculty/researchers at UCSC. He himself is an astrophysicist, with expertise in HPC simulations for his own research. It helps him in his role as interface to faculty that he has a strong research background.

See presentation for a slide on related Recent Awards at UCSC – they include a CC-NIE $500K award for Science DMZ and a $400K CC-DNI award for CI Engineer (funding him).

Their partners across campus: Computational Astrophysics, Astronomy, Particle Physics, Genomics, Chemistry, Biology, Data Science. Dominated by astrophysics, but are getting other people on campus engaged.

2.4 PRP from the UC Merced Point of View – Jeff Weekley (UCM)

He is the Director of Cyberinfrastructure and Research Computing at UC Merced.

One of his objectives is to elevate UCM to a first-class participant in PRP.

They are building cyberinfrastructure at UC Merced:

• Built a WAVE. Interestingly, in the School of Humanities and the Arts.
• Recipient of CC-* award for Science DMZ. Intend to support IPv6.
• Campus is planning to double its size in Project 2020 - $1.3B construction project. Should be able to build networking in the right way.
• Planning wireless/microwave links outside the campus.
• Maybe HPWREN 2.0 – Yosemite-Monterey Bay. National park service, UC, State of California, other agencies.

UC Merced has faculty that have significant networking needs in a number of pioneering applications.

Another objective is to build Human Capacity. They are removing roadblocks, repeating workflows, fostering touchstone projects across UC campuses, partnering with CITRIS and Calit2, and uncovering ‘force-multipliers.’

2.5 Session 2 Panel Q&A

Scott Sellars was asked about the computing environment for analysis he’s doing on his data. At this point it is primarily PC-level hardware with tools such as Matlab and Python. They may move to central HPC resources to speed up the analysis steps. He was also asked about the total reduction in time from the original 20-day workflow. The networking improvements reduced it to ~10 days. They may alter algorithms and move to central computing to reduce it further.
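The figures quoted for this workflow (about 20 days originally, a ~4X faster download, ~10 days afterwards) can be combined in an Amdahl's-law style estimate of how much of the original time was network transfer. This is a rough sketch that treats the approximate numbers as exact and assumes that only the transfer step was sped up; the function names are illustrative.

```python
def network_fraction(t_before: float, t_after: float, speedup: float) -> float:
    """Fraction f of the original time spent on the sped-up step.

    Solves t_after = t_before * ((1 - f) + f / speedup) for f.
    """
    if speedup <= 1 or t_before <= 0:
        raise ValueError("need speedup > 1 and a positive original time")
    return (1 - t_after / t_before) / (1 - 1 / speedup)


def remaining_floor(t_before: float, fraction: float) -> float:
    """Time left even if the sped-up step became instantaneous."""
    return t_before * (1 - fraction)


f = network_fraction(t_before=20, t_after=10, speedup=4)
print(f"network share of original workflow: {f:.0%}")
print(f"floor from the other steps: {remaining_floor(20, f):.1f} days")
```

With these numbers roughly two thirds of the original 20 days would have been data transfer, and the remaining steps take about a week regardless of network speed, which fits the remark that further gains would have to come from algorithm changes and central computing.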

Tom DeFanti was asked about PRP demonstrations planned, in addition to workshops, as a means to engage with science groups. Tom notes that there have been some key demonstrations, including the efforts at the annual Supercomputing conferences.

The panel was asked about more specific ideas for workshop participants to better engage faculty, including efforts that would scale to large numbers of faculty. Camille Crittenden highlighted the concept of force multipliers – e.g. researchers that are involved in research IT efforts already. Speaking at conferences is good for achieving scale. Another idea is to ask researchers that have used PRP to acknowledge it in publications, just as they would grant funding. Tom DeFanti notes that one approach is to pick someone in a sector and focus resources on them to ‘make them famous.’ Then that person’s colleagues will hear what can be achieved by leveraging something like PRP. Jeff Weekley comments that he pitches PRP and big data movers and Science DMZ – no matter what the topic when he’s talking with faculty. Also, if he can’t get to faculty, he gets to grad students. Camille Crittenden noted that a good time to engage is when new faculty members are just joining the university. Shawfeng Dong commented that their engagement has been helped by the fact that his boss is an adjunct faculty member, and Shawfeng personally has been part of research groups.

Later in the discussion Harvey Newman noted that part of science engagement is to show people what others are able to achieve compared to what they’ve been doing … and hence how much time they’re wasting.

Tim Lance asked about engagement/evangelization via ‘listening versus talking.’ Jeff Weekley commented that he listens to what people need and doesn’t build what isn’t needed. Tom DeFanti noted that he talked with researchers in machine learning in order to come up with ideas for the hardware to support it.

The session moderator Frank Würthwein was asked about his progress in establishing high-speed end-to-end connectivity for LHC high-energy physics researchers across UC and other campuses. His team has put boxes at ~5 campuses (Davis, Irvine, Riverside, San Diego, soon Santa Barbara) and they are all connected at 10 Gbps. They have benchmarked the infrastructure and just recently submitted a research paper on the project. UC Irvine is probably making the most science use of the capability right now. In terms of lessons learned, he had thought it would be ‘cookie cutter’ deployments, but found out that all the cutters and cookies were a little different. It was also harder to get adoption than expected – there is another step between getting working equipment at a site, and training the local researchers how to operate and use it.

Shawfeng Dong and Jeff Weekley were asked to what extent multi-campus or multi-nodal collaborations (within campus) are important to research on their campuses. Shawfeng Dong replied that the LHC collaboration was important to the campus effort, and the PRP effort really made a difference in reducing significant bottlenecks. Jeff Weekley characterized multi-campus collaborations as an example of a force multiplier that he had noted in his talk. He has made investments that enable campus->edge capabilities and that will make a difference. He’s seen examples of other campuses that just have awful networking connectivity and it’s a real barrier to collaboration/research.

The panel was asked about using public DTNs that are close to POPs (similar to what ESnet has done). The general reply was that DTNs at POPs are a good idea, and that it would be an excellent idea for there to be centralized data repositories with high performance. One participant noted that users still need to be informed how to discover and access those assets.

Scott Sellars was asked whether his primary challenge is getting the data in the first place or where it goes after he gets it. Scott replied that it’s a challenge that the central datasets are so large, since one has to make decisions in advance which portion of the data to download. If the infrastructure existed to easily pull down entire datasets and then prune data, he would prefer that approach.

Session 3: Network Architecture for Globalization of PRPv2 Panel – Moderator: Cees de Laat (UvA)

3.1 Cooperative Work Groups in PRPv2 – Leon Gommans (Air France/KLM)

‘Trusted Sharing of Big Data Assets in Cooperative Working Groups.’

There are barriers for business organizations to share data, e.g. whether others that you’re sharing with would get more value relative to your organization, intellectual property and legal concerns, and privacy protections.

The ‘Secure Digital Market Place’ concept allows community members to share data according to well-defined market rules.

Using high-speed bandwidth of the PRP to enable in-memory analyses of shared data, without storing the shared data, would help alleviate some of the data-sharing concerns. Could also utilize ‘containers’ on that data within the shared space.

Wants to develop a ‘future internet field lab’ (see slides).

3.2 Science DMZs at KISTI – Jeonghoon Moon (KISTI)

The speaker notes that DMZ has a very real – and non-scientific – meaning in Korea.

Have an SDN-based Data Transfer Node and a Layer 3-based DTN (FIONA box). The latter was used to join the SC’16 demonstration described earlier by Azher Mughal, and this node was just added to the PRP ‘MaDDash’ matrix of end-points.

They have achieved 9.2 Gbps (FTP) and 9.1 Gbps (GridFTP) on a 10 Gbps line from Seoul to Daejeon. Daejeon to PRP universities in California achieves 7-8 Gbps in both directions with GridFTP.

One of their science use cases is the AgroMeteorology Federation (AMF) in Korea, storing climate data and analyzing/sharing results.

Doing research to develop an “Open Science DMZ Cloud” with DTN/DMZ as a Service (DDaaS). They are also working on a project to develop a “SaaS OverCloud on DTN.”

3.3 Fast Data Transfer: Brazil – Michael Stanton (RNP, Brazil)

RNP is Brazil’s Research and Education Network. The RNP’s national backbone connects the capitals of each state and the federal capital (26 capitals and 13 other cities); most links are at 10G, except in the Amazon region. May move some of these to 100G in the next year+. There are also shared links to the US via ANSP and AmLight EXP, to Europe and Latin America via RedClara, and terrestrial links with Argentina. There is a concentration of universities/research centers/population in the São Paulo/Rio de Janeiro regions.

RNP has participated in SC demonstrations since 2004. In SC’16 (with the Caltech group), RNP demonstrated >95% usage of the 100G access to the US, via ANSP.

It’s important to talk about using data, not just moving it (and not just demos). A Brazilian e-astronomy lab collaborates with NCSA and JHU/Sloan data via high-speed links. When they started, they got 20-30 Mbps on a 1 Gbps line, and it took two weeks to transfer a 5.7 TB collection. Now they get ~1 Gbps using GridFTP (and a DTN box essentially at the POP). And there is a new project to enable efficient remote access to LNCC’s supercomputer (in Petropolis). Potential clients are a synchrotron light source and a Weather/Climate Lab. This capability may save the cost of purchasing a separate computer for the synchrotron.
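As a rough check on the transfer figures above, the time to move a collection is its size in bits divided by the sustained rate. The following is a minimal sketch, assuming decimal units (1 TB = 10^12 bytes) and a constant rate with no protocol overhead; the function name is illustrative and not from the talk.

```python
def transfer_time_seconds(size_bytes: float, rate_bits_per_sec: float) -> float:
    """Time to move size_bytes at a constant sustained rate (no overhead)."""
    return size_bytes * 8 / rate_bits_per_sec


TB = 10**12    # decimal terabyte (assumption)
DAY = 86_400   # seconds per day
HOUR = 3_600   # seconds per hour

collection = 5.7 * TB

days_at_20_mbps = transfer_time_seconds(collection, 20e6) / DAY
days_at_30_mbps = transfer_time_seconds(collection, 30e6) / DAY
hours_at_1_gbps = transfer_time_seconds(collection, 1e9) / HOUR

print(f"20 Mbps: {days_at_20_mbps:.1f} days")
print(f"30 Mbps: {days_at_30_mbps:.1f} days")
print(f"1 Gbps:  {hours_at_1_gbps:.1f} hours")
```

Under these assumptions, 20-30 Mbps works out to roughly 18 to 26 days, the same order as the two weeks reported, while ~1 Gbps brings the same transfer down to about half a day.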

He notes that he has seen interesting work here on high-speed DTNs that he will be taking back to Brazil.

3.4 Observations on Expanding the PRP – Harvey Newman (Caltech)

This is the time to expand horizons, to think in the large. There are programs on the horizon that will be even bigger than LHC – e.g. LSST and SKA.

Historically, ESnet traffic has grown ~10X over 4 years. Much of ESnet traffic is LHC data. In the past year alone, with LHC Run 2, LHC data growth was ~2.7X.

On the other hand, we have not been in a period of high technological advancement. We’re in the 4th-5th year of ~100 Gbps technology. The next technology step coming is ~400 Gbps. That rate of technology improvement is not keeping pace with traffic growth rates.
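The mismatch between traffic growth and technology steps can be made concrete with a short calculation. This is a minimal sketch, assuming steady compound growth at the historical ~10X over 4 years quoted above; the function names are illustrative.

```python
import math


def annual_factor(total_factor: float, years: float) -> float:
    """Equivalent yearly growth factor for a total growth over `years`."""
    return total_factor ** (1 / years)


def years_to_grow(target_factor: float, yearly_factor: float) -> float:
    """Years of compound growth needed to reach target_factor."""
    return math.log(target_factor) / math.log(yearly_factor)


# ~10X over 4 years, as quoted for ESnet traffic.
traffic_yearly = annual_factor(10, 4)

# One technology step: ~100 Gbps to ~400 Gbps is a 4X capacity increase.
years_to_absorb_step = years_to_grow(400 / 100, traffic_yearly)

print(f"yearly traffic growth factor: {traffic_yearly:.2f}")
print(f"years for traffic to grow 4X: {years_to_absorb_step:.1f}")
```

Under that assumption, traffic grows by a factor of about 1.8 per year, so a 4X step in link technology would be absorbed in roughly two and a half years, which is shorter than the 4-5 years already spent at ~100 Gbps.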

SC16 demo – up to 1 Tbps local, many 100 Gbps connections.

There is a major opportunity for a new CPU/storage/network ecosystem + Large Computing Facilities (LCFs) as focal points.

SENSE effort: “SDN for end-to-end networked science at exascale” with partners ESnet, Caltech, Fermilab, Argonne, Maryland and LBNL.

PRP has become a locus for activity in this area.

3.5 Session 3 Panel Q&A

The panel was asked about future major drivers. Harvey Newman provided examples such as LSST, genomics, and the Internet of Things.

There was a question about the latency of the path to Korea in the SC16 demonstration: it approached 400 msec round-trip.
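A round-trip time of this size matters because the amount of data that must be in flight to keep a path full is the bandwidth-delay product. The sketch below assumes a 10 Gbps path (the line rate quoted for KISTI in section 3.2, not a figure given for the SC16 path itself) and uses the standard relation that a single window-limited stream cannot exceed window / RTT; the function names are illustrative.

```python
def bdp_bytes(rate_bits_per_sec: float, rtt_sec: float) -> float:
    """Bandwidth-delay product: bytes in flight needed to fill the path."""
    return rate_bits_per_sec * rtt_sec / 8


def window_limited_rate_bps(window_bytes: float, rtt_sec: float) -> float:
    """Upper bound on one stream's throughput for a given window size."""
    return window_bytes * 8 / rtt_sec


RTT = 0.400          # seconds, round trip, as quoted above
LINK_RATE = 10e9     # bits per second (assumed, see lead-in)

bdp = bdp_bytes(LINK_RATE, RTT)
rate_64k_window = window_limited_rate_bps(64 * 1024, RTT)

print(f"bytes in flight to fill the path: {bdp / 1e6:.0f} MB")
print(f"single stream, 64 KiB window:     {rate_64k_window / 1e6:.2f} Mbps")
```

Under these assumptions about 500 MB must be in flight to fill the path, while a single stream with an unscaled 64 KiB window would be held to little more than 1 Mbps. This is one reason long-distance transfers rely on large buffers and parallel streams.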

For many workflows, data in a repository is shipped somewhere for analysis, and then results are returned to the researcher. The question was posed: what are the computing environments at the endpoints? Harvey Newman replied regarding the LHC computing environment – e.g. ~20 Tier 1 sites, ~150 Tier 2 sites (each with several petabytes of storage and several thousand job slots). Data transfers are typically ~2 TB; if you could move 10s of TBs efficiently, that would significantly increase productivity.

There was a question about the Brazilian Amazon rivers program, which was primarily deferred to the following ON*VECTOR workshop. Michael Stanton said that they have completed ~250 km in the past year, and expect to set up the next 400 km within the next 4-5 months.

Leon Gommans was asked for additional information on airline data sharing. Leon cited the example that when an aircraft lands, there is ~500 GB of data available per flight. With a fleet of aircraft over a long period of time, there could be exabytes of data theoretically available. Can this data be shared effectively?

Session 4: SDN/SDX in PRPv2 Panel – Moderator: Tom Hutton (UCSD/SDSC)

4.1 SDN in PRPv2 for secure flows – Phil Papadopoulos (UCSD/SDSC)

The talk addresses using an OpenFlow switch as a low-impact firewall.

In Linux, every packet is looked at, meaning there is overhead on every packet. But Linux can’t make decisions based on other ‘meta’ data.

The purpose here is to use an OpenFlow switch to make decisions based on flows, reducing overhead per packet. The drawbacks: the controller loses observability of the flow, flow-table space is limited, and if the controller goes offline, the network goes away.

They tested how much delay could be introduced in initiating rules for the flow without impacting network performance: 0.5 sec (a lot!) had almost no impact, while 5 sec crippled the network. So it looks reasonable to use an OpenFlow switch as a firewall.
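The flow-based approach described above can be illustrated with a toy model. This is a sketch only, not the speaker’s implementation and not a real OpenFlow API: the class, the policy function, the least-recently-used eviction rule, and the choice to drop table misses when no controller is reachable are all assumptions made for illustration.

```python
from collections import OrderedDict
from typing import Callable, NamedTuple, Optional


class Flow(NamedTuple):
    src: str
    dst: str
    dst_port: int


class FlowSwitch:
    """Toy switch with a bounded flow table and a remote controller."""

    def __init__(self, policy: Optional[Callable[[Flow], bool]], table_size: int = 4):
        self.policy = policy            # None models an offline controller
        self.table_size = table_size    # flow-table space is limited
        self.table = OrderedDict()      # Flow -> allowed (True/False)
        self.controller_lookups = 0

    def handle_packet(self, flow: Flow) -> bool:
        """Return True if the packet is forwarded, False if it is dropped."""
        if flow in self.table:
            # Table hit: decided in the switch, no controller involvement.
            self.table.move_to_end(flow)
            return self.table[flow]
        if self.policy is None:
            # Table miss with no controller available: drop (assumed behavior).
            return False
        # Table miss: only the first packet of a flow reaches the controller.
        self.controller_lookups += 1
        allowed = self.policy(flow)
        if len(self.table) >= self.table_size:
            self.table.popitem(last=False)  # evict least recently used rule
        self.table[flow] = allowed
        return allowed


def allow_gridftp_only(flow: Flow) -> bool:
    # Hypothetical policy: permit only the default GridFTP control port.
    return flow.dst_port == 2811


switch = FlowSwitch(allow_gridftp_only)
bulk = Flow("10.0.0.1", "10.0.0.2", 2811)
results = [switch.handle_packet(bulk) for _ in range(1000)]
blocked = switch.handle_packet(Flow("10.0.0.9", "10.0.0.2", 22))
print(all(results), blocked, switch.controller_lookups)
```

In this model a thousand packets of one flow cost a single controller decision, which is the per-packet saving the talk describes. Setting `switch.policy = None` shows the failure mode noted above: flows already in the table keep working, but every new flow is dropped.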

There are additional possibilities with IPv6.

4.2 100Gb Services for Data-Intensive Science Enabled by SDN/SDX – Joe Mambretti (Northwestern Univ)

‘OpenFlow is dead.’ Orchestration is critical. Highly programmable networks. Lots of important work coming out of GENI.

SDXs are in the US, but where else? Some at SURFnet, one soon in Germany, one other in Europe. Will be in Taiwan, Singapore, Korea, Tokyo.

There were many demos at SC16, most dependent on SDN techniques.

He likes mdtmFTP compared to GridFTP.

Wants to move towards the Global Research Platform – a worldwide PRP.

4.3 SDN/SDX in CENIC/Pacific Wave – John Hess (CENIC)

Pacific Wave is a joint collaboration between CENIC and Pacific Northwest Gigapop (PNWGP), supporting both commercial and R&E peers, serving countries across the Pacific connecting to the western US.

Pacific Wave received an NSF/ACI IRNC award in September 2015 to support more 100G connections and to deploy SDN/SDX on parallel infrastructure to enable experimentation while maintaining production use of the Pacific Wave exchange. An SDX node is being established in Los Angeles, and an existing SDX node in Seattle is being leveraged; control nodes are located in Seattle and Los Angeles.

The group is participating in the worldwide ‘AutoGOLE/NSI Pilot’ project. (GOLE = GLIF Open Lightpath Exchanges, GLIF = Global Lambda Integrated Facility.)

4.4 Session 4 Panel Q&A

John Graham noted that the Globus features for sharing data, cited in one of the charts, cost money compared to the basic GridFTP data-movement capabilities. John Hess replied that yes, Globus has a licensing model that enables some additional features like multiple streams, data sharing, and third-party authentication. ESnet has a network-wide license for Globus, and Globus has allowed them to use this license for the pilot project. They hope to get a similar arrangement… but it is not clear yet how far it can be extended.

Phil Papadopoulos was asked whether the OpenFlow switch-as-firewall could be used for all data. He replied that it’s the ‘nice flows’ (not the short housekeeping flows) that would go through this system.

John Hess was asked how new institutions can join the SDN/SDX effort. He indicated that if an institution is interested, it is best to start the process by talking with him.