cees de laat surfnet eu

44
Cees de Laat Lambda-Grid developments History - Present - Future SURFnet EU BSIK NWO University of Amsterdam TI TNO NCF NCF

Upload: others

Post on 02-Feb-2022

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Cees de Laat SURFnet EU

Cees de Laat

Lambda-Grid developmentsHistory - Present - Future

SURFnetEUBSIK

NWOUniversity of Amsterdam

TITNO

NCFNCF

Page 2: Cees de Laat SURFnet EU

Contents

1. The need for hybrid networking

2. StarPlane; a grid controlled photonic network

3. RDF/Network Description Language

4. Tera-networking

5. Programmable networks

Page 3: Cees de Laat SURFnet EU

Tier 1

Tier2 Center

Online System

eventreconstruction

Italian Regional

Center

German

Regional Center

InstituteInstituteInstituteInstitute~0.25TIPS

Workstations

~100MBytes/sec

~0.6-2.5 Gbps

100 - 1000Mbits/sec

Physics data cache

~PByte/sec

~2.5 Gbits/sec

Tier2 CenterTier2 CenterTier2 Center

~0.6-2.5 Gbps

Tier 0 +1

Tier 3

Tier 4

Tier2 Center

LHC Data Grid HierarchyCMS as example, Atlas is similar

Tier 2

CERN/CMS data goes to 6-8 Tier 1 regional centers,and from each of these to 6-10 Tier 2 centers.

Physicists work on analysis “channels” at 135 institutes.Each institute has ~10 physicists working on one ormore channels.

2000 physicists in 31 countries are involved in this 20-year experiment in which DOE is a major player.

CMS detector: 15m X 15m X 22m

12,500 tons, $700M.

human=2m

analysis

eventsimulation

NIKHEF Dutch

Regional CenterFermiLab, USA

Regional Center

Courtesy Harvey Newman,CalTech and CERN

Page 4: Cees de Laat SURFnet EU

Grape6

Very!"igh-speed

Network

Data

Reservoir

Data analysis at University of Tokyo

Belle Experiments

X-ray astronomy

Satellite ASUKA

SUBARU

Telescope

NobeyamaRadio

Observatory#VLBI)

Nuclear experiments

Data

Reservoir

Data

ReservoirLocal

Accesses

DistributedShared

files

Data intensive scientific computation through global networks

Digital Sky Survey

Page 5: Cees de Laat SURFnet EU

CineGrid@SARA 2b of 5

Page 6: Cees de Laat SURFnet EU

Sensor Grids

~ 40 Tbit/s

www.lofar.org

eVLBI

Page 7: Cees de Laat SURFnet EU

Telescopes

Input nodes

Correlator nodes

Output node

..... To equal the hardware

correlator we need:

16 streams of 1Gbps

16 * 1Gbps of data

2 Tflops CPU power

2 TFlop / 16 Gbps =

1000 flops/byte

THIS IS A DATA

FLOW PROBLEM !!!

SCARIe: a research project to create a Software Correlator for e-VLBI.VLBI Correlation: signal processing technique to get high precision image from

spatially distributed radio-telescope.

The SCARIe project 2 of 5

Page 8: Cees de Laat SURFnet EU

BW requirements

#users

C

A

B

ADSL (12 Mbit/s) GigE

A. Lightweight users, browsing, mailing, home use

Need full Internet routing, one to all

B. Business/grid applications, multicast, streaming, VO’s, mostly LAN

Need VPN services and full Internet routing, several to several + uplink to all

C. E-Science applications, distributed data processing, all sorts of grids

Need very fat pipes, limited multiple Virtual Organizations, P2P, few to few

For the Netherlands 2007!A = !B = !C ! 250 Gb/sHowever:

• A -> all connects

• B -> on several

• C -> just a few (SP, LHC, LOFAR)

Page 9: Cees de Laat SURFnet EU

Towards Hybrid Networking!• Costs of photonic equipment 10% of switching 10 % of full routing

– for same throughput!

– Photonic vs Optical (optical used for SONET, etc, 10-50 k$/port)

– DWDM lasers for long reach expensive, 10-50 k$

• Bottom line: look for a hybrid architecture which serves all classes in a costeffective way

– map A -> L3 , B -> L2 , C -> L1 and L2

• Give each packet in the network the service it needs, but no more !

L1 ! 2-3 k$/portL2 ! 5-8 k$/port L3 ! 75+ k$/port

Page 10: Cees de Laat SURFnet EU

Trends

• We have made baby-steps on the path to optical

networking

– Still many mails and phone calls

• See several trends:

– lambda’s get fatter and cheaper

– photonic technology cheap per bandwidth

– embedded computation capacity increasing

– latency and high bandwidth congestion avoidance conflict

– ethernet is getting circuit properties (PBT)

– applications need more and more predictable behaviour

Page 11: Cees de Laat SURFnet EU

How low can you go?

Router

Ethernet

SONET

DWDM

Fiber

Application

Endpoint A

Application

Endpoint BRegional

dark

fiber

MEMS

POS

15454

6500

HDXc

GLIF

Trans-Oceanic

Local

Ethernet

NetherL

ight

Sta

rLig

ht

UK

Lig

ht

Page 12: Cees de Laat SURFnet EU

The playfield => GLIF

c5 of 6

Page 13: Cees de Laat SURFnet EU

In The Netherlands SURFnetconnects between 180:

- universities;- academic hospitals;

- most polytechnics;- research centers.

with an indirect ~750K userbase

Red crosses = StarPlane

~ 6000 km

scale

comparable

to railway

system

x

x

x

xx

Page 14: Cees de Laat SURFnet EU

Common

Photonic

Layer

(CPL) in

SURFnet6

supports up to

72 Lambda’s of

10 G each

40 G soon.

Dordrecht1

Breda1

Tilburg1

DenHaag

NLR

BT

BT NLR

BT

Zutphen1

Lelystad1

Subnetwork 4:Blue Azur

Subnetwork 3:Red

Subnetwork 1:Green

Subnetwork 2:Dark blue

Subnetwork 5:Grey

Emmeloord

Zwolle1

Venlo1

Enschede1

Groningen1

LeeuwardenHarlingen

Den Helder

Alkmaar1

Haarlem1

Leiden1

Assen1

Beilen1

Meppel1

Emmen1

Arnhem

Apeldoorn1

Bergen-op-

ZoomZierikzee

Middelburg

Vlissingen Krabbendijke

Breukelen1

Ede

Heerlen2Geleen1

DLO

Schiphol-Rijk

Wageningen1 Nijmegen1

Hilversum1

Hoogeveen1

Lelystad2

Amsterdam1

Dwingeloo1

Amsterdam2

Den Bosch1

Utrecht1

Beilen1

Nieuwegein1Rotterdam1

Delft1

Heerlen1

Heerlen1

Maastricht1

Eindhoven1

Maasbracht1

Rotterdam4

3XLSOP

IBG1 & IBG2Middenmeer1

Page 15: Cees de Laat SURFnet EU

Contents

1. The need for hybrid networking

2. StarPlane; a grid controlled photonic network

3. RDF/Network Description Language

4. Tera-networking

5. Programmable networks

Page 16: Cees de Laat SURFnet EU

StarPlane

DWDM

backplane

R

CP

U’s

R

CPU’sR

CPU’s

CP

U’s

R

CPU’s

R

NOC

CdL

C

P

U

’s

switch

university SURFnet

WS+AAA

NOCWS+AAA

SURFnet

Page 17: Cees de Laat SURFnet EU

QOS in a non destructive way!

• Destructive QOS:

– have a link or !

– set part of it aside for a lucky few under higher priority

– rest gets less service

!

! ! !

• Constructive QOS:

– have a !

– add other !‘s as needed on separate colors

– move the lucky ones over there

– rest gets also a bit happier!

Page 18: Cees de Laat SURFnet EU

Module Operation

> this schematic shows• several input fibres and one output fibre

• light is focused and diffracted such that eachchannel lands on a different MEMS mirror

• the MEMS mirror is electronically controlled to tiltthe reflecting surface

• the angle of tilt directs the light to the correct port

> in this example:• channel 1 is coming in on port 1 (shown in red)

• when it hits the MEMS mirror the mirror is tilted todirect this channel from port 1 to the common

• only port 1 satisfies this angle, therefore all otherports are blocked

diffraction grating

input and output fibres

collimating

lens

MEMS mirror array

(1 pixel per channel)

!1

!n

port 1common

ref Eric Bernier, NORTEL

Page 19: Cees de Laat SURFnet EU

Dispersion compensating modem: eDCO from NORTEL

(Try to Google eDCO :-)

sender receiver

T(f)transport as function of frequency T(f)

Solution in 5 easy steps for dummy’s :

1. try to figure out T(f) by trial and error

2. invert T(f) -> T-1(f)

3. computationally multiply T-1(f) with Fourier transform of bit pattern to send

4. inverse Fourier transform the result from frequency to time space

5. modulate laser with resulting h’(t) = F-1(F(h(t)).T-1(f))

sender with cpu, mod receiver

T(f)

h(t) F-1(F(h(t)).T(f))

F-1(F(h(t))*T-1(f)) F-1(F(F-1(F(h(t)).T-1(f))).T(f)) -> h(t)

(ps. due to power ~ square E the signal to send looks like uncompensated received but is not)

Page 20: Cees de Laat SURFnet EU

GRID Co-scheduling problem space

CPU DATA

Lambda’s

Extensively

under

research

New!

The StarPlane vision is to give flexibility directly to the

applications by allowing them to choose the logical topology

in real time, ultimately with sub-second lambda switching

times on part of the SURFnet6 infrastructure.

Page 21: Cees de Laat SURFnet EU

StarPlane First Light - 1

MAY 31th 2007

Page 22: Cees de Laat SURFnet EU

StarPlane First Light - 3

Very constant

and predictable!

Page 23: Cees de Laat SURFnet EU

Contents

1. The need for hybrid networking

2. StarPlane; a grid controlled photonic network

3. RDF/Network Description Language

4. Tera-networking

5. Programmable networks

Page 24: Cees de Laat SURFnet EU

Architecture SC06

Page 25: Cees de Laat SURFnet EU

Network Description Language

ObjectSubjectPredicate

• From semantic Web / Resource Description Framework.

• The RDF uses XML as an interchange syntax.

• Data is described by triplets:

Location Device Interface Link

name description locatedAt hasInterface

connectedTo capacity encodingType encodingLabel

Page 26: Cees de Laat SURFnet EU

The Modelling Process

Functional

Elements

Network

ElementsSyntax

<ndl:Device rdf:about="#Force10"> <ndl:hasInterface rdf:resource= "#Force10:te6/0"/></ndl:Device><ndl:Interface rdf:about="#Force10:te6/0"> <rdfs:label>te6/0</rdfs:label> <ndl:capacity>1.25E6</ndl:capacity> <ndlconf:multiplex>

<ndlcap:adaptation rdf:resource= "#Tagged-Ethernet-in-Ethernet"/> <ndlconf:serverPropertyValue rdf:resource="#MTU-1500byte"/> </ndlconf:multiplex> <ndlconf:hasChannel> <ndlconf:Channel rdf:about= "#Force10:te6/0:vlan4"> <ndleth:hasVlan>4</ndleth:hasVlan>

<ndlconf:switchedTo rdf:resource= "#Force10:gi5/1:vlan7"/> </ndlconf:Channel> </ndlconf:hasChannel></ndl:Interface>

Page 27: Cees de Laat SURFnet EU

NetherLight in RDF<?xml version="1.0" encoding="UTF-8"?>

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"

xmlns:ndl="http://www.science.uva.nl/research/air/ndl#">

<!-- Description of Netherlight -->

<ndl:Location rdf:about="#Netherlight">

<ndl:name>Netherlight Optical Exchange</ndl:name>

</ndl:Location>

<!-- TDM3.amsterdam1.netherlight.net -->

<ndl:Device rdf:about="#tdm3.amsterdam1.netherlight.net">

<ndl:name>tdm3.amsterdam1.netherlight.net</ndl:name>

<ndl:locatedAt rdf:resource="#amsterdam1.netherlight.net"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:501/1"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:501/3"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:501/4"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:503/1"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:503/2"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:503/3"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:503/4"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:504/1"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:504/2"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:504/3"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:504/4"/>

<ndl:hasInterface rdf:resource="#tdm3.amsterdam1.netherlight.net:501/2"/>

<!-- all the interfaces of TDM3.amsterdam1.netherlight.net -->

<ndl:Interface rdf:about="#tdm3.amsterdam1.netherlight.net:501/1">

<ndl:name>tdm3.amsterdam1.netherlight.net:POS501/1</ndl:name>

<ndl:connectedTo rdf:resource="#tdm4.amsterdam1.netherlight.net:5/1"/>

</ndl:Interface>

<ndl:Interface rdf:about="#tdm3.amsterdam1.netherlight.net:501/2">

<ndl:name>tdm3.amsterdam1.netherlight.net:POS501/2</ndl:name>

<ndl:connectedTo rdf:resource="#tdm1.amsterdam1.netherlight.net:12/1"/>

</ndl:Interface>

Page 28: Cees de Laat SURFnet EU

NDL Generator and Validator

see http://trafficlight.uva.netherlight.nl/NDL-demo/

Page 29: Cees de Laat SURFnet EU

NDL SN6

Visualisation

Page 30: Cees de Laat SURFnet EU

IP layer

Ethernet layer

STS layer

UTP!ber

layerlayer

OC"#$% layer

SONET switchwith

Ethernet intf.Endhost

Endhost

SONETswitch

Ethernet &SONET switch

SONET switchwith

Ethernet intf.

Universitédu

Quebec

StarLightChicago

Universiteitvan

Amsterdam

CA★NetCanada

MAN LANNew York

NetherLightAmsterdam

Multi-layer extensions to NDL

Page 31: Cees de Laat SURFnet EU

Universitédu

Quebec

CA★NetCanada Universiteit

vanAmsterdam

StarLightChicago

MAN LANNew York

NetherLightAmsterdam

Gigabit Ethernet

OC"#$%

OC"#$%

%x OC"#$%

%x OC"#$%

GE

can adapt GE

in STS"%&c

can adapt GE in STS"

%&c or STS"'c"(vcan adapt GE

in STS"'c"(v

)%% free*

)'+ free*

)+( free*

),' free*

A weird example

Page 32: Cees de Laat SURFnet EU

Universitédu

Quebec

CA★NetCanada Universiteit

vanAmsterdam

StarLightChicago

MAN LANNew York

NetherLightAmsterdam

Gigabit Ethernet

OC"#$%

OC"#$%

%x OC"#$%

%x OC"#$%

GE

can adapt GE

in STS"%&c

can adapt GE in STS"

%&c or STS"'c"(vcan adapt GE

in STS"'c"(v

)%% free*

)'+ free*

)+( free*

),' free*

The result :-)

Thanks to Freek Dijkstra & team

Page 33: Cees de Laat SURFnet EU

MultiDomain MultiLayer pathfinding

in action

Page 34: Cees de Laat SURFnet EU

MultiDomain MultiLayer pathfinding

in action

Page 35: Cees de Laat SURFnet EU

OGF NML-WGOpen Grid Forum - Network Markup

Language workgroup

Chairs:

Paola Grosso – Universiteit van Amsterdam

Martin Swany – University of Delaware

Purpose:

To describe network topologies, so that the outcome

is a standardized network description ontology and

schema, facilitating interoperability between different

projects.

https://forge.gridforum.org/sf/projects/nml-wg

Page 36: Cees de Laat SURFnet EU

RDF describing Infrastructure

content

content

RDF/CGRDF/CG

RDF/ST

RDF/NDL

RDF/NDL

RDF/VIZ

RDF/CPU

Application: find video containing x,

then trans-code to it view on Tiled Display

PG&CdL

Page 37: Cees de Laat SURFnet EU

Contents

1. The need for hybrid networking

2. StarPlane; a grid controlled photonic network

3. RDF/Network Description Language

4. Tera-networking

5. Programmable networks

Page 38: Cees de Laat SURFnet EU

TeraThinking• What constitutes a Tb/s network?

• CALIT2 has 8000 Gigabit drops ?->? Terabit Lan?

• look at 80 core Intel processor– cut it in two, left and right communicate 8 TB/s

• think back to teraflop computing!– MPI makes it a teraflop machine

• massive parallel channels in hosts, NIC’s

• TeraApps programming model supported by– TFlops -> MPI / Globus

– TBytes -> OGSA/DAIS

– TPixels -> SAGE

– TSensors -> LOFAR, LHC, LOOKING, CineGrid, ...

– Tbit/s -> ?

ref Larry Smarr & CdL

Page 39: Cees de Laat SURFnet EU

Need for discrete parallelism

• it takes a core to receive 1 or 10 Gbit/s in a computer

• it takes one or two cores to deal with 10 Gbit/s storage

• same for Gigapixels

• same for 100’s of Gflops

• Capacity of every part in a system seems of same scale

• look at 80 core Intel processor

– cut it in two, left and right communicate 8 TB/s

• massive parallel channels in hosts, NIC’s

• Therefore we need to go massively parallel allocating

complete parts for the problem at hand!

Page 40: Cees de Laat SURFnet EU

application

network

element

nc

application

network

element

network

element

network

element

nc nc nc

ac acac ac

nc nc nc

• The network is virtualized as a collection of

resources

• UPVNs enable network resources to be

programmed as part of the application

• Mathematica, a powerful mathematical

software system, can interact with real

networks using UPVNs

User Programmable Virtualized Networks allows the results

of decades of computer science to handle the complexities of

application specific networking.

Page 41: Cees de Laat SURFnet EU

139.63.145.0

139.63.145.1

139.63.145.15

139.63.145.16139.63.145.17

139.63.145.18

139.63.145.2

139.63.145.3

139.63.145.31

139.63.145.32

139.63.145.33

139.63.145.34

139.63.145.38

139.63.145.40

139.63.145.41

139.63.145.42139.63.145.43

139.63.145.44

139.63.145.45

139.63.145.46

139.63.145.49

139.63.145.50

139.63.145.51

139.63.145.52

139.63.145.63

139.63.145.64

139.63.145.65

139.63.145.66

139.63.145.68

139.63.145.69139.63.145.70139.63.145.71

139.63.145.72

139.63.145.73139.63.145.74

139.63.145.75

139.63.145.79

139.63.145.81

139.63.145.82

139.63.145.83

139.63.145.84

139.63.145.85

139.63.145.86

139.63.145.87

139.63.145.88

139.63.145.94 192.168.0.1 192.168.0.2

192.168.0.3

192.168.0.4

192.168.0.5

192.168.0.6

192.168.1.1

192.168.1.2

192.168.1.3

192.168.1.4

192.168.2.1

192.168.2.2

192.168.2.3

192.168.2.4

Visualisation

Initialization and BFS discovery of NEs

95.9

94.5

95.8

96.

95.9

99.9

99.

100.

100.

94.599.9

99.9

100.

95.899.

96.

99.9

100.

100.

99.8

100.

100.

98.9

99.8

100.

100.

100.

100.

100.

98.9

101.

100.

101.100.

100.

100.

139.63.145.94192.168.0.1192.168.0.2

192.168.0.3

192.168.0.4

192.168.0.5

192.168.0.6

192.168.1.1

192.168.1.2

192.168.1.3

192.168.1.4

192.168.2.1

192.168.2.2

192.168.2.3

192.168.2.4

Network flows using real-time

bandwidth measurements

nodePath = ConvertIndicesToNodes[

ShortestPath[ g,

Node2Index[nids,"192.168.3.4"],

Node2Index[nids,"139.63.77.49"]],

nids];

Print["Path: ", nodePath];

If[NetworkTokenTransaction[nodePath, "green"]==True,

Print["Committed"], Print["Transaction failed"]];

Path:

{192.168.3.4,192.168.3.1,139.63.77.30,139.63.77.49}

Committed

Transaction on shortest path with tokens

Topology matters can be dealt with algorithmically

Results can be persisted using a transaction service built in UPVN

Needs["WebServices`"]

<<DiscreteMath`Combinatorica`

<<DiscreteMath`GraphPlot`

InitNetworkTopologyService["edge.ict.tno.nl"]

Available methods:

{DiscoverNetworkElements,GetLinkBandwidth,GetAllIpLinks,Remote,

NetworkTokenTransaction}

Global`upvnverbose = True;

AbsoluteTiming[nes = BFSDiscover["139.63.145.94"];][[1]]

AbsoluteTiming[result = BFSDiscoverLinks["139.63.145.94", nes];][[1]]

Getting neigbours of: 139.63.145.94

Internal links: {192.168.0.1, 139.63.145.94}

(...)

Getting neigbours of:192.168.2.3

Internal links: {192.168.2.3}

Mathematica enables advanced graph queries, visualizations and

real-time network manipulations on UPVNs

ref: Robert J. Meijer, Rudolf J. Strijkers, Leon Gommans, Cees de Laat, User Programmable VirtualiizedNetworks, accepted for publication to the IEEE e-Science 2006 conference Amsterdam.

Page 42: Cees de Laat SURFnet EU

Functional building blocks

Network

layers

Application

layers Use Interface

Control Interface

(protocols API’s)

Network

Service

Application

5b of 6

Page 43: Cees de Laat SURFnet EU

Power is a big issue

• UvA cluster uses (max) 30 kWh

• 1 kWh ~ 0.1 "

• per year -> 26 k"/y

• add cooling 50% -> 39 k"/y

• Emergency power system -> 50 k"/y

• per rack 10 kWh is now normal

•• YO U BURN ABOUT HALF THE CLUSTERYO U BURN ABOUT HALF THE CLUSTEROVER ITS LIFETIME!OVER ITS LIFETIME!

• Terminating a 10 Gb/s wave costs about 200 W

• Entire loaded fiber -> 16 kW

• Wavelength Selective Switch : few W!

Page 44: Cees de Laat SURFnet EU

Questions ?

7 of 7

I did not talk about

...