Distributed Big Data Assets Sharing & Processing: Big Data Hub infrastructure. Jan Wester, C. de Laat, L. Gommans. TNO; System & Network Engineering, University of Amsterdam; Air France KLM

Post on 25-May-2020


Page 1

Distributed Big Data Assets Sharing & Processing

Big Data Hub infrastructure

Jan Wester, C. de Laat, L. Gommans

TNO

System & Network Engineering, University of Amsterdam

Air France KLM

Page 2

Fading Trust in Internet

[Figure: curves labeled "Dependency" and "Trust" over a time axis from 1980 to 2017, with the annotation "Research Gap!"]

Page 3

Main problem statement

• Organizations that normally compete have to bring data together to achieve a common goal!

• The shared data may be used for that goal but not for any other!

• Data may have to be processed in untrusted data centers.
  – How to enforce that using modern Cyber Infrastructure?
  – How to organize such alliances?
  – How to translate from strategic via tactical to operational level?
  – What are the different fundamental data infrastructure models to consider?

[Figure labels: Strategic Level, Tactical Level, Operational Level]
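The requirement that shared data "may be used for that goal but not for any other" can be read as a purpose-limitation rule. The following is only a minimal sketch of such a check, not the authors' implementation: the `SharingAgreement` record, the `is_permitted` function, and the party and purpose names are all hypothetical and chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SharingAgreement:
    """Hypothetical record of what an alliance agreed on for one data set."""
    dataset: str
    owner: str
    allowed_parties: frozenset   # organizations admitted to the alliance
    allowed_purposes: frozenset  # the common goal(s) the data may serve


def is_permitted(agreement: SharingAgreement, party: str, purpose: str) -> bool:
    """Allow use only if the party is a member AND the purpose was agreed."""
    return (party in agreement.allowed_parties
            and purpose in agreement.allowed_purposes)


# Illustrative values only (assumptions, not taken from the slides).
agreement = SharingAgreement(
    dataset="engine-sensor-data",
    owner="airline-a",
    allowed_parties=frozenset({"airline-a", "airline-b"}),
    allowed_purposes=frozenset({"component-health-monitoring"}),
)

print(is_permitted(agreement, "airline-b", "component-health-monitoring"))
print(is_permitted(agreement, "airline-b", "marketing"))
```

In this reading, the agreement itself belongs to the strategic and tactical levels, while a check like `is_permitted` is what would have to be enforced at the operational level.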

Page 4

© 2016 Internet2

Big Data Sharing use cases placed in airline context

Scales: Global Scale, National Scale, City / regional Scale, Campus / Enterprise Scale

• Cybersecurity Big Data: NWO COMMIT/ SARNET project, 3.5 FTE
• Aircraft Component Health Monitoring (Big) Data: NWO CIMPLO project, 4.5 FTE
• Cargo Logistics Data: (C1) DaL4LoD; (C2) Secure scalable policy-enforced distributed data processing (using blockchain); NLIP iShare project

Page 5

SAE Use Case envisaged research collaboration

Big Data Hub / Spoke or Industry initiative funding

Topsector Funding

SAE AeroSpace Group HM-1 working group

Use Case on aircraft sensor Big Data

Funding Agency

International Networking

Regional / National Networking

Local University

Aircraft MRO, OEM & Operators

Industry Standards Body

Page 6

Example model: Policy Enforced Data Processing

[Figure: a Secure Virtual Data Processing Vault containing Data-1, Data-2, Comp and Viz, placed in an Untrusted Unsecure Cloud or Data Center, with Org 1, Org 2, Org 3 and Org 4 as participants]

• Bringing data and processing software from competing organizations together for common goal
• Docker with encryption, policy engine, certs/keys, blockchain and secure networking
• Data Docker (virtual encrypted hard drive)
• Compute Docker (protected application, signed algorithms)
• Visualization Docker (to visualize output)
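One way to picture the policy engine and "signed algorithms" in this model is a gate that admits a compute job only if every data owner has approved exactly that algorithm. The sketch below is an illustration under stated assumptions, not the design shown on the slide: the `PolicyEngine` class and its methods are hypothetical, a SHA-256 digest allow-list stands in for real certificate-based signature checks, and a simple hash-chained log stands in for the blockchain.

```python
import hashlib
import json


def sha256_hex(data: bytes) -> str:
    """Hex SHA-256 digest, used here as a stand-in for a signature check."""
    return hashlib.sha256(data).hexdigest()


class PolicyEngine:
    """Hypothetical gate in front of the processing vault."""

    def __init__(self):
        self.approved = {}  # data set name -> digests its owner approved
        self.log = []       # hash-chained audit entries (blockchain stand-in)

    def approve(self, dataset: str, algorithm: bytes) -> None:
        """A data owner approves one exact algorithm for one data set."""
        self.approved.setdefault(dataset, set()).add(sha256_hex(algorithm))

    def authorize(self, datasets, algorithm: bytes) -> bool:
        """Allow the job only if every requested data set approved this code."""
        digest = sha256_hex(algorithm)
        allowed = bool(datasets) and all(
            digest in self.approved.get(name, set()) for name in datasets
        )
        self._record({"algorithm": digest,
                      "datasets": sorted(datasets),
                      "allowed": allowed})
        return allowed

    def _record(self, entry: dict) -> None:
        """Append an entry whose hash also covers the previous entry's hash."""
        prev = self.log[-1]["hash"] if self.log else "0" * 64
        body = json.dumps(entry, sort_keys=True)
        self.log.append({"entry": entry, "prev": prev,
                         "hash": sha256_hex((prev + body).encode())})


engine = PolicyEngine()
algorithm = b"def mean(xs): return sum(xs) / len(xs)"
engine.approve("Data-1", algorithm)
engine.approve("Data-2", algorithm)

print(engine.authorize(["Data-1", "Data-2"], algorithm))       # approved code
print(engine.authorize(["Data-1"], algorithm + b"  # edited"))  # altered code
```

Because each log entry's hash covers the previous one, a later change to an earlier decision would break the chain, which is the property a shared ledger would provide to the participating organizations in a stronger form.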

Page 7

Page 8

Science-DMZ

Page 9

UvA OpenLab

Page 10

Pacific Research Platform testbed involvement

© 2016 Internet2

ExoGENI Testbed

Research goal: Explore value of academic network research capabilities that enable innovative ways & models to share big data assets

prp.ucsd.edu

Page 11

Networks of Science DMZs & SDXs

[Figure: network diagram with the labels ISP, NFV, SDN, SDX, DMZ, DTN, client 1 to client n, Func-c1, Func-c3, Func-c4, Internet, Peer ISPs, and Supercomputing centers (NCSA, ANL, LBNL). Annotations: "DMZ contains DTN" and "Petabyte email service :-)"]

Page 12

[Figure labels: Validation; Fieldlab and Dissemination; UVA - OpenLab; KLM; NetherLight; GENI; Fed4Fire; Cloud; SURFSARA; …; TNO - Intrepid; Smart Data Factory; Innovations; SmartRail; To-Grip; C2D – Big Data Hubs; Arena; KAVE; AZURE; Use Cases …]

• Experimental facilities from day one!
• Proof of concepts demonstrating secure data sharing
• Blueprint, roadmap and standards where applicable
• Model for FAIR EOSC Infrastructure

[Figure labels: Data Hub and DTN, repeated four times]

Page 13

Program at Global Summit I2 in Washington DC, April 2017:

15h00 Cees de Laat, University of Amsterdam: Trusted Data Processing in Untrusted Environments.
15h05 Leon Gommans, Air France KLM: Trusted Big Data Sharing.
15h25 Rodney Wilson: Programmable Supernetworks, Science DMZ based Networking.
15h30 Panel of stakeholders, flash talks (~3 min each):
  • Inder Monga - ESnet - Data Science Driving Discovery.
  • Matt Zekauskas - Internet2 - Thoughts on Internet2 and Trusted Large Data Transfer.
  • Jerry Sobieski - NORDUnet - Issues of Big Data Sharing in a Global Science Collaboration.
  • Adam Slagell - NCSA - What are we trusting?
15h45 Panel discussion moderated by Cees de Laat
16h00 End of session.

Q&A