deepak nadig anantha, zhe zhang, byrav ramamurthy, brian

17
SNAG: SDN-managed Network Architecture for GridFTP Transfers Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian Bockelman, Garhan Attebury and David Swanson Dept. of Computer Science & Engineering, University of Nebraska-Lincoln 3rd Intl. Workshop on Innovating the Network for Data-Intensive Science (INDIS), SC 16, Salt Lake City, Utah. 13 th November 2016 This material is based upon work supported by the National Science Foundation under Grant No. 1541442.

Upload: dangkien

Post on 14-Feb-2017

233 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

SNAG: SDN-managed Network Architecture for GridFTP

TransfersDeepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian Bockelman, Garhan Attebury and David Swanson

Dept. of Computer Science & Engineering,

University of Nebraska-Lincoln

3rd Intl. Workshop on Innovating the Network for Data-Intensive Science (INDIS), SC 16,

Salt Lake City, Utah.

13th November 2016

This material is based upon work supported by the National Science Foundation under Grant No. 1541442.

Page 2: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

• Introduction

• SNAG Approach

• Integration Architecture

• Implementation

• Results

• Conclusions and Future Work

OUTLINE

2

Page 3: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

Software Defined Networks (SDN) are transforming research and education (R&E) networks

We are interested in:

• Classifying data flows from scientific projects

• A policy-driven approach to network management & security

INTRODUCTION

3

Page 4: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

Supports well-known scientific projects such as:• Compact Muon Solenoid (CMS)

• Laser Interferometer Gravitational-Wave Observatory (LIGO)

• Large datasets with projects consuming significant storage and networking resources

• Our Work:• Ability to apply policies to these projects at the experiment level

• Ability to differentiate CMS traffic over LIGO

SCIENTIFIC PROJECTS @ UNLHolland Computing Center (HCC) @ UNL

4

Page 5: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

• Protocol for cluster and grid environments• Enables large-volume data transfers

• GridFTP maximizes data transfer throughput• Creates multiple TCP streams per transfer

• Overcomes TCP limitations for high-latency, high-bandwidth WANs

• CMS and LIGO both use GridFTP for data transfers

• Cost: Fairness

Globus GridFTP

5

Page 6: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

Associate policies based on application layer properties with network layer flow-level information.

• GridFTP breaks TCP fairness• Need to differentiate high-priority transfers

• GridFTP control channel is encrypted• Traffic cannot be classified by “sniffing” control channel

PROBLEM

6

Page 7: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

SNAG Architecture:• Integrates SDN capabilities with GridFTP

• Provides network monitoring and management capabilities

• Differentiates GridFTP transfers:• To/from various sources, and

• By owners

Solution: SNAGOur SDN-managed Network Architecture for GridFTP Transfers

7

Page 8: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

SNAG ARCHITECTURE

8

REST REST

Southbound API

MODULE

Page 9: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

Three main components namely:i. GridFTP Servers, Globus XIO Module and GridFTP-HDFS

plugin

ii. SDN controller (ONOS), SNAG App and GridFTP App

iii. Monitoring System (InfluxDB + Grafana)

Communication between components using RESTful APIs• L3, L4 info (src/dst IPs, port-pairs)

• Application layer info (file transfers, direction etc.)

SNAG COMPONENTS

9

REST REST

Southbound API

Page 10: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

GridFTP XIO Callout + SNAG

10

2

5Client

GridFTP Server1

Control Path

6

Data Path

3

REST POST4FLOW_MOD

ONOS SDN Controller

GridFTP XIO Callout:SRC_IPDST_IP, DST_PORTTRANSFER_STATUSFILENAME

Monitor 4REST POST

Page 11: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

NETWORK TOPOLOGY

11

Page 12: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

a) CMS PhEDEx - CMS production data movement • Maps user initiated transfers to the PhEDEx data placement

system

• Consists of large physics datasets (.root files) to/from sites

b) CMS Analysis - Represents analysis transfers associated with users' jobs (typically mapped to an individual user)

c) CMSProd - transfers associated with CMS production workflows

d) LCG Admin - transfers associated with SAM (Site Availability Monitoring)

RESULTS (1)Classified users based on four types of traffic:

12

Page 13: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

RESULTS (2)Classification of CMS Data Transfers by Users

13

• Each measurement shows the #users at every 15 min intervals

• Data normalized over 65 users

Page 14: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

RESULTS (3)Classification of #Transfer Streams per User type

14

• Represents the number of active TCP streams by user type

Page 15: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

• SNAG builds network layer views based on application layer properties• Resulting monitoring views cannot be achieved through the

traditional approaches

• SNAG accounts for resource usage and provides insights into opportunistic sharing (such as LIGO)

CONCLUSIONS

15

Page 16: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

• Transition from passive monitoring to active network management• Proactively change network flows based on monitoring information

• Traffic prioritization and QoS for CMS/LIGO Transfers

• Optimize capacity and increase network utilization

• Integration with other data flows such as XROOTD

• Insights into access patterns (Site-level info)

FUTURE WORK

16

Page 17: Deepak Nadig Anantha, Zhe Zhang, Byrav Ramamurthy, Brian

Thank You

Deepak Nadig Anantha

[email protected]

http://cse.unl.edu/~deepaknadig

17