to support the infrastructural needs of the national and ...kettimut/talks/didc2010.pdf · java web...

21

Upload: others

Post on 31-May-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 2: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop

To support the infrastructural needs of the national and international climate community, ESG is providing crucial technology to securely access, monitor, catalog, transport, and distribute data in today’s grid computing environment.

HPC hardware running climate models

ESG Sites

ESG Portal

Page 3: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 4: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 5: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop

  Characteristics of scientific data – Typically stored in file

– Modest file size

– Huge number of files

– A dataset is organized into hierarchical directories

  Requirements for data movement – Reliability

– Performance

– Scalability

– Ease of use

Page 6: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 7: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 8: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop

  Fault detection and recovery – Network outage, server crash

> Client (DTL) still active

> Retries

– Client failures > Maintain transfer request checkpoint

> Recover when program starts again

  Support multiple data transfer protocols – Provides interface for integrating new

protocol

– The prototype is based on GridFTP

Page 9: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 10: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop

  FCFS   Dependency-aware scheduling

– User specify dependency between requests – Scheduler parse the dependency

relationship and enqueue requests accordingly

  Multiple-Pair Transfer Scheduling – Different transfer requests may have

different source and destination – Process them concurrently

Page 11: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop

  Java web start application – No installation – Always up-to-date version

  Drag and drop – Zero configuration –  Integrated with myproxy – Automatically trusts the CAs part of IGTF

distribution – Fault tolerant – Transfer status monitoring – Optimized for performance

Page 12: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 13: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 14: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop

  APS beamline experiments generate TBs data

  For example, in tomography beamline users get 3 days of beam time per cycle (4 cycles/yr) – Generate 3-4 TBs of data in that 3 days – About 15-20 users per cycle – 45-80 TBs of data generated per cycle – 180-320 TBs of data per year

  Move data back to their home institute after their experiments

Page 15: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 16: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 17: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop

  Move data from APS to LSU and PSC – 32 ms RTT

  TCP tuning is enabled in Linux OS

  Dataset is about 696GB

  Compared with globus-url-copy

Page 18: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 19: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 20: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop
Page 21: To support the infrastructural needs of the national and ...kettimut/talks/DIDC2010.pdf · Java web start application – No installation – Always up-to-date version Drag and drop