SAN over WAN - a new way of solving the GRID data access bottleneck

Cracow ‘03 Grid Workshop. Silicon Graphics, Inc. SAN over WAN - a new way of solving the GRID data access bottleneck. Presented by: Dr. Wolfgang Mertz, Business Development Manager for Storage in EMEA, [email protected]


Page 1: SAN over WAN - a new way of solving the GRID data access bottleneck

Cracow ‘03 Grid Workshop

Silicon Graphics, Inc.

SAN over WAN - a new way of solving the GRID data access bottleneck

Presented by:

Dr. Wolfgang Mertz

Business Development Manager for Storage in EMEA

[email protected]

Page 2: Data Growth Trends

[Figure: storage shipped per year, 1998 to 2005, in terabytes; vertical axis from 0 to 7,000,000]

• From 1998 to 2000, storage shipped grew at 78% CAGR.

• From 2001 to 2005, it is projected to grow at 83% CAGR.

• Data under management in an HPC environment is currently growing at over 100%/year.

Source: Lyman, Peter and Hal R. Varian, "How Much Information", 2000. Retrieved from http://www.sims.berkeley.edu/how-much-info on 12/19/2002.
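As a rough illustration of what these compound annual growth rates imply, the sketch below compounds a starting volume at a fixed yearly rate. The function names and the starting value of 1.0 (a relative unit, not a figure read from the chart) are assumptions made for this example.

```python
def compound(start, cagr, years):
    """Value after `years` of growth at a constant annual rate `cagr` (0.83 = 83%)."""
    return start * (1.0 + cagr) ** years


def cagr(start, end, years):
    """Compound annual growth rate that takes `start` to `end` in `years` years."""
    return (end / start) ** (1.0 / years) - 1.0


if __name__ == "__main__":
    # Relative growth factors only; 1.0 is an arbitrary starting unit.
    print(f"78% CAGR over 2 years (1998 to 2000): x{compound(1.0, 0.78, 2):.2f}")
    print(f"83% CAGR over 4 years (2001 to 2005): x{compound(1.0, 0.83, 4):.2f}")
    print(f"100% per year over 4 years:           x{compound(1.0, 1.00, 4):.2f}")
```

At 83% per year a volume grows roughly elevenfold in four years; at 100% per year it grows sixteenfold.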

Page 3: Two Buzzwords in the IT Industry

• Server Consolidation
  – maybe in a commercial environment
  – usually not in a technical environment
    • a hammer is a hammer, a screwdriver is a screwdriver
    • an HPC system cannot be used as an HPV system

• Storage Consolidation
  – DAS -> NAS -> SAN

Page 4: History of Storage Architectures: DAS - Direct Attached Storage

• pro
  – appropriate performance

• con
  – distributed, expensive administration
  – data may not be where it is needed
  – multiple copies of data stored

Page 5: History of Storage Architectures: NAS - Network Attached Storage

• pro
  – centralized, less expensive administration
  – one copy of data
  – access from every system

• con
  – network performance is the bottleneck

Page 6: History of Storage Architectures: SAN - Storage Area Network

[Diagram label: Switch]

• pro
  – centralized administration
  – performance equivalent to DAS

• con
  – NO FILE SHARING
  – multiple copies of data stored

Page 7: How does that translate to a GRID Environment?

• Storage Consolidation
  – useful in a local environment (GRID node)
  – does not work between remote GRID nodes

• Current Data Access between GRID Nodes
  – Data has to be copied before/after the execution of a job
  – Problems
    • copy process has to be done manually or included in the job script
    • copy can take a long time
    • multiple copies of data
      – additional disk space needed
      – revision problem
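The copy-before and copy-after pattern described on this slide can be sketched as follows. This is a minimal illustration, not a real GRID middleware API: `run_job_with_staging`, the directory layout and the `toy_job` callable are all hypothetical, and local directories stand in for remote GRID nodes.

```python
import shutil
import tempfile
from pathlib import Path


def run_job_with_staging(remote_input, remote_output_dir, scratch_dir, job):
    """Copy input to local scratch, run the job, copy the result back.

    Every call creates a second copy of the input and of the output,
    which is the disk-space and revision problem noted on the slide.
    """
    scratch_dir = Path(scratch_dir)
    local_input = scratch_dir / Path(remote_input).name
    shutil.copy2(remote_input, local_input)      # stage-in (copy before the job)

    local_output = scratch_dir / "result.out"
    job(local_input, local_output)               # the actual computation

    remote_output = Path(remote_output_dir) / local_output.name
    shutil.copy2(local_output, remote_output)    # stage-out (copy after the job)
    return remote_output


def toy_job(src, dst):
    """Hypothetical job: upper-case the input text."""
    Path(dst).write_text(Path(src).read_text().upper())


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        remote = Path(tmp) / "remote_node"       # stands in for a remote GRID node
        scratch = Path(tmp) / "local_scratch"    # stands in for the compute node's disk
        remote.mkdir()
        scratch.mkdir()
        (remote / "input.dat").write_text("grid data")

        result = run_job_with_staging(remote / "input.dat", remote, scratch, toy_job)
        print(result.read_text())
        # The scratch directory still holds duplicate copies after the job.
        print(sorted(p.name for p in scratch.iterdir()))
```

With a file system shared between the nodes, the two `shutil.copy2` calls would disappear and the job would read and write the single shared copy directly.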

Page 8: What if...

• ... a SAN had the same file sharing capability as a NAS?

• ... one could build a SAN between different buildings/sites/cities and not lose performance?

Page 9: Storage Area Networks (SAN): The High Performance Solution

[Diagram labels: LAN, SAN]

A first step:

• Each host owns a dedicated volume consolidated on a RAID array.

• Storage management is centralized.

• Offers a certain level of flexibility.

Page 10: SGI InfiniteStorage Shared FileSystem (CXFS)

[Diagram: hosts attached to a LAN and a SAN, running Solaris, AIX, HP-UX, Windows NT, 2000 and XP, Linux, Mac OS and IRIX]

A unique high-performance solution:

• Each host shares one or more volumes consolidated in one or more RAID arrays.

• Centralized storage management

• High modularity

• True high-performance data sharing

• Heterogeneous environment

Page 11: Fibre Channel over SONET/SDH: The High Efficiency, Long Distance Alternative

Data re-transmission due to IP packet loss limits actual IP throughput over distance.

[Figure: "Hours to Send 1 TeraByte". Horizontal axis: distance in kilometers (0 to 4000), with New York, Boston, Chicago and Denver marked. Vertical axis: hours (0 to 250). Series: OC-12 IP and OC-12 SONET/SDH]
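To make the chart's point concrete, the sketch below estimates loss-limited TCP throughput with the widely used Mathis et al. approximation, rate ≈ (MSS / RTT) · (C / √p) with C = √(3/2). This model is not taken from the slides, and the packet loss rate, segment size and fibre propagation speed are illustrative assumptions, so the resulting hours will not match the plotted curves.

```python
import math

OC12_BPS = 622.08e6          # OC-12 line rate in bits per second
FIBRE_KM_PER_S = 200_000.0   # assumed signal speed in fibre (about 2/3 of c)


def rtt_seconds(distance_km):
    """Round-trip propagation delay only; ignores queuing and equipment delay."""
    return 2.0 * distance_km / FIBRE_KM_PER_S


def tcp_throughput_bps(distance_km, loss_rate, mss_bytes=1460, link_bps=OC12_BPS):
    """Mathis approximation of steady-state TCP throughput, capped at the link rate."""
    rtt = rtt_seconds(distance_km)
    if rtt == 0 or loss_rate == 0:
        return link_bps
    estimate = (mss_bytes * 8 / rtt) * (math.sqrt(1.5) / math.sqrt(loss_rate))
    return min(estimate, link_bps)


def hours_to_send(num_bytes, bits_per_second):
    """Transfer time in hours at a constant rate."""
    return num_bytes * 8 / bits_per_second / 3600.0


if __name__ == "__main__":
    one_tb = 1e12  # decimal terabyte
    print(f"full OC-12 line rate: {hours_to_send(one_tb, OC12_BPS):.1f} h")
    for km in (500, 1000, 2000, 4000):
        rate = tcp_throughput_bps(km, loss_rate=1e-4)  # 0.01% loss, assumed
        hours = hours_to_send(one_tb, rate)
        print(f"{km:>5} km over IP: {rate / 1e6:6.1f} Mb/s, {hours:6.1f} h")
```

Because the estimate scales with 1/RTT, the time to move a fixed volume over a lossy IP path grows roughly linearly with distance under these assumptions, while a transport that is not loss-limited stays close to the line-rate figure.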

Page 12: LightSand Solution for building a Global-SAN

[Diagram: two sites, each with a SAN (storage, tape system, servers, Fibre Channel switch) and a client LAN with an IP router, connected across a WAN. WAN transport options shown: DWDM, dedicated fiber, SDH/SONET. Link labels: SONET, FC, IP]

Page 13: LightSand Products

• S-600
  – 2 ports FC and/or IP, 1 Gb/s
  – Point-to-point SAN interconnect over SONET/SDH OC-12c (622 Mb/s bandwidth)
  – Low latency (approximately 50 µs)

• S-2500
  – 3 ports FC and/or IP, 1 Gb/s
  – Point-to-point SAN interconnect over SONET/SDH OC-48c (2.5 Gb/s bandwidth)
  – Point-to-multipoint SAN interconnect over SONET/SDH (up to 5 SAN islands, 622 Mb/s per link)
  – Low latency (approximately 50 µs)

Page 14: Data Movement Today – A Recent Case Study

[Diagram: Los Alamos National Laboratory (LANL) and Sandia National Laboratory (SNL), each with a server and a Fibre Channel storage area network, connected by an IP network]

Scientists at LANL currently dump 100 GB of supercomputing data to tape and FedEx it to SNL because it is faster than trying to use the existing 155 Mb/s IP WAN connection.

– Actual measured throughput of 16 Mb/s! (10% bandwidth utilization)
  http://www-unix.mcs.anl.gov/discovery/wufeng.htm
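The figures in this case study can be checked with simple arithmetic. The sketch below uses only numbers stated in the slides (100 GB, 155 Mb/s, 16 Mb/s, and the 622 Mb/s and 2.5 Gb/s SONET/SDH rates listed for the LightSand products). It assumes decimal units and a link that sustains its full nominal rate, which real transfers will not quite reach.

```python
def transfer_seconds(num_bytes, bits_per_second):
    """Seconds needed to move `num_bytes` at a constant rate."""
    return num_bytes * 8 / bits_per_second


def utilization(measured_bps, nominal_bps):
    """Fraction of the nominal link rate actually achieved."""
    return measured_bps / nominal_bps


if __name__ == "__main__":
    data = 100e9  # 100 GB, decimal
    rates = {
        "IP WAN, measured (16 Mb/s)": 16e6,
        "IP WAN, nominal (155 Mb/s)": 155e6,
        "OC-12c (622 Mb/s)": 622e6,
        "OC-48c (2.5 Gb/s)": 2.5e9,
    }
    print(f"utilization: {utilization(16e6, 155e6):.0%}")
    for name, bps in rates.items():
        minutes = transfer_seconds(data, bps) / 60
        print(f"{name:<28} {minutes:8.1f} min")
```

At the measured 16 Mb/s the transfer takes about 14 hours; at the nominal OC-48c rate it would take a little over 5 minutes.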

Page 15: The Better Way – Directly Between Storage Systems

Using LightSand gateways, the same data could be transferred in a few minutes!

[Diagram: a local data center and a remote data center, each with a server and an FC SAN, connected by an IP network. A LightSand gateway at each site links the two FC SANs directly over the telco SONET/SDH infrastructure]

Page 16: What does that mean for a GRID Environment?

[Diagram labels: GDAŃSK, POZNAŃ, ŁÓDŹ, KRAKÓW, WROCŁAW, WARSZAWA]

• Full Bandwidth Data Access across the GRID

• No Multiple Copies of Data
  – avoid the revision problem
  – do not waste disk space

• Make GRID Computing more efficient

Page 17: Highly Integrated, Massively Scalable Systems

• Storage

• Advanced Graphics

• High-Performance Computing

Page 18: SGI InfiniteStorage Product Line

Choose only the integrated capabilities you need

• Storage Hardware: TP900, TP9100, TP9300, TP9400, TP9500, HDS 99x0, STK Tape Libraries, ADIC Libraries, Brocade Switches, NAS 2000, SAN 2000, SAN 3000

• Storage Architectures: DAS, NAS, SAN

• High Availability: Redundant Hardware and FailSafe™, XVM

• Data Protection: Legato NetWorker, XFS™ Dump, OpenVault™

• HSM: SGI Data Migration Facility (DMF), TMF, OpenVault™

• Data Sharing: XFS, CIFS/NFS, Samba, Clustered XFS (CXFS™), SAN over WAN

Page 19

www.sgi.com/products/storage