introducing the cray sv1

18
ver 0.1 Page 1 SGI Propriet ary Introducing the CRAY SV1 CRAY SV1-128 SuperCluster

Upload: whitney-duran

Post on 31-Dec-2015

52 views

Category:

Documents


2 download

DESCRIPTION

Introducing the CRAY SV1. CRAY SV1-128 SuperCluster. Agenda. Silicon Graphics Supercomputer Roadmap Introducing the Cray SV1 Cray SV1 hardware overview Multi-Streaming Processor Cray SV1 software overview. SV2 Vector. T90. SV1. SN2 Origin 3. J90. T3E. Origin. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Introducing the CRAY SV1

ver 0.1 Page 1

SGI Proprietary

Introducing theCRAY SV1

CRAY SV1-128 SuperCluster

Page 2: Introducing the CRAY SV1

ver 0.1 Page 2

SGI Proprietary

Agenda

• Silicon Graphics Supercomputer Roadmap

• Introducing the Cray SV1

• Cray SV1 hardware overview

• Multi-Streaming Processor

• Cray SV1 software overview

Page 3: Introducing the CRAY SV1

ver 0.1 Page 3

SGI Proprietary

Five Year Product Roadmap

1997 1998 1999 2000 2001 2002

J90

T3E

Origin

SV1SV1e

SN1Origin 2

SN2Origin 3

T90

UNICOSUNICOS/mkIRIX

SV2Vector

Common DSMArchitecture

Page 4: Introducing the CRAY SV1

ver 0.1 Page 4

SGI Proprietary

Vector Supercomputer Roadmap

CRAY T90 Flagship vector supercomputer

• Performance leader Engineering, production and quality initiatives continue

CRAY SV1 Scaling CMOS & UNICOS

• 1 TFLOPS & 1 TByte in 1998 Supercomputer performance & environment Outstanding reliability

CRAY SV2 10+ GFLOPS per CPU, scaling to 50+ TFLOPS

• In 2001 CMOS commodity leverage DSM & IRIX

Page 5: Introducing the CRAY SV1

ver 0.1 Page 5

SGI Proprietary

Introducing the CRAY SV1

• 4th generation CMOS & DRAM vector supercomputer from Cray

• 4 Gflops processors• cluster of tightly coupled Shared Memory

Processing Systems• configurations scale up to 1 Tflop / 1 Tbyte• highly competitive price / performance• compatible with Cray UNICOS vector applications• first shipment August 1998

Page 6: Introducing the CRAY SV1

ver 0.1 Page 6

SGI Proprietary

CRAY SV1 Highlights

Three technology breakthroughs: Caches combined with vectors Pipe selectable processors

• 1 GFLOPS peak with 2 pipes • 4 GFLOPS peak with 8 pipes

Scaling to 1,024 processors using vector CPUs• And 1 TByte of DRAM memory• Using 32-way flat memory SMP nodes

Page 7: Introducing the CRAY SV1

ver 0.1 Page 7

SGI Proprietary

Single Node CRAY SV1 Configurations

CRAY SV1A-1: Up to 3 four GFLOPS processors

• Up to 16 one GFLOPS processors 25.6 GByte/sec memory bandwidth 2 to 16 GByte of central memory 1 to 4 GigaRing channel adaptors

CRAY SV1-1: Up to 6 four GFLOPS processors

• Up to 32 one GFLOPS processors 51.2 GByte/sec memory bandwidth 4 to 32 GByte of central memory 1 to 8 GigaRing channel adaptors

Mem

Mem

Mem

Mem

CP

U

CP

U

CP

U

CP

U

CRAY SV1A-1(4x4 Backplane)

CRAY SV1-1(8x8 Backplane)

Mem

Mem

Mem

Mem

CP

U

CP

U

CP

U

CP

U

Mem

Mem

Mem

Mem

CP

U

CP

U

CP

U

CP

U

Page 8: Introducing the CRAY SV1

ver 0.1 Page 8

SGI Proprietary

SuperClustering The CRAY SV1:Basic Building Block = 32 Processor SMP Node

GigaRings

Minimum Configuration = Four Nodes(A Row of 4 -- 32 Processor Cabinets)

I/O32

PEs32

PEs32

PEs32

PEsI/O

SWS

MPNVector Clustering Experience:

• Over 5 years of providing production clusters to customers

• Several customer sites Each with multiple years of usage

Page 9: Introducing the CRAY SV1

ver 0.1 Page 9

SGI Proprietary• CPU Module

• High performance vector processors• 250MHz • 2 vector pipes per processor • 4 GFLOPS per module• BMM Functional Unit• 1 GigaRing IO channel per

module• Cache

• Cache for both vector and scalar data• 256KBytes• 4-way associative• 1 word (8 byte) per cache line

• Air cooled, surface mount design• Field upgradable

CACA

CACA

CACA

CACA

PVPV

PVPV

PVPV

PVPV

VBVB

VBVB

VBVB

VBVB

VAVA

VAVA

VAVA

VAVA

SV1 Processor Overview

Page 10: Introducing the CRAY SV1

ver 0.1 Page 10

SGI Proprietary

Multi-Streaming Processor

4 SV1 CPUs tightly coupled Based on optimal memory performance Coupled using shared registers

Page 11: Introducing the CRAY SV1

ver 0.1 Page 11

SGI Proprietary

In one CRAY SV1-32 cabinet: 32 -- 1 GFLOPS, 2 vector pipe processors or; 4 -- 4 GFLOPS, 8 vector pipe, processors, plus 16

1 GFLOPS processors or; 6 -- 4 GFLOPS, 8 vector pipe processors, plus 8

1GFLOPS processors

CPU CPUCPUCPU

CPU CPUCPUCPU

CPU CPUCPUCPU

CPU CPUCPUCPU

Memory

MultiStream-CPUMultiStream-CPU

Cache=1MB

Pipe Selectable Processor Concept

Page 12: Introducing the CRAY SV1

ver 0.1 Page 12

SGI Proprietary

SV1 Product Configurations

CRAY SVA-1 CRAY SV1-1 CRAY SV1-4 CRAY SV1-8 CRAY SV1-32

Number of CPUs 8 to 16 8 to 32 32 to 128 64 to 256 256 to 1024max MSP CPUs 3 6 24 48 192

CPU Clock Period 250 250 250 250 250(Mhz)

Number of 1 1 4 8 32

SMP Nodes

Peak Performance 8 to 16 8 to 32 32 to 128 64 to 256 256 to 1024

(Gflops)

Memory Size 1 to 16 2 to 32 8 to 128 16 to 256 64 to 1024

(GBytes)

Memory DRAM DRAM DRAM DRAM DRAM

Technology

Cooling Options Air Air Air or Water Air or Water Water

Page 13: Introducing the CRAY SV1

ver 0.1 Page 13

SGI Proprietary

CRAY SV1 Software Overview

CRAY PVP compatibility YMP Architecture compatibility CRAY UNICOS operating system Source and binary compatible with the YMP, C90 and J90

products

Immediate data center integrationlarge (~500) optimized applications-catalogOutstanding throughputSupports popular programming models

PVM, MPI, SHMEM, Autotasking, OpenMP, F--

Many tools for clusteringSeveral new enhancements planned

Page 14: Introducing the CRAY SV1

ver 0.1 Page 14

SGI Proprietary

UNICOS

Initially delivered in 1986 A 64 bit OS for 13 years A parallel OS for the highest performance computers in the

world

Reliable and robust with an MTTI of 100,000+ hours on the CRAY J90 series systems

Full Data Center functionality Security (B1) – Full tape support Accounting – ESCON, BlockMux, SCSI 64 bit filesystem – Data Migration State-of-the-art programming environment

Page 15: Introducing the CRAY SV1

ver 0.1 Page 15

SGI Proprietary

SV1 SuperCluster Capabilities Production quality OS, compilers, tools & batch systems Single system image

Within a 32 processor SMP nodeHigh speed, transparent file access (NFS, DFS, BDS)

Single system sign-on (via DCE)Mass Storage

DMF Data Migration Facility Data migration from cluster nodes to central tape resource

Job Scheduling NQE Network Queuing Environment --- access to cluster from

entire networkParallel Programing

Cray Message Passing Toolkit (MPI) and PVM Autotasking, OpenMP

Array Services Cluster wide status information and command execution

Page 16: Introducing the CRAY SV1

ver 0.1 Page 16

SGI Proprietary

SuperCluster Enhancements planned

Improved enviroment for parallel programming Increase IO bandwidth GigaRing OS Bypass to reduce interconnect latency

• Scheduled Transport protocal• SHMEM and MPI

Improved administration and user tools Single point of administration

• Accounting, UDB, installation, startup/shutdown, backup/recovery, etc

Enhanced system recovery and redundancy

Distributed HSM Client/Server DMF

Page 17: Introducing the CRAY SV1

ver 0.1 Page 17

SGI Proprietary

CRAY SV1 Applications

Supports the large robust UNICOS ISV set of applications, e.g.:

Structural Dynamics Chemistry CFD Environmental Electronics Petroleum

Provides compatibility with customer in-house application codes

Source and binary compatible with J90, C90 and YMP products

Page 18: Introducing the CRAY SV1

ver 0.1 Page 18

SGI Proprietary

SV1 Product Summary4 GFLOPS processors

Customers can select either 2 or 8 pipe operation Mixed processor sizes allow customers to optimize their code

performance

Highly competitive price/performance Low cost design and low power CMOS and DRAM Low life-cycle costs in power, cooling and maintenance

Up to 1 TFLOP peak and 1TByte memory capacity Excellent for throughput needs Peak capability is 20 times the powerful CRAY T932 Memory size is up to 125 times the largest CRAY T90

Proven UNICOS Operating System Runs UNICOS applications

Proves Silicon Graphics commitment from to the Very High End Supercomputing