introducing the cray sv1
DESCRIPTION
Introducing the CRAY SV1. CRAY SV1-128 SuperCluster. Agenda. Silicon Graphics Supercomputer Roadmap Introducing the Cray SV1 Cray SV1 hardware overview Multi-Streaming Processor Cray SV1 software overview. SV2 Vector. T90. SV1. SN2 Origin 3. J90. T3E. Origin. - PowerPoint PPT PresentationTRANSCRIPT
ver 0.1 Page 1
SGI Proprietary
Introducing theCRAY SV1
CRAY SV1-128 SuperCluster
ver 0.1 Page 2
SGI Proprietary
Agenda
• Silicon Graphics Supercomputer Roadmap
• Introducing the Cray SV1
• Cray SV1 hardware overview
• Multi-Streaming Processor
• Cray SV1 software overview
ver 0.1 Page 3
SGI Proprietary
Five Year Product Roadmap
1997 1998 1999 2000 2001 2002
J90
T3E
Origin
SV1SV1e
SN1Origin 2
SN2Origin 3
T90
UNICOSUNICOS/mkIRIX
SV2Vector
Common DSMArchitecture
ver 0.1 Page 4
SGI Proprietary
Vector Supercomputer Roadmap
CRAY T90 Flagship vector supercomputer
• Performance leader Engineering, production and quality initiatives continue
CRAY SV1 Scaling CMOS & UNICOS
• 1 TFLOPS & 1 TByte in 1998 Supercomputer performance & environment Outstanding reliability
CRAY SV2 10+ GFLOPS per CPU, scaling to 50+ TFLOPS
• In 2001 CMOS commodity leverage DSM & IRIX
ver 0.1 Page 5
SGI Proprietary
Introducing the CRAY SV1
• 4th generation CMOS & DRAM vector supercomputer from Cray
• 4 Gflops processors• cluster of tightly coupled Shared Memory
Processing Systems• configurations scale up to 1 Tflop / 1 Tbyte• highly competitive price / performance• compatible with Cray UNICOS vector applications• first shipment August 1998
ver 0.1 Page 6
SGI Proprietary
CRAY SV1 Highlights
Three technology breakthroughs: Caches combined with vectors Pipe selectable processors
• 1 GFLOPS peak with 2 pipes • 4 GFLOPS peak with 8 pipes
Scaling to 1,024 processors using vector CPUs• And 1 TByte of DRAM memory• Using 32-way flat memory SMP nodes
ver 0.1 Page 7
SGI Proprietary
Single Node CRAY SV1 Configurations
CRAY SV1A-1: Up to 3 four GFLOPS processors
• Up to 16 one GFLOPS processors 25.6 GByte/sec memory bandwidth 2 to 16 GByte of central memory 1 to 4 GigaRing channel adaptors
CRAY SV1-1: Up to 6 four GFLOPS processors
• Up to 32 one GFLOPS processors 51.2 GByte/sec memory bandwidth 4 to 32 GByte of central memory 1 to 8 GigaRing channel adaptors
Mem
Mem
Mem
Mem
CP
U
CP
U
CP
U
CP
U
CRAY SV1A-1(4x4 Backplane)
CRAY SV1-1(8x8 Backplane)
Mem
Mem
Mem
Mem
CP
U
CP
U
CP
U
CP
U
Mem
Mem
Mem
Mem
CP
U
CP
U
CP
U
CP
U
ver 0.1 Page 8
SGI Proprietary
SuperClustering The CRAY SV1:Basic Building Block = 32 Processor SMP Node
GigaRings
Minimum Configuration = Four Nodes(A Row of 4 -- 32 Processor Cabinets)
I/O32
PEs32
PEs32
PEs32
PEsI/O
SWS
MPNVector Clustering Experience:
• Over 5 years of providing production clusters to customers
• Several customer sites Each with multiple years of usage
ver 0.1 Page 9
SGI Proprietary• CPU Module
• High performance vector processors• 250MHz • 2 vector pipes per processor • 4 GFLOPS per module• BMM Functional Unit• 1 GigaRing IO channel per
module• Cache
• Cache for both vector and scalar data• 256KBytes• 4-way associative• 1 word (8 byte) per cache line
• Air cooled, surface mount design• Field upgradable
CACA
CACA
CACA
CACA
PVPV
PVPV
PVPV
PVPV
VBVB
VBVB
VBVB
VBVB
VAVA
VAVA
VAVA
VAVA
SV1 Processor Overview
ver 0.1 Page 10
SGI Proprietary
Multi-Streaming Processor
4 SV1 CPUs tightly coupled Based on optimal memory performance Coupled using shared registers
ver 0.1 Page 11
SGI Proprietary
In one CRAY SV1-32 cabinet: 32 -- 1 GFLOPS, 2 vector pipe processors or; 4 -- 4 GFLOPS, 8 vector pipe, processors, plus 16
1 GFLOPS processors or; 6 -- 4 GFLOPS, 8 vector pipe processors, plus 8
1GFLOPS processors
CPU CPUCPUCPU
CPU CPUCPUCPU
CPU CPUCPUCPU
CPU CPUCPUCPU
Memory
MultiStream-CPUMultiStream-CPU
Cache=1MB
Pipe Selectable Processor Concept
ver 0.1 Page 12
SGI Proprietary
SV1 Product Configurations
CRAY SVA-1 CRAY SV1-1 CRAY SV1-4 CRAY SV1-8 CRAY SV1-32
Number of CPUs 8 to 16 8 to 32 32 to 128 64 to 256 256 to 1024max MSP CPUs 3 6 24 48 192
CPU Clock Period 250 250 250 250 250(Mhz)
Number of 1 1 4 8 32
SMP Nodes
Peak Performance 8 to 16 8 to 32 32 to 128 64 to 256 256 to 1024
(Gflops)
Memory Size 1 to 16 2 to 32 8 to 128 16 to 256 64 to 1024
(GBytes)
Memory DRAM DRAM DRAM DRAM DRAM
Technology
Cooling Options Air Air Air or Water Air or Water Water
ver 0.1 Page 13
SGI Proprietary
CRAY SV1 Software Overview
CRAY PVP compatibility YMP Architecture compatibility CRAY UNICOS operating system Source and binary compatible with the YMP, C90 and J90
products
Immediate data center integrationlarge (~500) optimized applications-catalogOutstanding throughputSupports popular programming models
PVM, MPI, SHMEM, Autotasking, OpenMP, F--
Many tools for clusteringSeveral new enhancements planned
ver 0.1 Page 14
SGI Proprietary
UNICOS
Initially delivered in 1986 A 64 bit OS for 13 years A parallel OS for the highest performance computers in the
world
Reliable and robust with an MTTI of 100,000+ hours on the CRAY J90 series systems
Full Data Center functionality Security (B1) – Full tape support Accounting – ESCON, BlockMux, SCSI 64 bit filesystem – Data Migration State-of-the-art programming environment
ver 0.1 Page 15
SGI Proprietary
SV1 SuperCluster Capabilities Production quality OS, compilers, tools & batch systems Single system image
Within a 32 processor SMP nodeHigh speed, transparent file access (NFS, DFS, BDS)
Single system sign-on (via DCE)Mass Storage
DMF Data Migration Facility Data migration from cluster nodes to central tape resource
Job Scheduling NQE Network Queuing Environment --- access to cluster from
entire networkParallel Programing
Cray Message Passing Toolkit (MPI) and PVM Autotasking, OpenMP
Array Services Cluster wide status information and command execution
ver 0.1 Page 16
SGI Proprietary
SuperCluster Enhancements planned
Improved enviroment for parallel programming Increase IO bandwidth GigaRing OS Bypass to reduce interconnect latency
• Scheduled Transport protocal• SHMEM and MPI
Improved administration and user tools Single point of administration
• Accounting, UDB, installation, startup/shutdown, backup/recovery, etc
Enhanced system recovery and redundancy
Distributed HSM Client/Server DMF
ver 0.1 Page 17
SGI Proprietary
CRAY SV1 Applications
Supports the large robust UNICOS ISV set of applications, e.g.:
Structural Dynamics Chemistry CFD Environmental Electronics Petroleum
Provides compatibility with customer in-house application codes
Source and binary compatible with J90, C90 and YMP products
ver 0.1 Page 18
SGI Proprietary
SV1 Product Summary4 GFLOPS processors
Customers can select either 2 or 8 pipe operation Mixed processor sizes allow customers to optimize their code
performance
Highly competitive price/performance Low cost design and low power CMOS and DRAM Low life-cycle costs in power, cooling and maintenance
Up to 1 TFLOP peak and 1TByte memory capacity Excellent for throughput needs Peak capability is 20 times the powerful CRAY T932 Memory size is up to 125 times the largest CRAY T90
Proven UNICOS Operating System Runs UNICOS applications
Proves Silicon Graphics commitment from to the Very High End Supercomputing