eurotech 30 01 2013

Aurora Tigon: Green, Dense, Standard HPC. G. Tecchiolli, CTO. Cineca, Bologna, March 30th, 2013

Upload: brian-caulfield

Post on 28-Nov-2014


TRANSCRIPT

Page 1: Eurotech 30 01 2013

Aurora Tigon

Green, Dense, Standard HPC

G. Tecchiolli, CTO

Cineca, Bologna, March 30th, 2013

Page 2: Eurotech 30 01 2013

Eurotech HPC R&D and Operations

[Map: locations labeled Japan, Singapore, Italy, France, USA, UK, India]

Page 3: Eurotech 30 01 2013

AURORA TIGON

Unleash the hybrid power

Page 4: Eurotech 30 01 2013

From 1 node to large petascale systems

[Figure labels: Node, Backplane, Chassis, Cooling, System, Rack]

Page 5: Eurotech 30 01 2013

Key Features:

High Performance Density: 256 CPUs, 256 accelerators, up to 350 TFlops in just 1.5 m²

Energy efficiency: the Aurora direct cooling targets a datacenter PUE of 1.05, with no need for air conditioning and up to 50% less energy

Programmability and compatibility: based on standard HPC cluster architecture. 100% compatibility with existing applications.

Flexible Liquid Cooling: all components are cooled by water, at temperatures from 18°C to 52°C and variable flow rates

Reliability: 3 independent sensor networks, soldered memory, no moving parts, uniform cooling, quality controls

The Aurora Tigon: Unleash the hybrid power

Page 6: Eurotech 30 01 2013

The Aurora Tigon node card

[Figure labels: 2 x Intel Xeon E5 sockets, 2 x Nvidia K20, Infiniband QDR, 3D Torus, Local Disk, Soldered RAM, Cooling Plate]

Page 7: Eurotech 30 01 2013

Energy efficiency measurements according to the Green500 guidelines

The Setup

• All measurements made with a calibrated power meter with the system running HPL

System: Eurora supercomputer, 64 nodes, 128 CPUs, 128 GPUs

Node card: Intel Xeon E5-2687W (150 W), 2 x NVIDIA Tesla K20, 1 x Infiniband QDR

Ambient temperature: 20°C ± 1°C

Coolant temperature: 19°C ± 1°C

Coolant water flow rate: 120 lph ± 7 lph per Eurora board
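To put the stated coolant flow rate in context, the following minimal sketch estimates how much heat the water carries away per kelvin of temperature rise. Only the flow rate and its tolerance come from the slide; the density and specific heat of water are approximate textbook values and are assumptions here.

```python
# Heat absorbed by the coolant per 1 K of temperature rise.
# Flow rate and tolerance are from the slide; water properties are
# approximate textbook values (assumptions, not from the source).

FLOW_LPH = 120.0       # litres per hour per Eurora board (from the slide)
FLOW_TOL_LPH = 7.0     # stated tolerance on the flow rate
WATER_DENSITY = 0.998  # kg per litre at roughly 20 °C (assumed)
WATER_CP = 4182.0      # J/(kg·K) at roughly 20 °C (assumed)


def watts_per_kelvin(flow_lph: float) -> float:
    """Return the heat flow (W) absorbed per 1 K rise in coolant temperature."""
    mass_flow_kg_s = flow_lph * WATER_DENSITY / 3600.0
    return mass_flow_kg_s * WATER_CP


nominal = watts_per_kelvin(FLOW_LPH)
low = watts_per_kelvin(FLOW_LPH - FLOW_TOL_LPH)
high = watts_per_kelvin(FLOW_LPH + FLOW_TOL_LPH)
print(f"{nominal:.0f} W/K (range {low:.0f} to {high:.0f} W/K)")
```

Dividing a board's heat load by this figure gives the expected rise in coolant temperature across the board.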

Page 8: Eurotech 30 01 2013

HPL Benchmark Results

[Figure: Performance vs CPU Frequency. Performance (GFlops), axis from 1450 to 1750, against frequency (GHz), axis from 1.00 to 4.00]

[Figure: Power vs CPU Frequency. Power (W), axis from 0 to 1000, against frequency (GHz), axis from 1.00 to 4.00]

[Figure: Performance/Power vs CPU Frequency. Performance (GFlops/W), axis from 0 to 3.50, against frequency (GHz), axis from 1.00 to 4.00]

3.15 ± 0.12 !!

Page 9: Eurotech 30 01 2013
Page 10: Eurotech 30 01 2013

Final results

3150 MFlop/s per watt

30% more efficient than #1 in the Green500

What does it mean?

Each Eurora node (server), the size of a laptop, can perform 1,700,000,000,000 floating point operations per second, 30 times more than a desktop.

The Eurora system is currently the most energy-efficient standard x86-based system in the world, at 3150 MFlop/s per watt. This is 15 times more efficient than an average desktop computer.


1700 Sustained GFlop/s per node
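The figures on this slide are linked by simple arithmetic, sketched below. The inputs are the slide's own numbers; every derived value (node power, the baselines behind "30% more", "30 times" and "15 times") is only what those numbers imply, not a measurement reported in the talk.

```python
# Back-of-the-envelope relations between the headline numbers.
# Inputs are taken from the slide; outputs are implied values only.

PERF_PER_NODE_GFLOPS = 1700.0     # sustained HPL per node
EFFICIENCY_MFLOPS_PER_W = 3150.0  # energy efficiency under HPL
GAIN_OVER_GREEN500_TOP = 1.30     # "30% more efficient than #1"
PERF_RATIO_VS_DESKTOP = 30.0      # "30 times more than a desktop"
EFF_RATIO_VS_DESKTOP = 15.0       # "15 times more efficient"

# Power = performance / efficiency (GFlop/s converted to MFlop/s first).
implied_node_power_w = PERF_PER_NODE_GFLOPS * 1000.0 / EFFICIENCY_MFLOPS_PER_W

# Baselines implied by the comparative claims.
implied_top_mflops_per_w = EFFICIENCY_MFLOPS_PER_W / GAIN_OVER_GREEN500_TOP
implied_desktop_gflops = PERF_PER_NODE_GFLOPS / PERF_RATIO_VS_DESKTOP
implied_desktop_mflops_per_w = EFFICIENCY_MFLOPS_PER_W / EFF_RATIO_VS_DESKTOP

print(f"Implied node power under HPL: {implied_node_power_w:.0f} W")
print(f"Implied previous #1: {implied_top_mflops_per_w:.0f} MFlop/s per W")
print(f"Implied desktop: {implied_desktop_gflops:.1f} GFlop/s, "
      f"{implied_desktop_mflops_per_w:.0f} MFlop/s per W")
```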

Page 11: Eurotech 30 01 2013

Tigon: an energy-aware design

• GPUs
• Optimized design:
  – No unused components
  – No fans
  – Soldered components
  – Dense architecture (with integrated interconnect)
• Optimized power conversion chain
  – To enable system level energy efficiency
  – To enable data-center level energy efficiency
• Liquid Cooling
  – To enable system level energy efficiency when cold water is used
  – To enable data-center level energy efficiency when hot water is used

Page 12: Eurotech 30 01 2013

“standard” power distribution conversion steps

Data from Intel

Page 13: Eurotech 30 01 2013

Moving towards DC reduces steps in power conversion

Data from Intel

Page 14: Eurotech 30 01 2013

Aurora power distribution

[Diagram labels: 230 V, Optional UPS, 48 Vdc, 10 V; conversion stages marked 97% efficiency and 98% efficiency]
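Stage efficiencies in a conversion chain multiply, so the two figures on this slide can be combined into one end-to-end number. The sketch below assumes the chain consists of exactly these two stages in series; the 1000 W load is a hypothetical value for illustration.

```python
from math import prod

# Efficiencies of the two conversion stages shown on the slide.
# Assumption: these are the only two stages and they act in series.
STAGE_EFFICIENCIES = [0.97, 0.98]

# End-to-end efficiency is the product of the stage efficiencies.
chain_efficiency = prod(STAGE_EFFICIENCIES)


def input_power_w(load_w: float) -> float:
    """Power drawn at the input to deliver load_w at the output."""
    return load_w / chain_efficiency


example_load_w = 1000.0  # hypothetical load, not from the source
print(f"Chain efficiency: {chain_efficiency:.4f}")
print(f"Input needed for {example_load_w:.0f} W: "
      f"{input_power_w(example_load_w):.1f} W")
```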

Page 15: Eurotech 30 01 2013

Gain DC/DC conversion efficiency

• The DC/DC conversion gains over 2 percentage points in efficiency, from 95.5% to 98%

[Figure labels: Existing DC/DC conversion, New upgraded DC/DC conversion]
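A gain of a few percentage points in efficiency corresponds to a much larger relative cut in conversion losses, which is what ends up as heat. The following sketch works this out for the two efficiencies quoted above; the 1000 W output load is a hypothetical value.

```python
# Conversion loss before and after the DC/DC upgrade.
# Efficiencies are from the slide; the load is a hypothetical example.

OLD_EFFICIENCY = 0.955
NEW_EFFICIENCY = 0.98


def loss_w(output_w: float, efficiency: float) -> float:
    """Power dissipated in the converter while delivering output_w."""
    return output_w / efficiency - output_w


example_output_w = 1000.0  # hypothetical load, not from the source
old_loss = loss_w(example_output_w, OLD_EFFICIENCY)
new_loss = loss_w(example_output_w, NEW_EFFICIENCY)

gain_points = (NEW_EFFICIENCY - OLD_EFFICIENCY) * 100.0
loss_reduction = 1.0 - new_loss / old_loss

print(f"Efficiency gain: {gain_points:.1f} percentage points")
print(f"Loss at {example_output_w:.0f} W: "
      f"{old_loss:.1f} W before, {new_loss:.1f} W after")
print(f"Relative reduction in losses: {loss_reduction:.0%}")
```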

Page 16: Eurotech 30 01 2013

Liquid cooling and efficiency at system level

178 nodes, AMD Opteron 6128HE CPUs (Magny-Cours), 16 GB RAM

Measurements taken by LRZ

Page 17: Eurotech 30 01 2013

Why is liquid cooling better?

• Heat capacity (relative):
  – Air: 1
  – Water: 3500
• Control over coolant flow and heat exchange
• Control over temperature
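The ratio of roughly 3500 matches a comparison of heat capacity per unit volume, as the minimal sketch below suggests. The densities and specific heats are approximate textbook values near 20 °C at atmospheric pressure and are assumptions, not figures from the slides.

```python
# Volumetric heat capacity of water relative to air.
# All property values are approximate textbook figures (assumptions).

AIR_DENSITY = 1.204     # kg/m^3 at about 20 °C, 1 atm
AIR_CP = 1005.0         # J/(kg·K)
WATER_DENSITY = 998.0   # kg/m^3 at about 20 °C
WATER_CP = 4182.0       # J/(kg·K)

# Heat stored per cubic metre per kelvin: density times specific heat.
air_volumetric = AIR_DENSITY * AIR_CP
water_volumetric = WATER_DENSITY * WATER_CP

ratio = water_volumetric / air_volumetric
print(f"Air:   {air_volumetric:,.0f} J/(m^3·K)")
print(f"Water: {water_volumetric:,.0f} J/(m^3·K)")
print(f"Water carries about {ratio:,.0f} times more heat per unit volume")
```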

Page 18: Eurotech 30 01 2013

Ways of cooling car engines


Page 19: Eurotech 30 01 2013

Free cooling and energy re-use

PUE < 1 !!
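The "PUE < 1" claim is easiest to read as crediting reused heat against the facility's energy. As defined by The Green Grid, PUE is total facility energy divided by IT energy and so cannot fall below 1; the related metric ERE (Energy Reuse Effectiveness) subtracts reused energy and can. The sketch below shows both. Every energy figure in it is hypothetical, chosen only so that the PUE matches the 1.05 target mentioned in the key features.

```python
# PUE versus ERE with hypothetical energy figures.
# Only the 1.05 PUE target comes from the slides; the rest is illustrative.


def pue(total_facility_kwh: float, it_kwh: float) -> float:
    """Power Usage Effectiveness: total facility energy over IT energy."""
    return total_facility_kwh / it_kwh


def ere(total_facility_kwh: float, reused_kwh: float, it_kwh: float) -> float:
    """Energy Reuse Effectiveness: facility energy net of reuse, over IT energy."""
    return (total_facility_kwh - reused_kwh) / it_kwh


it_energy = 1000.0     # hypothetical IT energy
total_energy = 1050.0  # chosen to give the 1.05 PUE target
reused_energy = 200.0  # hypothetical heat recovered for reuse

print(f"PUE: {pue(total_energy, it_energy):.2f}")
print(f"ERE: {ere(total_energy, reused_energy, it_energy):.2f}")
```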

Page 20: Eurotech 30 01 2013

11,000 tons of CO2 saved!

1500 cars that do not circulate for 1 year

11,500 saved trees

15 km² of rain forest left untouched


“Green is the prime color of the world, and that from which its loveliness arises”

Pedro Calderón de la Barca