new twinkle, twinkle little star using 18.000 gpus to simulate jets in … · 2013. 11. 26. ·...
TRANSCRIPT
Text optional: Institutsname Prof. Dr. Hans Mustermann www.fzd.de Mitglied der Leibniz-Gemeinschaft
Twinkle, twinkle little star
using 18.000 GPUs to simulate jets in the cosmos
Text optional: Institutsname Prof. Dr. Hans Mustermann www.fzd.de Mitglied der Leibniz-Gemeinschaft
Twinkle, twinkle little star
using 18.000 GPUs to simulate jets in the cosmos
Michael Bussmann1, Heiko Burau1, Thomas E. Cowan1, Alexander Debus1, Axel Hübl1,
Guido Juckeland2, Thomas Kluge1, Wolfgang E. Nagel1, Richard Pausch1, Felix Schmitt2,
Ulrich Schramm1, Joseph Schuchart2,3, René Widera1
1 Helmholtz-Zentrum Dresden – Rossendorf 2 ZIH, Technical University Dresden 3 Oak Ridge National Laboratory
Slide 3 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
© Miguel Claro Night Sky Photography www.miguelclaro.com
We understand the Universe …
Slide 4 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
© Miguel Claro Night Sky Photography www.miguelclaro.com
… by the Light we see from Earth.
Slide 5 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
© Miguel Claro Night Sky Photography www.miguelclaro.com
We have built Telescopes to see Objects far away …
Slide 6 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
… and at any Wavelength of the Spectrum.
X-Ray: NASA/CXC/J.Hester (ASU); Optical: NASA/ESA/J.Hester & A.Loll (ASU); Infrared: NASA/JPL-Caltech/R.Gehrz (Univ. Minn.)
Slide 7 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
Monsters in the Sky ─ Pulsars, Active Galactic Nuclei & Black Holes
Relativistic Speeds
Turbulent Dynamics
Particle Acceleration
Relativistic Speeds
Turbulent Dynamics
Particle Acceleration
Video by CHANDRA X-Ray Observatory
Slide 8 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
Jets are Sources of Accelerated Particles
http://imgs.xkcd.com/comics/fountain.png
Slide 9 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
Can we have a Look inside these Jets?
NASA Publich Domain: http://en.wikipedia.org/wiki/File:Messier_87_Hubble_WikiSky.jpg
Slide 10 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
The Model Jet
v ≈ c
Gas around the Jet
Jet
Gas around the Jet
v ≈ c
Slide 11 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
Hydrogen Plasma
Hydrogen Plasma
Hydrogen Plasma
v ≈ c
The Model Jet
Slide 12 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
Hydrogen Plasma
Hydrogen Plasma
Hydrogen Plasma
vrelative
vrelative
vrelative
The Model Jet
Slide 13 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
+ +
+ +
+ + + +
─
─
─
─
─ ─
─ ─
The Model Jet
Slide 14 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
+
+ +
+
+
+
+
+
+
+ +
+
+
+
+ +
+
+
+ +
+
+
─
─
─
─
─
─
─
─
─ ─
─
─ ─
─
─
─
─
─
─
─
─
Slide 15 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
A Many-GPGPU Particle-in-Cell Code
Built to simulate Laser-Plasma Interaction
Compact Accelerators for Cancer Therapy
Ideal for Astro-Plasma-Physics
Open Source, free for Download
Slide 16 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
+
+ +
+
+
+
+
+
+
+ +
+
+
+
+ +
+
+
+ +
+
+
─
─
─
─
─
─
─
─
─ ─
─
─ ─
─
─
─
─
─
─
─
─
Moving Particles create Fields
Fields act back on Particles
Particles change Cells
Slide 17 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
// calculate electromagnetic field energy
algorithm::kernel::Foreach<math::CT::Int<TILE_WIDTH,TILE_HEIGHT,1> >()(
energyDBuffer.zone(),
energyDBuffer.origin(),
cursor::tools::slice(fieldE.origin()),
cursor::tools::slice(fieldB.origin()),
_1 = (_abs2(_2) + _abs2(_3) * MUE0_EPS0) * 0.5 * EPS0);
In-Kernel STL Magic
Slide 18 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
HOST NODE (CPU)
Communication Thread 1
GPU 1
GPU 2
Communication Thread 2
MPI
MPI
Slide 19 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
GPU
GPU
GPU
GPU
GPU
GPU
GPU
GPU
GPU
GPU
GPU
GPU
Slide 20 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
Computations on GPU
CPU for I/O only
Fast GPU Memory Access
Data + Task Parallelism
Concurrent Kernels
Asynchronous I/O
Slide 21 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
The largest kinetic Simulation of the relativistic KHI yet
46 × larger ─ 4.7 × higher resolution ─ 75 Billion Particles
Visualization by David Pugmire
Slide 22 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
There‘s more to it than Particles ─ Fields
Slide 23 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
Field and Particles show similar Structures
Slide 24 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
But we can‘t SEE Electrons from Millions of Lightyears away!
─
Slide 25 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
512 Frequencies, 481 Directions
─
Offline Analysis: 4 Pbyte of Trajectories
Slide 26 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
KHI-Skymap
Slide 27 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
Skymap with Particles
Slide 28 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
Skymap with Fields
Slide 29 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
Gordon-Bell Results
Slide 30 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
PIConGPU ─ Speedup from 16 to 18,432 Nodes
*ideal: 1152
Slide 31 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
PIConGPU ─ Strong Scaling 16 to 18,432 Nodes
Slide 32 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
PIConGPU ─ Weak Scaling 1 to 18,432 Nodes
Slide 33 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
PIConGPU ─ Weak Scaling 1 to 18,432 Nodes
Slide 34 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
PIConGPU ─ Performance MEASURED (Peak / Sustained)
CUPTI Metrics: flops_sp, flops_dp
7.176 PFLOP/s (dp)
+
1.449 PFLOP/s (sp)
Slide 35 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]
René
Axel
Heiko Richard
Slide 36 Michael Bussmann · Computational Radiation Physics · picongpu.hzdr.de [email protected]