SUSE® High Performance Computing
Kai Dupke, Senior Product Manager
SUSE Linux Enterprise
Distribution: pdf any
Date: 2015-10-22
Not a public document.
Meike Chabowski, Senior Product Marketing Manager
SUSE Linux Enterprise
HPC Impacts Our Lives – Space Research
HPC Impacts Our Lives – Weather & Climate
HPC Impacts Our Lives – Oil & Gas
HPC Impacts Our Lives – Entertainment
From Cinema ...
... to Games
High Productivity Computing
Big Data – or HPC??
Overview
HPC Overview
• Solving computational, data-intensive, or numerically-intensive tasks
• Reducing the time and effort required to set up and maintain HPC clusters
• Ensuring that all components of the HPC stack work together
HPC Development
• Yesterday
‒ Academia and Research
• Today
‒ Academia and Research
‒ Financial Services
‒ Oil and Gas
‒ Semiconductor
‒ Life Sciences
‒ Manufacturing
• Tomorrow
‒ Departmental and workgroup clusters
‒ High Productivity Computing
Market Segmentation
• Super Computer 'top 500'
‒ System budget: >500K$
‒ Ready: +++
‒ Key drivers: special-build HW, only performance counts, self-supported, partner supported
‒ GTM: HPC-IHV
• Divisional
‒ System budget: <500K$
‒ Ready: +++
‒ Key drivers: customized HW, partner driven, partner supported, SUSE supported
‒ GTM: IHV, ISV, SI
• Departmental
‒ System budget: <250K$
‒ Ready: ++
‒ Key drivers: commodity HW, business driven, SUSE supported
‒ GTM: Channel, SUSE
• Work Group
‒ System budget: <100K$
‒ Ready: +
‒ Key drivers: customer driven, home brewed
‒ GTM: Channel, Shop
• Lighthouse projects
• Government sponsored
• Generic workloads
• Often self-supported by Academic staff
• Specialized hardware
• Highly specialized application
• ROI and reliability are key
• Data Center support
• Commodity hardware
Split Market
Commercial High Productivity Computing
Scientific Top 500-class
SUSE Linux Enterprise HPC
SUSE – Strong in HPC Market!
SUSE: since 1992, strong in Top500
• HIGH PRODUCTIVITY COMPUTING
‒ Total, Baker Hughes, Texas Instruments, …
• MULTI- and MANY-CORE PROCESSOR SUPPORT
‒ Intel, AMD, POWER, …
• TECHNOLOGY
‒ Kernel 3.x, Lustre enablement, Ceph storage platform, up to 8192 cores, …
• COOPERATION
‒ IBM, SGI, HP, Dell, Bull, NEC, Cray, Cisco, …
• ACADEMIC AND RESEARCH
‒ LRZ / SuperMUC, BSC / MareNostrum, Tokyo Institute of Technology, Beijing Computing Center, NASA, …
Why Linux?
• Open Source benefits
‒ Easy to customize, maintain and improve
• Innovation
‒ Beowulf Clusters “born” on Linux
• Modularity
‒ GUI overhead not required
‒ Appliance form factors
• Linux Standards
‒ Large base of tools, including remote management
‒ Hardware availability
‒ Large vendor ecosystem surrounding Linux HPC clusters
Linux Preferred for HPC
• Linux
‒ runs on more than 97% of the world's top 500 supercomputers*
‒ is used by nearly 90% of general clusters
‒ is used in the majority of HPC systems, from smaller departmental implementations to larger, integrated cluster solutions
*top500.org July 2015
Why SUSE® Linux Enterprise Server for High Performance Computing
• Early player in HPC, pushing innovation and new technologies
• Highly reliable, interoperable and manageable server operating system
• Built to power mission-critical workloads in physical, virtual and cloud environments
• The natural successor to UNIX, backed by proven services for UNIX migration
• Special features to improve performance
• Backed by established ecosystem – support and certificates
• The only Linux recommended by Microsoft
SUSE Additional Features
• Up-to-date Linux Kernel for optimal performance
• CPU Management and System Activity
‒ CPUset System, CPUset command line tool
‒ Sysstat package
‒ IRQbalance
• OpenFabrics Enterprise Distribution (OFED)
‒ Remote Direct Memory Access (RDMA) switched fabric technologies, high-speed data transport technologies for server and storage connectivity
• SystemTap, LTTng 2.0
• Lustre enabled Kernel
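CPU sets are typically described with compact list strings such as "0-3,8", which is the notation the Linux kernel uses for cpuset CPU lists. As a rough illustration only, the sketch below expands such a string into individual CPU ids. The helper name parse_cpu_list is hypothetical and is not part of the cpuset tool or any SUSE package.

```python
def parse_cpu_list(spec):
    """Expand a CPU list string such as "0-3,8" into a sorted list of CPU ids.

    Hypothetical helper for illustration; not part of any cpuset tooling.
    """
    cpus = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate empty fields, e.g. an empty string
        if "-" in part:
            # A range "a-b" includes both end points.
            first, last = part.split("-", 1)
            cpus.update(range(int(first), int(last) + 1))
        else:
            cpus.add(int(part))
    return sorted(cpus)


print(parse_cpu_list("0-3,8,10-11"))
```

A list like this could then be used to decide which cores are reserved for compute jobs and which are left for system activity.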
SUSE Advanced I/O Processing
• Asynchronous I/O (AIO)
‒ Input/output processing that permits other processing to continue before the transmission has finished
• Modular I/O Scheduler
‒ The algorithm most suitable for the workload can be chosen dynamically
• Multi-core/hyper-threading processor support
‒ Execute threads in parallel within each individual processor
‒ Supports up to 4096 cores per system
• Intel I/O Acceleration
‒ Offloads work from the CPU to the network card, allowing the system to continue processing data while I/O is taking place
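On Linux, the I/O scheduler of a block device is exposed through the sysfs file /sys/block/<device>/queue/scheduler, which lists the available schedulers and marks the active one in square brackets. The sketch below shows one way to read that setting. The function names are hypothetical, and the scheduler names in the sample line are only an example of what a kernel of that generation might offer.

```python
def parse_scheduler_line(line):
    """Return (active, available) from a sysfs scheduler line.

    Example input: "noop deadline [cfq]". The active scheduler is the
    entry in square brackets; active is None if no entry is bracketed.
    """
    active = None
    available = []
    for name in line.split():
        if name.startswith("[") and name.endswith("]"):
            name = name[1:-1]
            active = name
        available.append(name)
    return active, available


def read_scheduler(device):
    """Read the scheduler setting for a block device such as "sda"."""
    with open(f"/sys/block/{device}/queue/scheduler") as handle:
        return parse_scheduler_line(handle.read())


print(parse_scheduler_line("noop deadline [cfq]"))
```

Writing one of the listed names to the same file (as root) switches the scheduler at runtime, which is how the choice can be made dynamically per device.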
Update
Simplify projects!
• Simplified model
‒ The number of socket pairs matters
‒ Socket pairs are accumulated per system
‒ Head nodes and compute nodes are treated equally
Example – regular HPC setup
• 2 head nodes (8 sockets each) for redundancy / scalability
• 100 compute nodes (4 sockets each)
• 416 sockets total (order: 208x 1-2 sockets)
Diagram: Client, Head Nodes (2x 8 sockets) + Compute Nodes (100x 4 sockets) = 416 sockets total
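The arithmetic behind this example can be sketched as follows. The function name socket_pairs is hypothetical, and rounding an odd socket count up to the next full pair is an assumption based on the "1-2 sockets" order unit, not a statement of the actual licensing terms.

```python
import math


def socket_pairs(sockets_per_node, node_count):
    """Socket pairs for a group of identical nodes.

    Pairs are accumulated per system; an odd socket count is rounded
    up to a full pair (assumption, see the "1-2 sockets" order unit).
    """
    pairs_per_node = math.ceil(sockets_per_node / 2)
    return pairs_per_node * node_count


# Head nodes and compute nodes are counted the same way.
head_pairs = socket_pairs(8, 2)         # 2 head nodes with 8 sockets each
compute_pairs = socket_pairs(4, 100)    # 100 compute nodes with 4 sockets each

total_sockets = 2 * 8 + 100 * 4
total_pairs = head_pairs + compute_pairs

print(total_sockets, total_pairs)  # prints: 416 208
```

Head nodes contribute 8 pairs and compute nodes 200 pairs, which matches the order quantity of 208 on the slide.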
Keep it running!
• SUSE Vendor Support
‒ Maintenance
‒ Standard & Priority support for the whole system
Recent Developments
• Storage
‒ Release of SUSE Storage with InfiniBand support
• ARM64
‒ Partnering with Cavium
‒ SUSE Linux Enterprise for ARM64
• Cloud
‒ MS Azure with SLES 12: RDMA & InfiniBand
• SUSE Linux Enterprise
‒ 11 SP4 with latest HW enablement (Intel, POWER8)
• Network
‒ Higher network throughput
‒ Added tunables in the IP stack (for lower latency)
Partner, Customers
Customers and Partners
Customers
Partners
PANGEA – Total Exploration
• Oil & Gas exploration
‒ Process seismic data
‒ Simulation of deposit fluids
• Superior Performance
‒ 2.3 petaflops
‒ 10x performance increase
‒ Equivalent to 27,000 PCs
• Future is coming
‒ 6.7 petaflops
‒ Equivalent to 80,000 PCs
“... from our point of view, SGI plus SUSE Linux Enterprise Server was a complete, integrated solution.”
“... SUSE Linux Enterprise Server gives us the ability to keep scaling on ever larger machines.”
— Diego Klahr, HPC Engineer, Total
Intel Cluster Ready Program
• Designed to simplify purchasing, deployment and management of HPC clusters
• SUSE Linux Enterprise Server is Intel Cluster Ready and powers many certified Intel Cluster Ready systems
• Intel® Cluster Ready “recipes” are available with SUSE Linux Enterprise Server
‒ Reference designs to help hardware vendors, platform integrators, and system integrators design and build certified Intel Cluster Ready systems
Outlook
Challenge
HPC market fast developing
Stack components provided by various vendors
Some stack components run in parallel
Mix of small and big vendors
Segmented into commercial and scientific
Outlook
• SUSE Linux Enterprise
‒ 12 SP1 beta program running
• SUSE HPC
‒ Evaluating optimized SUSE Linux Enterprise for HPC
‒ Your input is needed!
Forward-looking statement; might change without notice.
Thank you.
Learn more
www.suse.com/products/server/hpc.html
Kai Dupke, Senior Product Manager
SUSE Linux Enterprise
Meike Chabowski, Senior Product Marketing Manager
SUSE Linux Enterprise
[email protected]
Backup
HPC Stack
HPC Stack (diagram, top to bottom)
• Application
• Queuing / Management Software & Tools
‒ PBS Pro, Moab, IBM LSF, Bright CM
• Message Passing Interface (MPI)
‒ SGI, Intel, Parastation, HP, MPICH, openMPI
• Storage
‒ OCFS2, NFS, pNFS, GPFS, IBRIX, cephFS, Lustre, EXT3, XFS, BTRFS
• Network
‒ OFED, 10G, TCP offload
• SUSE Linux Enterprise Server
• Hardware
Legend: SUSE Partner, SUSE supported, SUSE future
SuperMUC
SuperMUC – Facts
• 60x faster than its predecessor; one of the fastest HPC systems in Europe
• 20x better performance per watt; provides green HPC
• > 155,000 Intel Xeon processor cores; migration from Itanium 2 to x86
LRZ - Leibniz Rechenzentrum
Europe’s supercomputer runs SUSE Linux Enterprise Server
Business challenge: LRZ is part of the Gauss Centre for Supercomputing (GCS), which operates the most powerful HPC infrastructure in Europe, and needs to provide researchers across Europe with a reliable and powerful HPC platform that enables users to make faster progress in their complex research projects. To reduce the environmental impact of HPC, the institution aimed to improve energy efficiency and to leverage established automation solutions to maximise the efficiency and manageability of the new supercomputing platform.
Benefits:
• Completed an easy and smooth migration from the previous Itanium 2 infrastructure to the new x86 processor architecture
• Considerably simplified configuration and automation of the new system, using the automation capabilities of AutoYaST (integrated with SLES)
• Improved energy efficiency: SuperMUC delivers approx. 20 times more performance per watt than its predecessor
• Boosted overall performance by a factor of 60
Solution: Working with SUSE and IBM, LRZ implemented SuperMUC with approx. 9,400 general purpose computing nodes and a peak performance of three Petaflop/s, comprising 155,000 Intel Xeon processor cores and more than 300 TB main memory. LRZ chose to run SuperMUC on SUSE Linux Enterprise Server, leveraging SUSE’s proven HPC expertise and leading automation tools such as AutoYaST, which allows systems to be installed without manual intervention.
“We have relied on SUSE Linux Enterprise Server for 15 years, and have always been very satisfied.
The SUSE team is close at hand, should we require support or guidance.
We have received highly competent support over the years, and look forward to collaborating with them.”
— Dr. Herbert Huber
Division Head of Supercomputing, Leibniz Rechenzentrum
SuperMUC – Facts and Business AspectsSUSE® High Performance Computing
SuperMUC – System Overview
SuperMUC – Business Aspects
• Hot Water Cooling – reduce cooling cost
‒ Use free air cooling
‒ Use of system heat for heating and technical processes
• RAS driven – high system availability
‒ Fully maintained SUSE Linux Enterprise Server
‒ Full support via IBM and SUSE
• Automated deployment – less management cost
‒ Full use of SUSE's AutoYaST feature
SuperMUC – SUSE benefits
• Support for Itanium 2 and x86
‒ Smooth migration from old to new system
‒ No additional staff training needed
• Great support experience
‒ Cooperation for more than 15 years
‒ Backed by SUSE's winning support
• Easy deployment methods
‒ SUSE's AutoYaST used today
‒ Other SUSE offerings – SUSE Cloud, SUSE Manager – considered
Corporate Headquarters
Maxfeldstrasse 5
90409 Nuremberg
Germany
+49 911 740 53 0 (Worldwide)
www.suse.com
Join us on: www.opensuse.org
Unpublished Work of SUSE. All Rights Reserved.
This work is an unpublished work and contains confidential, proprietary and trade secret information of SUSE. Access to this work is restricted to SUSE employees who have a need to know to perform tasks within the scope of their assignments. No part of this work may be practiced, performed, copied, distributed, revised, modified, translated, abridged, condensed, expanded, collected, or adapted without the prior written consent of SUSE. Any use or exploitation of this work without authorization could subject the perpetrator to criminal and civil liability.
General Disclaimer
This document is not to be construed as a promise by any participating company to develop, deliver, or market a product. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. SUSE makes no representations or warranties with respect to the contents of this document, and specifically disclaims any express or implied warranties of merchantability or fitness for any particular purpose. The development, release, and timing of features or functionality described for SUSE products remains at the sole discretion of SUSE. Further, SUSE reserves the right to revise this document and to make changes to its content, at any time, without obligation to notify any person or entity of such revisions or changes. All SUSE marks referenced in this presentation are trademarks or registered trademarks of Novell, Inc. in the United States and other countries. All third-party trademarks are the property of their respective owners.