Ninth Annual Workshop for the EE HPC WG: Panel Discussion

Software for Energy Efficiency PowerStack, PowerAPI, READEX, GEOPM

Ninth Annual Workshop for the EE HPC WG: Panel Discussion – Software for Energy Efficiency (PowerStack, PowerAPI, READEX, GEOPM) 1


PANEL MODERATOR

Steven J. Martin, Cray Inc. Staff Engineer, Hardware Management Group


Panel Agenda

• Introductions (3 min)
• Short panelist presentations (12 min)
• Open Q&A (45 min)


Panel Introductions

• GEOPM: Jonathan Eastep (Intel)
• PowerAPI: Ryan Eric Grant (Sandia)
• PowerStack: Siddhartha Jana (Intel)
• READEX: Andreas Gocht (TU-Dresden)


Introduction to GEOPM

• Global Extensible Open Power Manager
– Community collaborative open source project, started and supported by Intel
– Project page: https://geopm.github.io/
– Principal Investigator: [email protected]
• Product-grade job-level runtime
– Globally coordinates optimizations to HW or SW control knob settings across compute nodes in a job
– Promises to increase performance or energy efficiency by 5-30%
   • Depending on the selected optimization plugin
   • And depending on application characteristics and on the controls and monitors available


Key Technologies

• In-band, feedback-guided optimization of control knob settings: application profiling data from HW performance counters plus PMPI and OMPT

• Global application awareness: analysis of long history of profile data for forecasting exploitable patterns such as application phases and load imbalance across nodes

• Scalable global optimization: coordinated tuning of control knobs within and across nodes leveraging application analysis above; tree-hierarchical / recursive optimization algorithms and aggregation of profile data

• Flexibility and extensibility through plugin architecture: vendor HW-neutral (cross-platform) and customizable to each center’s individual performance vs energy efficiency preferences

[Figure: GEOPM architecture diagram. Labels: Power-Aware RM / Scheduler; MPI Comms Overlay; GEOPM Controller tree (GEOPM Root, two GEOPM Aggregators, four GEOPM Leaf controllers); Shared Mem Region (SHM); Library or Driver such as msr-safe; HW IO; Node HW under each leaf; MPI ranks split across the four nodes as 0 to i-1, i to j-1, j to k-1, and k to n-1.]
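The tree-hierarchical aggregation and coordinated tuning described on this slide can be sketched in a few lines. This is only an illustration under stated assumptions: leaves report a per-node runtime, inner controllers sum the reports on the way up, and the root splits a job power budget in proportion to runtime so that slower nodes get more power. The names `Node`, `aggregate`, `distribute` and `leaves`, the proportional policy, and the 4-node layout are all hypothetical; none of this is GEOPM's API or its actual algorithm.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """One controller in the tree (hypothetical); leaves carry a measured runtime."""
    name: str
    runtime_s: float = 0.0          # measured at leaves, summed at inner nodes
    children: List["Node"] = field(default_factory=list)
    power_w: float = 0.0            # budget assigned during the downward pass


def aggregate(node: Node) -> float:
    """Upward pass: each inner node stores the sum of its subtree's runtimes."""
    if node.children:
        node.runtime_s = sum(aggregate(child) for child in node.children)
    return node.runtime_s


def distribute(node: Node, budget_w: float) -> None:
    """Downward pass: split the budget among children by their runtime share."""
    node.power_w = budget_w
    for child in node.children:
        if node.runtime_s > 0:
            share = child.runtime_s / node.runtime_s
        else:
            share = 1.0 / len(node.children)   # no data yet: split evenly
        distribute(child, budget_w * share)


def leaves(node: Node) -> List[Node]:
    """Collect the leaf controllers of a subtree, left to right."""
    if not node.children:
        return [node]
    return [leaf for child in node.children for leaf in leaves(child)]


# Assumed job layout: root -> 2 aggregators -> 2 leaves each; leaf3 is the straggler.
tree = Node("root", children=[
    Node("agg0", children=[Node("leaf0", 10.0), Node("leaf1", 10.0)]),
    Node("agg1", children=[Node("leaf2", 10.0), Node("leaf3", 20.0)]),
])
aggregate(tree)
distribute(tree, budget_w=1000.0)
for leaf in leaves(tree):
    print(f"{leaf.name}: {leaf.power_w:.0f} W")
```

With these made-up runtimes the straggler `leaf3` receives 400 W and the other three leaves 200 W each. Each controller only talks to its parent and children, which is the property that lets a tree scale with node count.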


GEOPM Project Status

• Long-running collaboration with PowerAPI and PowerStack teams
• GEOPM Beta release available: https://geopm.github.io/
• GEOPM v1.0 release candidate available before Christmas
• First production deployment is imminent: Argonne Theta system
• Work in progress on additional deployments at LRZ, CINECA, LLNL, ...
• Broader deployments via OpenHPC and TOSS
– GEOPM Beta and msr-safe included in latest OpenHPC release (v1.3.6)
– Work in progress on integrating GEOPM into future TOSS releases
• New course at TUM leveraging GEOPM for course project infrastructure
• Discussions beginning with system builders to co-develop GEOPM enhancements that give their systems a competitive edge


The HPC PowerStack (1/2)
https://powerstack.lrr.in.tum.de
https://gitlab.com/powerstack/

• Collaboration towards a well-defined, community-wide stack that accounts for power-awareness across various layers of the HPC software ecosystem
• Collaborators include vendors, academia, and government labs

Charter:
1. Identify the different actors that play a role in energy- and power-aware job scheduling and resource management
2. Reach a community-wide consensus on the roles and responsibilities of the different actors, their interoperability, and communication protocols
3. Work towards prototypes and full-scale production-grade solutions that are adaptive and feedback-driven


The HPC PowerStack (2/2)
https://powerstack.lrr.in.tum.de
https://gitlab.com/powerstack/

Next Steps:
• First round of working group meetings in the coming weeks
• Topic-specific working groups: periodic meetings
– Once every 1.5 months (~6 weeks)
– Subscribe to mailing lists
– Subscribe to the GitLab project
• SC-18 BoF, Nov 15 (Thursday), 12:15 - 1:15pm, Room D167

Mailing list names:
• PowerStack Announcements [email protected]
• PowerStack Development [email protected]
• PowerStack Adaptive Runtime and Control [email protected]
• PowerStack Platform Interaction [email protected]
• PowerStack Site Policy and Verification [email protected]


Power API

Ryan Grant
Principal Member of Technical Staff, Sandia National Laboratories
Chair of the Power API Specification Committee

Working on: Power API Spec, Community Reference Implementation


New Advances in the Power API

• Community model
– New specifications document
– Open meetings
– Multi-institution involvement
• New convenience functions
– Streamline writing code with object name length queries
• New reporting functions
– Allow multiple statistics to be gathered in a single object (forthcoming)
• New notification methods across the interface hierarchy
– Upcoming

Power API and Redfish BoF: Wednesday 12:15-1:15, Room D227


READEX

• Finished 08/2018
• Funded by the European Union's Horizon 2020 research and innovation programme under grant agreement No 671657
• Website: www.readex.eu


Energy Consumption

[Figure: bar chart with two series, "Default Energy Consumption" and "Energy Consumption With READEX RRL runtime related to default". Vertical axis: 0% to 140%.
Benchmarks: Kripke (compiler, Intel); blasbench (compiler, GCC); Kripke (compiler, GCC); NPB3.3-OMP (compiler, GCC); OpenFOAM (compiler, GCC); blasbench (compiler, Intel); BEM4I (compiler, Intel); Alya (manual, Intel).
Bar labels, default: 43, 56, 78, 457, 74, 15, 103, 38 kJ.
Bar labels, with READEX: 39, 47, 75, 418, 68, 13, 93, 34 kJ.]


Q&A

Thank you for participating!
