amd energy star server energy efficiency technology...4 “jaguar”cpu cores gcn graphics compute...
TRANSCRIPT
AMD Opteron Energy Efficiency Technology
ENERGY STAR Computer Servers Off-Season Meeting
June 2014
David Reiner | Server Performance & Power Optimization Manager
Donna Sadowy | Sr. Manager, Government Relations
AMD 2013-2014 SERVER ROADMAP
AMD roadmaps are subject to change without notice or obligations to notify of changes. Placement of boxes intendedto represent first year of production shipments.
12 or 16 “Piledriver” CPU Cores35W-140W
2013 2014
2P and 4P Enterprise,
Mainstream Platforms
AMD Opteron™ 6300 and 4300 Series4, 6, 8, 12 or 16 “Piledriver” CPU Cores
32nm
“Warsaw” CPU
32nm
1P Web/EnterpriseServices Clusters
AMD Opteron™ 3300 Series4 or 8 “Piledriver” CPU Cores
25W-65W TDP
32nm
AMD Opteron™ X1150 CPU and X2150 APU4 “Jaguar” CPU Cores
GCN Graphics Compute Units (APU)9W-22W
28nm SoC
“Berlin” CPU/APU4 “Steamroller” CPU Cores
GCN Graphics Compute Units (APU) HSA Features (APU)
28nm
“Seattle” CPUARM “A57” CPU Cores
28nm SoC
A History of Energy Efficiency
1.0x
2.0x
3.0x
4.0x
5.0x
6.0x
7.0x
8.0x
9.0x
10.0x
2008 2009 2010 2011 2012 2013 2014
ENER
GY
EFFI
CIE
NC
Y (L
OG
SC
ALE
)
“Puma”
“Tigris”
“Danube”
“Llano”
“Trinity”
“Richland”
“Kaveri” 10X
2013201220112010
Dynamic power tracking and management with DVFS
Thermal aware power
management
Platform aware dynamic thermal management
Fine grained power gating
Dynamic CPU GPU power sharing
2014
Dynamic thermal tracking for short
term boost
Finer grained voltage planes
Voltage-adaptive frequency
scaling
ACPI driven workload specific optimization
Intelligent boost andPerformance aware energy
optimization
Finer grained power tracking, increased voltage granularity
OpenCL GPU compute moving to full HSA enabled programming
2015
Per part adaptive voltage
Integrated voltage
regulation
Video encode acceleration
Video decode acceleration
Inter-frame power gating
Audio acceleration
Power efficient APU architecture integrating GPU+CPU and accelerators
In market In product In development
Core 2
$
Core 1
Core 0
CHIP-LEVEL POWER DISTRIBUTIONS
TDP
100%
Power consumption
(and hence performance)
is set by the cooling
capabilities of theplatform
Core 390% VCE
80%
L270%
60%
Power varies a lot byworkload
GPU50%
power40%
We measure and
manage the power of
each component on the
chip to generate the best
performance/watt
30%
UVD MC20%
10% Memory PADS
power
0%
App 1 App 2 App 3
PA
DS
Dis
pla
yP
HY
s&
DA
CF
CH
GPUpower
GPUpower
GPUpower
CPUpowerCPU
powerCPU
Display power
Display power
Display
power
FCHpower
FCHpower
FCHpower
Other chip power
Other chip power
Other chip
AMD Turbo Core Technology
To manage temperature and
send the power wherever
it’s needed, we use power
monitors in all chip
components
“Kyoto” has power monitorsin each CPU, the GPU, thedisplay interface, and theFCH
P1AMD Turbo CORE
The central controller uses
this information to optimize
performance within thermal
constraints
CPU CPU CPU CPU
Monitor Events
Weights Weights Weights Weights
Weights Events
CAC CAC CAC CAC
Add-in Leakage Monitor Monitor Monitor Monitor
Calculate Power APU
(P = PLeakage + CacV2f) P-states
Pboost
Add in calculated GPU power
Manager Change P2
FCH CAC Monitor P-state based
(Microcontroller) on TDP credits
Pmin
GPU Display
EVOLUTION OF AMD TURBO CORE TECHNOLOGY
AMD A-Series APU
CPUGPU
Year ProcessorBoosting decision
based on Notes
2010 AMD Phenom™ II Number of cores active Single boost Pstate used if half or more cores are inactive
Coarse-grain power margin exploited
20111st-Generation
Calculated power
Unidirectional power transfer between thermal entities
GPUCPU
Exploit fine-grain power margin
20122nd-Generation
AMD A-Series APU
Calculated power
Calculated temperature
Bidirectional power transfer between thermal entities
GPUCPU
Exploit temperature margin
2013
3rd-Generation AMD A-Series APU
("Richland“)
Calculated power
Calculated temperature
Measured/Sensor temperature
Efficiency of power usage by individual
entities (CPU, GPU, etc.)
Designed to more effectively exploit temperature marginby detecting favorable thermal conditions in real time
Intelligent Boost
Core 0
Core 3
Core 2
Core 1
CHIP-LEVEL POWER DISTRIBUTIONS: GPU-CENTRIC
Lower-power cores serve as a heat sink for the active GPUTDP
100%
90%
80%
70%
60%
GPU50%
40%
30%
20%
10%
0%
App 1
GPUpower
CPUpower
Display power
FCHpower
Other chip
power
Core 0
Core 3
Core 2
Core 1
CHIP-LEVEL POWER DISTRIBUTIONS: CPU-CENTRIC
Lower-power GPU serves as a heat sink for the active CPUsTDP
100%
90%
80%
70%
60%
GPU50%
40%
30%
20%
10%
0%
App 2
GPUpower
CPUpower
Display power
FCHpower
Other chip
power
Avoiding power waste with Intelligent Boost control• Intelligent Boost is designed to
avoid power waste that results from boosting applications that benefit very little from higher frequency
• Enables long battery life and cool operation while maintaining great performance
• Power management micro-controller tracks application behavior real-time to determine frequency sensitivity
• Boost behavior is adjusted accordingly
Performance metrics tracked by power manager
Freq
uen
cy S
ensi
tivi
ty
(i.e
. % p
erf
gain
fo
r %
fre
qga
in)
BOOST THESE
DON’T BOOST THESE
Different applications
FABRICS REDUCE THE POWER CONSUMPTION OF EVERY SERVER
~70% of energy consumption in servers from components beyond CPU
Eliminate unnecessary components and functions
Remove tiers of networking equipment and thousands of cables
SeaMicroIOVT TIO
Freedom SupercomputeFabric
SHARE• Sharing instead of
replicating components eliminates pieces from motherboard
• Reduces power and space
POWER OPTIMIZE• Turning off unneeded
logic blocks saves power
LINK EFFICENT UNITS• A low cost, on board
interconnect allows removal of top of rack switching and increases bandwidth
Slide Sources
- ““Richland” Client APU” Presentation by Praveen Dongara, Lloyd Bircher, John Darrilek - Hot Chips 25, August 2013
- “AMD Product and Tech Roadmaps 5.5.14”
- “Energy Efficiency” by Sam Naffziger, June 16, 2014
- “AMD “Kabini” APU SOC” by Dan Bouvier, Ben Bates, Walter Fry, Sreekanth Godey – Hot Chips 25, August 2013
- “Energy Efficiency Messaging” May 9, 2014
- “AMD Advanced Power Management” by Sam Naffziger, April 2014
24
DISCLAIMER & ATTRIBUTION
The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors.
The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to product and roadmap
changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software
changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD
reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such
revisions or changes.
AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANYINACCURACIES, ERRORS OR OMISSIONS THAT MAYAPPEAR IN THIS INFORMATION.
AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT
WILLAMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF
ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLYADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
ATTRIBUTION
© 2014Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo and combinations thereof are trademarks of Advanced Micro Devices, Inc.
in the United States and/or other jurisdictions. Names are for informational purposes only and may be trademarks of their respective owners.
25