efficient power management for memory in soft real time systems midterm presentation
Post on 13-Jan-2016
214 Views
Preview:
TRANSCRIPT
Efficient Power Management for Memory in Soft Real Time Systems
Midterm presentation
DRAM
Every Chip and rank in the DRAM has several powermodes:
Transition in and out of lower power modes needs time <-> read writes are only possible in active mode!
Power saving methods
• Power aware scheduler• Power augmented History Scheduler• Throttling• Power dynamic transition• Power static transition
Power saving methods
• Power down unused chips (PD)→ OS/Memory scheduler should use just minimal
needed number of chips• Group read write accesses timely
– Scheduler has to look at a stack of instructions and reorder them in a reasonable manner
• Throttle speed/queue
Power saving methods
DRAM
Processors/Caches
MemoryQueue
Scheduler
ReadWrite
Queues
Reads/WritesMEMORYCONTROLLER
1. Read/Write instructions are queued in a stack
2. Scheduler (AHB) decides which instruction is preferred
3. Subsequently instructions are transferred into FIFO Memory Queue
Chosen ApproachA. Many Power Down mechanism were evolved for DRAM, but
usually they just handle two Power Modes– Use all 4 (active, standby, nap, power down)
B. Algorithm‘s primary concern is to meet timing constraintsC. If no negative influence is expected power down a chip/rank
(according to simple algorithm):
D. Maybe allow fast „backdoor“ for highest priority tasks– Chips/Ranks where critical tasks are assigned to never power down and
are served with FIFO manner
E. Throttle commands in Queue to allow better task execution time estimation, reordering mechanism and longer power down times
A. Powering down unused chipsRead/Write Queue
C:1 - R:1 - B-1 - …
C:1 - …. - …
C:1 - … - …
C:2 - … - …
C:1 - … - …
C:2 - … - … -
C:1 - … - … -
C:2 - … - … -
Chip 3
Chip 2
K2 epochs later
K3 epochs laterPM
t
A. Reordering
Read/Write Queue
C:1 - R:1 - B-1 - …
C:1 - …. - …
C:1 - … - …
C:2 - … - …
C:1 - … - …
C:2 - … - … -
C:1 - … - … -
C:2 - … - … -
Chip 1
Chip 2
Manipulate PM
B. Guarantee Soft RT
• Set Power Mode to result in a shorter estimated execution time then deadline
• The Transfer function can be estimated by
+
ttopt
Transfer functionController
-
Command threshold
If timing constraint is violated set a certain threshold for power down
C. Power down used chips with gaps
1 For I from 1 to n2 Get Deadline miss for task i3 If deadline misses > 0 4 PMi = 1 // set the power mode of the concerned rank/chip to highest.5 else6 Get Deadline for task i7 Calculate tact for task i8 Set tact (T) < tdl //set T so that t is just shorter then tdl 9 Calculate best Energy consumption for tasks 10 If optimal Energy PM is higher then optimal Timing PM10a Then use optimal Energy PM10b Else use optimal Timing PM11 Get which Rank/Chip wasn’t used for a certain threshold (counting flags)12 Power them down one step13 Assign PM to DRAM
Pseudo code Control Algorithm:DRAM example with Matlab
1020 1020 1 1 51040 1040 2 2 01060 1060 3 3 01080 1080 4 4 063500 63500 5 5 0156520 156520 6 6 0337140 337140 7 7 0656360 656360 8 8 01181980 1181980 1 8 02001000 2001000 2 8 0
1 0 0 0 0 0 0 40 3 0 0 0 0 0 40 0 3 0 0 0 0 00 0 0 3 0 0 0 00 0 0 0 4 0 0 00 0 0 0 0 4 0 00 0 0 0 0 0 4 00 0 0 0 0 0 0 4
Tasks 1- 10:(T,Dl,adr1,adr2, misses)
D. Power down used chips with gaps
• Tasks power down primary when timing constraints are hold but with reasonable power budget: tfastPM < topt < dl• Energy loop has only little influence• EL is mainly concerned with unused chips
E. Throttling Approach
• “Force memory commands to wait in the memory controller, DRAM structures can remain in low power mode for arbitrarily long periods of time, thereby modulating the DRAM’s average power consumption over some small time interval.”
. . .
10,000 cycles 10,000 cycles
Tcycles
active stall active stall
time
Tcycles
13
E. Key Difficulty: to determine accurate throttling
0
20
40
60
80
100
120
Throttling Degree (Execution Time)
DRAM Power
Inaccurate throttling Power consumption is over the budget Unnecessary performance loss
A B
Application 1App. 2
E. Existing Method
• By Ibrahim Hur and Calvin Lin• Model features that we determine
• Power threshold• Number of Reads• Number of Writes• Bank conflict information
• System model• Compute model coefficients during system
design/installation by experiments with various memory access behavior.
15
Monitor
Set PowerPs = 60W
Write RatesRead Rates
RAM CtrlController
P(n-1)
K*Δp(n)
RAM
Power Calc
Throttling: y(n)
E. Throttling Mechanism with Feedback Control
E. Power Model
Total Power:
Detailed information refer to :http://www.micron.com/products/dram/syscalc.html And DRAMsim Manual at http://www.ece.umd.edu/DRAMsim/download/DRAMsimManual.pdf
E. Simulator-DRAMsim+VisTool
Device Utilization Utilization Statistic
Outlook for future work
•Feedback for change of threshold
•Reordering Mechanism
•“Backdoor”
•Throttling Mechanism which doesn’t violate timing constraints
•Simulations to show superior RT behavior of memory controller
top related