ee 382n microarchitecture yale patt, instructor eiman ebrahimi, khubaib, tas

23
EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs The University of Texas at Austin Spring semester, 2010

Upload: inoke

Post on 21-Feb-2016

48 views

Category:

Documents


0 download

DESCRIPTION

EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs The University of Texas at Austin Spring semester, 2010. Compile-time outline of what we will do. Introduction and Focus on the Fundamentals Tradeoffs Mechanisms: run-time and compile-time - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

EE 382N Microarchitecture

Yale Patt, instructorEiman Ebrahimi, Khubaib, TAs

The University of Texas at AustinSpring semester, 2010

Page 2: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Compile-time outline of what we will do

• Introduction and Focus on the Fundamentals

• Tradeoffs

• Mechanisms: run-time and compile-time

• Approaches to concurrency

• Impact of Multi-core

Page 3: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Lecture 1: Introduction and Focus

Page 4: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Outline

• A science of tradeoffs• The transformation hierarchy• The algorithm, the compiler, the microarchitecture• The microarchitecture view• The physical view• Speculation• Design points• Design Principles• Role of the Architect• Numbers • Embedded processors – because…• Thinking outside the box

Page 5: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Trade-offs, the overriding consideration:What is the cost?

What is the benefit?

• Global view– Global vs. Local transformations

• Microarchitecture view– The three ingredients to performance

• Physical view– Wire delay (recently relevant)– Power, energy (recently relevant)– Soft errors (recently relevant)– Partitioning (since the beginning of time)

Page 6: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Trade-offs, the overriding consideration:What is the cost?

What is the benefit?

• Global view– Global vs. Local transformations

• Microarchitecture view– The three ingredients to performance

• Physical view– Wire delay (recently relevant)– Power, energy (recently relevant)– Soft errors (recently relevant)– Partitioning (since the beginning of time)

Page 7: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Algorithm

Program

ISA (Instruction Set Arch)

Microarchitecture

Circuits

Problem

Electrons

Page 8: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

The Triangle(originally from George Michael)

• Only the programmer knows the ALGORITHM– Pragmas– Pointer chasing– Partition code, data

• Only the COMPILER knows the future (sort of ??)– Predication– Prefetch/Poststore– Block-structured ISA

• Only the HARDWARE knows the past– Branch directions – Cache misses– Functional unit latency

Page 9: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Trade-offs, the overriding consideration:What is the cost?

What is the benefit?

• Global view– Global vs. Local transformations

• Microarchitecture view– The three ingredients to performance

• Physical view– Wire delay (recently relevant)– Power, energy (recently relevant)– Soft errors (recently relevant)– Partitioning (since the beginning of time)

Page 10: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs
Page 11: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Trade-offs, the overriding consideration:What is the cost?

What is the benefit?

• Global view– Global vs. Local transformations

• Microarchitecture view– The three ingredients to performance

• Physical view– Wire delay (recently relevant)– Power, energy (recently relevant)– Soft errors (recently relevant)– Partitioning (since the beginning of time)

Page 12: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Outline

• A science of tradeoffs• The transformation hierarchy• The algorithm, the compiler, the microarchitecture• The microarchitecture view• The physical view• Speculation• Design points• Design Principles• Role of the Architect• Numbers • Embedded processors – because…• Thinking outside the box

Page 13: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Speculation

• Why good? – improves performance

• How? – we guess– Branch prediction– Way prediction– Data prefetching– Value prediction– Address prediction– Recent visibility: Memory disambiguation

• Why bad? – consumes energyl

Page 14: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs
Page 15: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Design Principles

• Critical path design

• Bread and Butter design

• Balanced design

Page 16: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs
Page 17: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Numbers(because comparch is obsessed with numbers)

• The Baseline – Make sure it is the best– Superlinear speedup– Recent example, one core vs. 4 cores with ability to fork

• The Simulator you use – Is it bug-free?• Understanding vs “See, it works!”

– 16/64• You get to choose your experiments

– SMT, throughput: run the idle process– Combining cores: what should each core look like

• You get to choose the data you report– Wrong path detection: WHEN was the wrong path detected

• Never gloss over anomalous data

Page 18: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Embedded processors vs General Purpose(because it represents 99%+ of the market)

• Mostly everything about general purpose applies

• Advantage: easier to design wholistically– advantage of special purpose or limited purpose

• Greater use of ASICs– Although that could change

• Why VLIW is good (though bad for general purpose)

• Partitioning is critical

• Memory latency

• Reconfigurability

Page 19: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Finally, people are always telling you:Think outside the box

Page 20: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

I prefer: Expand the box

Page 21: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs
Page 22: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

Something we are all familiar with:Look-ahead Carry Generators

• They speed up ADDITION

• But why do they work?

Page 23: EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

12 9

21

182378645259827637

Addition