1 reading assignment presentations for en0291 s40 “effect of increasing chip density on the...

Reading assignment presentations for EN0291 S40

“Effect of increasing chip density on the evolution of computer architectures,” IBM J. Res & Dev, Vol. 46. Brendan

“Repeater scaling and its impact on CAD,” IEEE Trans. on CAD, Vol. 23(4) Elif

“SOI technology for the GHz era,” IBM J. Res. & Dev., Vol. 46. Cesare

“Turning Silicon on Its Edge,” IEEE Circuits & Devices Magazine, Jan/Feb’04. Yiwen

Effect of Increasing Chip Density on the Evolution of Computer Architectures

R. Nair

IBM Journal of Research and Development

Volume 46 Number 2/3 March/May 2002

International Roadmap for Semiconductors (1999)

A Billion Transistors on a Chip

• “What functions will be expected of billion-transistor chips, and how will they be organized?”

• Move Memory closer to the processors (physically speaking)

• System on a Chip – integration on the same chip of varied structures such as processors, DRAM, sensors, and transducers

Processor Evolution

• New generations depending on prediction algorithms

• Performance benefit decreasing

• Sometimes simpler is better!

The Current Techniques (Benchmarks)

• Increasing pipeline depth and frequency

• Fewer applications responding well

Cellular Architectures

• Little communication overhead between threads

• Connectionist architecture – large number of processors with little memory

• Advantages– Off-the-shelf commodity parts– Use existing compilers– Possibilities of redundancy

System-on-a-Chip

• Integrate functions that are outside processor

• Reduce communication costs between elements

• Current state – performance decrease when combining technologies on one die

• Help with clock skew

Conclusions

• Convergence of processors

• Less focus on more computation power

• Scalable, distributed computing

The Scaling Challenge:Can Correct-by-Construction Design Help?

P. Saxena N. Menezes P. Cocchini D. A. Kirkpatrick

Intel Labs. (CAD Research)

EN0291

Elif Alpaslan

Introduction:FROM LAST LECTURE:

• Cmos scaling in VLSI chips bring new design concerns: • increasing dominance of interconnects • leakage

• In this paper : Results of scaling studies in the context of typical block level wiring distributions, and study the impact of the identified

trends on post-RTL design process.

• Goal of the paper: To show how does exponentially increasing repeater and clocked repeater count will effect logic synthesis, technology mapping, layout and new research problems relevant to future designs.

Some Scaling Experiments (SPICE)

Critical repeater length

Min. distance at which inserting repeater speeds up line

Critical sequential length

Max. distance that signal can travel in an optimally sized and buffered interconnect in 1 cycle

CRL for M3 & M6 shrink at the rate of 0.57x per generation ~ faster than normal scaling of 0.7x

CSL shrink at a rate of 0.43x per generation ~ faster than normal scaling and the rate of decrease in CRL

Additional repeaters need to be added during a shrink of an optimal repeated interconnect from one process generation to next.

Ideally shrink interconnects won’t only require additional repeaters but many of them need to be clocked

Impact of decrease in critical repeater and sequential lengths

How CRL & CSL are migrated across Block Level Wiring Histogram ?

• # of nets requiring repeaters ~ area under histogram curve to right of line representing critical length

•left migration of critical length exponentially increasing # of nets

Impact of decrease in critical repeater and sequential lengths (cont.)

Block Level Wiring Histogram (Zoomed View)

Increasingly steep slope of curve (log scale on y-axis) => # impacted nets Increasingly steep slope of curve (log scale on y-axis) => # impacted nets exploding!exploding!

Impact of decrease in critical repeater and sequential lengths (cont.)

• Percentage of block-level nets impacted by repeaters

• Percentage of block-level nets impacted by clocked repeaters

Impact on POST-RTL CAD

• Logic Synthesis and Technology Mapping:

– Metrics that drive capacitive load of wires are traditional literal or gate-count and fanout –based wire load metrics and they don’t take into account interconnect repeaters

– Gate count metric can lead to wrong heuristic choices during early stage of synthesis due to more delay migration to repeated interconnects.

– Amount of logic available in a single pipeline stage shrinks

Fanout-based metrics can be misleading due to isolation of some sinks of an interconnect from its driver by a repeater

Maximum possible benefit of a good logic synthesis solution reduces

Impact on POST-RTL CAD

• Placement and Routing:– Biggest impact of repeaters on placement stage– # of repeaters required by an interconnect is strongly dependant on placement of cells

• Current Placement Algorithms:– Handle repeater insertion by reserving a certain fraction of block area for repeaters prior placement

and then inserts them into long nets using ECO’s after placement• ECO technique breaks down when more than 5-10%nets in netlist changes

– Block level placement algorithms at any level have to deal with the complications that arising from repeater requirements for nets at any other levels of hierarchy.

– When CSL shrink below to the dimensions of synthesizable block placement algorithms has to handle clocked repeater insertion which is not as straight forward as buffering.

• Routing:– Routers can’t operate in a purely geometric world, it must understand buffering– Complications due to large number of via blockage – Complications due to the multi pin nets

• Presented by Cesare Ferri

SOI technology for the GHz era(by G. G. Shahidi – 2002 IBM)

• Silicon-on-Insulator (SOI) : Technology Introduction

• Brief History

• SOI vs. Bulk (power, performance, scaling)

• Applications

• Future Trends

smaller Capacitance of the switch

SOI - Introduction

SOI : Process Technology Basic Idea : placing a

thin layer of insulator upon the substrate

p-substrate

n+SiO2

Si-polyS D

CSBCDB

A lot of capacitance here (i.e. slow)

p-substrate

SiO2Si-poly D

Faster Transistor!

Less area junction capacitance

No capacitance here (i.e. fast)

CSBCDB

SOI : video (© IBM 2002)

SOI : Brief History

• First developed by IBM in early 70s• Not suitable until `90 (expensive process,

progress in bulk CMOS by scaling)• FD(fully depleted)–SOI vs. PD(partially

depleted)–SOI• IBM Fabrication technique : SIMOX

(Separation by Implantation of Oxygen)

Implant Oxigen Annealing

SOI vs. Bulk• Pros:

– Less Capacitance (~9-25%) , NO Body effect (floating body, Vbs>0)

– Same frequency but Lower VDD Lower power – Reduced Short Channel Effects (higher doping

concentrations)– No latch-up -> Layout simplicity (no wells, plugs, …) – Same scaling rules of Bulk

• Cons:– History-dependent timing (floating body)– Floating Body (Vsb) Reduced effective VT

(=F(Vsb)) higher off current, Ioff (OSS: on the other hand, we are decreasing VDD Ioff is the same than in Bulk..)

– Self heating (the channel is isolated from the bulk)

SOI : Applications & Future Trends

• High Performance processors (servers, Cell Processors, XBOX360..)

• Low-Power Devices (MPSoC)

• Wireless Technology (high-resistivity substrate less crosstalk)

• spacecraft, satellites and military electronics (less sensitive to alpha radiation)

TURNING SILICON ON ITS EDGE

--Overcoming Silicon scaling barriers with

double-gate and FinFET technology

by Edward J.Nowak, Ingo Aller, Thomas Ludwig, Keunwoo Kim,

Rajiv V.Joshi, Ching-Te Chuang, Kerry Bernstein, and Ruchir Puri

• Presenter: Yiwen Shi

Overcoming Obstacles by Doubling Up

Two dominant barriers

for further CMOS

scaling:• Subthreshold• Gate-dielectric

leakages

Double-gate (DG) FET

Reduce drain-induced-barrier lowering (DIBL) Improve subthreshold swing (S)

Lower threshold voltage for a given off-current Higher drive current at lower power-supply voltage

Centering of Double-gate Threshold Voltage• Body doping

• Asymmetric gate work function

• Symmetric mid-gap work- function gate-electrodes

Halo doping Two gate electrodes of differing work functions e.g. degenerately doped n+ & p+ polysilicon

Metal gates

e.g. nickel-silicide

Double-gate Taxonomy

Planar DG Vertical DG FinFET

Four significant obstacles

FinFET-DGCMOS Process Flow & Circuit Demonstration

• Demonstration of DGCMOS static operation: to prove the device

parametrics can all be centered to the practical values demanded for VLSI (W,L,T)

transient operation: to prove the numerous parasitic elements that can degrade circuit performance can be tamed (inverter delay)

• Achieve numerous landmarks

• May indeed prove manufacturable

A ring of 60 inverters with a single two-way NAND

Microprocessor Design with FinFETs

Device width quantization

Sidewall image transfer (SIT)(a converted six-transistor-SRAM cell)

Potential for double-gate applicationsa. Low-power designb. Variable threshold CMOSc. Simplified logic gates

Still a lot of challenges…Overall, promising!

1 reading assignment presentations for en0291 s40 “effect of increasing chip density on the...

clocked slide

yiwen slide

clock skew slide

distributed computing

brendan repeater scaling

normal scaling

cmos scaling

scaling challenge

Documents

s40 weapp guidelines

s40 local - august 2011

s40 local - april 2012

s40 local - december 2011

2010 volvo s40 ms

modeling service oriented architectures in a command and...

bitfusion nimbix dev summit heterogeneous architectures

caterpillar cat s40 datasheet

my07 volvo s40 - personalizeyourvolvo.com volvo s40... ·...

primergy s40-bs/-es storage subsystem -...

s40 von thonet

volvo s40 owner's...

skandix catalog: volvo s40 (2004-) v50 -...

volvo s40 v40 2003

s40/45 deutz/perkins 2wd & 4wd - genie...

s40 local - august 2012

function-blocks s40 e

2007 s40 us

s40 ts handset guidelines

2019 iacc - s40: grant writing 10/23/2019 · 2019. 11....