2019 ucla chips report 22019 uc claa ch hiippss ps rreep€¦ · 2019 ucla chips report subramanian...

31
1 2019 UCLA CHIPS Report Subramanian Iyer [email protected] 2 2019 UC CL 2019 UC CL S S Su b b A A CH HI IP PS S am m ma HI PS R Re ep

Upload: others

Post on 05-Aug-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

1

2019 UCLA CHIPS Report

Subramanian Iyer

[email protected]

22019 UCCL2019 UCCL

SSSubb

AA CHHIIPPSS

ammma

HIPS RReep

Page 2: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

UCLA CHIPS

2

A UCLA Led partnership to develop Applications, Enablement and Core technologies and the eco-system required for continuing Moore’s Law at the Package and System Integration levels and develop our students & scholars to lead this effort

Simplify hardware development through novel architectures, integration methods, technologies, and devices.

hardware develontegration methegration metth

fy hy hardware develd d l

this effort

ifint

nd dentinuing velop oup

ment andMoore’s L

studennd dentinuing velop oup

ent andMoore’s

studen &

Page 3: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

What we do @UCLA CHIPS

3

Large Scale Energy Efficient Systems

Medical Engineering applications

Advanced Packaging Technologies Novel Compute architectures

Silicon as a heterogeneous fine pitch packaging Platform, Si IF

FlexTrate as a flexible Biocompatible Heterogeneous

Integration Platform

The CTT as an in-memory compute

device

ed Packaging Techno

ae pitch Bpppitch B

a

olackaging TechnP k Tnced Packaging Technolo

s ne p

Si IF

gies

Mediacal Engine

plication

gies

Mediacal Engine

plication

NNoovvelveel CCoommpp

Page 4: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Some observations• If Moore’s law has enabled

miniaturization, why have chips gotten larger ?– More complex problems– More cores – More cache memory

• Main memory capacity and access limits performance

• I/Os take up more space and power as system size increases

• Power density and thermal challenges limit performance

Eg. IBM Z14 cpu and cache are each ~700 mm2

with 6.8B Xtors (2018) and a separate cache die

Intel Pentium cpu ~300mm2 -3.1 Million Xtors (1993)

memoryory capacity an

ormancemore space an

ystem size increty and ther

fy and ther

stem size incup ystem size inp more space anp more space an

oormamancery capacity an

remomhe memory capacity andrfousysity

it per

access access

dEggw

gwit

. Ith

BMM Z114

Page 5: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

The “Early” Origins of Wafer Scale Integration

Gene Amdahl with Michael Current (2008)

Bumped 100 mm wafer (Ca 1982)

Trilogy

ichael Current (200hael Current (2008)Mic

Page 6: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

The public talk at the time (and echoed in the (generally ill-informed) modern blogosphere) was that the motivation was processing speed. Going on about reduced interconnect line length, etc.

But the reality was that computer system speed was dominated by architecture and memory access (not much change in that in

The real advantages were in "board" physical size and the mechanical ruggedness of the assembled wafer package.

One could put the logic of an entire PDP-11 computer on a single wafer and DEC (one of the supporting consortium) was pleading with Amdahl to build some modules for sale to the US Air Force for airborne radar processing (bombers have lots of electrical power on board). But Gene kept after the goal of building an IBM-scale mainframe (a la his earlier Amdahl Corp that Fujitsu took over). And we lacked the scale to compete with IBM out of the box. So it goes.

(- Michael Current via email)

cal rugged

of an entire PDP-11 cng with Amdahl to buers have lots of e

nframe (a la hfrs have lots of el

ng with Amdahl to ic ofi h t

entire PDentire PDPc -en PDP cn P

nical ruggee real adv

anical ruggedn

ogadmbe

ainfraf th

ages ess of t

wethe

ot ere ine ass

mn "bse

eed wmuch ch

boar

was domange inphages

eswethe

ot ere ine ass

mn "b

e

eed wuch chboar

was domange in

h

m

mmbleddpphhyysicccaal

n

dd wwaffe

Page 7: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS
Page 8: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Fast Forward to todayCerebras has taken WSI to a new scale

And addressed a significant number of problems

This is absolutely the right first Step

But at its core the system is still homogeneous and probably memory starved

TTTT

adroblem

i

drms

sedd a signific

to a ne

ant nuroblem

i

drms

edd a signific

to a ne

nt nu

s abssolut

ummbe

luuteelyy thh

Page 9: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Wafer Scale Integration @UCLA CHIPS

Lets revisit this concept but with a different approach

Page 10: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Chiplets or Dielets ?A die, in the context of integratedcircuits, is a small block of semiconducting material on which a given functional circuit is fabricated. We usually think of dies as bare. Dies are diced out of a wafer

A chip is more or less the same thing but connotes the design aspects rather than the physical attributes. We usually think of chips as packaged.

We will use these terms interchangeably though dielet is more correct

usually thinare diced out of

We will use though dieh

tWe will use t

re diced out ousually th

al circuitWe usually thin

s are diced out of

Wthoug

of da

thethan

t

e dn t

e thing besign ahe

re or lebut conn

pects

of da

thethan

e dn t

e thing besign ahe

re or lebut conn

pects

oofftrribbuttees

pphhyysiccaalrraath

ooff chhippsess. WWee u

all

Page 11: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Three Important Questions• What is the optimal pitch at which dies should be

interconnected ?

• What is the optimal dielet size

• How close should we assemble dies

Hint: how do we make a SOW look like an ginonormous SOC

e optimal diel

should we asse

we ma

e s ee should we assed

p optimal didthe optimal diele

os

do we

t size

hould bbe

t size

hould bbe

mmbb di

Page 12: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

The Dielet Golden Regime

Mechanicalconstraints

Electrical/logical constraints

Die handling constraint

Die yielding constraint

SerDes-like

CMOS wire-like

Packaging-likeSoC-like

Optimal pitch2 to 10 μm

Optimal dielet size

1 to 100 mm2

500 μmBGA/LGA

50 nmGate pitch

Interconnect pitch

g

PackaginC like

2 to 10 μm

Optimal dielet size

1 to 100 mm2

IP reuseI/O complexity/powerDielet/chiplet size (# of circuits)Testing complexity

Iyer, Jangam & Vaisband, IBM J R&D 2019

ekekC-l

100 mm2

E

100 mm2

E

SeerD

Page 13: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

What is the optimal I/O pitch ?

Chip Area (mm2)

Transistor count (x109)

Technology node (nm)

IBM POWER9 [26] 695 814

AMD Zen [27] 44 1.4IBM POWER8 [28] 649 4.2

22Intel Xeon Haswell E5 [29] 663 5.56IBM POWER7 + 80 MB [30] 567 2.1

32Intel Itanium Poulson [31] 544 3.1IBM POWER7 + 32 MB [32] 567 1.9

45Intel Xeon 7400 [33] 503 1.9

Actual pitches ~100s of m

Iyer, Jangam & Vaisband, IBM J R&D 2019

69544 1.4

649 4.25 [29] 663 5.56

30] 567 2.1[31] 544 3.1[32] 567 1.9

503.92 52

4MB [3n 35B

6B

55 9]6

1.1695

4

l E0 Mson

MB

22

Actual pitches ~1100s of m

22

A tual pitches ~100s of m

Page 14: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

• Multiple packaging levels• Disparate materials (Si, Cu, FR4,

Molding compound)• Limited heat sinking• Solder based interconnects• Underfill needed • Die-to-die connections limited by

BGA pitch

• Single package level• Mainly three materials (Si, Cu,

Oxide)• Excellent heat sinking• Metal-metal interconnects• No underfill• Allows for Wafer Scale Integration• Interconnect pitch: 2-10 μm

PCBs Si-IFInterposer vs.

• Additional level in packaging hierarchy

• All PCB limitations still exist especially thermo-mechanicals

• Limited in size: Sub-system integration

• Requires fine TSVs that add cost• Interconnect pitch: 50 μm

Fine pitch 2-

lsSi, Cu, FR4,

nects

ed by

• Additiohierarc

• All PespeciaLimitedinteg

• ReRegi

n st imespe a

chPCB

aAll PCB

Cu, FR4 ehdleve

s (d)gcon

mited by

onhyal level in paccka

ni e pitch 2-

al level in paccka

Fi e pi ch 2

y thhetaermatioonss st

g• SSingl

mestech

tillh

eexiist

• MMaiglee paacccka

Page 15: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

We do this by Thermal Compression Bonding

Formic Acid

Vapor

Si-Interconnect Fabric (IF)

Heated chuck: 100 C

Dielet (Si)Bond head: 100 CTouch Down pressure: <1 MPa

Dielet (Si)Bond head: 380 C

Bond pressure: 250 MPa

Dielet (Si)Bond head: 380 C

TouchchcBBBBoo whhhhDonnnn p 2ss1111pradadadad MP<<CC0rer0000 MPa

Lift Up

Di l (Sil

Dielet (Si)Bond head:380 C

Step 3

~250oC

Step 1Step 2Dies assembled on the Si-IF

Shear strength 160 MPa

FormiAci

V

ct Fabric (IF)

100 C

t (Si)Si)

)

~250oC

000

bFae F0o~

daporr

rmS c

ne

k: 1

let

t (Si)

Boo

Page 16: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

The eco system

• The Silicon IF is a versatile platform for heterogeneous integration, but several issues remain– Testing, Repair and Reliability– Communication on and off the IF– Clock distribution– Power delivery and heat extraction– Dielet supply-chain and ecosystem

pair and Reliication on and o

stributionelivery and heat epply-chain any ay ay nandelivery a

istdelivery and hehd

onstribution

tcation on and pair and Rel

ut seveRepair and Relia

unication on and ofddupp

es relitythe IF

emfor

ainr heterogeneous

es reitythe IF

emfoin

r heterogeneous

Page 17: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

A 50KW High Performance Wafer Scale System on Si IF

Connector collar

Populated Si-IF (Si and III-V dies)

Photonic interconnect

Si

quartz

Cu ground plane

qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqquuuuuuuuuuuuuuuuuuuuaaaaaaaaaaaaaaaaaaaaaaaaaaaaarrrrrrrrr zz

antenna

qqquuuuuuuuuuuuuuuaaaaaaaaaaaaaaaaaaaaaaarrrrrrrrrrrrrrrrrrrtttttttzzzzz

antennnnn RF interconnect

Reliability

PowerTherm AssemblyPower board

PowerTherm two phase cooling chamber (up to 50 KW)

TWVs to deliver power to front

USB

hhhhhhhhhh bblDeep Trench decoupling

pulated Si-IF (Si a ies)

R li

u round plane

qqqqqqqqqqqqqqqqqqqqqq aarrrrrrrrrrrrrrrrttzz

a

rrrrrrrttttzzz

ies)

R l

Cu round plane

qqqqqqqq rrrrrrrrrrrrtttttttttttttttttzztttttttttzzzzz

y

Page 18: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

A Modest Graph Processor implementationDesign Details

Compute dielet area 6 mm2 (mostly memory)

Memory Dielet area 2 mm2

Power per tile 0.5W

tiles per 100mm wafer 625# of ARM cores 8750Memory BW Network BW 9 TB/s 8TB/s

To be taped out in TSMC 40nm Technology in January 2020

To be Assembled at UCLA using the SI IF at 10 m die to wafer pitch Cu pillars

P

tile# of

Meowe

es pe

Dieler per tile

m

et a

rea

e

6 mm (momemory2 mm2

ile# o

Meowe

s pe

Dieer per tile

et a

rea

ea

6 mm2 (momemory)2 mm

MMeemMM cocores

waafeer

00.55WW

bemmory BBWW

62255

Page 19: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

The “Ultimate” System on Si IF

3D stacked combination of memories (DRAM, emerging etc.)

With additional High performance compute in the bottom layer

Memory capacity >12 TB

Intranode BW >1TB/sInternode BW > 7.5 PB/s

PowerTherm cooledScalable to stacked system

W

Mem

Withye

mo

ed c

additiona

omb

l Hig

ination of me

gh p

ories (DRAM

W

Mem

Withyer

ed c

additiona

omb

l Hig

ination of me

h p

ries (DRAM

Int

y ccaappaccityty

cee ccoommpuute

tterannodde BW

y >122 TTB

Page 20: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Gen 2 FHE* Packaging for Medical ElectronicsToday

Source: Uniqarta

Printed electronics & wiring, ultra-thin die, organic thin film semiconductors - at best bendable

FlexTratetm @ UCLA

* Flexible Hybrid Electronics

• Fan-Out Wafer Level Packaging with ultralow die shift• Dielet concept on flexible molding compound (PDMS) – bicycle chain principle• Corrugated lithographically defined high performance interconnects• Thermally conductive PDMS using metallic nanowires • Biocompatible • Transparent

wiring, ultra-

dable

• Fan Ou• Diele• C••

Cong, ultu - ier

FF O& w

lmendable

UCLACL

Page 21: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Applications

Wireless Power Transfer

Segmented flexible displays

The picture can't be displayed.

Wireless Surface electromyography

Flexible LED displays fabricated using Laser Lift Off mass Transfer

Applicatnted

ionsble ddisplays

Applicatted

ionsble disp ays

Page 22: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Background of CTT

22

• Novel multi-time-programmable non-volatile memory element for HKMG CMOS technologies

• CTTs are as-fabricated CMOS logic devices operated under enhanced charge trapping mode

• Intrinsic device self-heating enhanced charge trapping in HKMG

• Use as an analog memory element for unsupervised learning and in memory compute

CTTs have been demonstrated as digital MTPM in multiple nodes - And is being productized

der enhancedode

self-heatinggrapping in

memory elemeemory elem

se tr p n npe np is ee

dder enhance

ed CMOS log under enhanced

g m

erg

merning a

Page 23: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Use of CTT as Analog Memory Device

VD–+

t

VG

BOXOXBOX

tVG

BOX

Source

Source

Drain

Drain

IDVG

IDVGGOFF

IFL + HfO2

IFL + HfO2

Gate

Gate

GOFF

Reducing threshold voltage:

Increasing threshold voltage:

0 0.2 0.4 0.6 0.8 1

Pre-PRG

VG (V)

pA

nA

μA

I D@

VD

= 50

mV

The CTT can be programmed and erased

0 0.2 0.4 0.6 0.8 1

Post-PRG250mV

1000

x ch

ange

0 0.2 0.4 0.6 0.8 1

Post-ERS

Pre-PRGPost-PRG250mV

1000

x ch

ange

23

BOX

t

Drain

IDVGGOFFFF

+ HfOHfO2

HfOO22

Gate

threshold voltage: BOXBOX

HGaGG

FFt VV

threshold voltageBB gXX olO

Hf VDDDD–

FL +FL

FL + Hate

g tnA

μ

@ V

@@DDD

0 m

V A GGPRGGPPPRG

25025000225022 0

st-E

GG

mm

RS

PP

med and

nA

μ

@ V

@0

mV A RGRGPRRRGRGPRPRPRGR

2502500252522250222255555500

t E

G

mVmVVVV

RS

med and

I DDDD

xchchchch

RRRRRGGRGRGRGGRGGG

000

000

000

00000000 000000000

Page 24: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

NeuroCTT V1 (in GF 22FDX)

Test macros:Devices, arrays, WL driver, neuron, etc.

OUTPUTBUFFERS

NEURON

INPUTBUFFERS

DIGITAL BLOCK

WL DRIVER& ARRAY

DECAPS

WL Driver Macro

ESD Macro

Device macroArray macro(428nm)

Neuron Macro

Array macro(170nm)

• An inference engine • with 1024 x 20 CTTs

• Fabricated in GF 22FDX• Die size: 2.5x2 mm2

• Packaging: Wire-bond• Lots of test macros

, WL driver, neuron, e

WL DRIV& ARRAY

Neuron Mac

Array(

ds

ra

, WL driver, neiu ouronN

A AY& AWnd D

:ys,

BL

VEm2

-boacro

R

))

Page 25: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Array Inference Engine Macro Results (V1.0)CTT array Vector-Matrix Multiplication VMM

Room Temperature stability(< 6% over 10 years) Schematic of inference array VMM

(VRoom Tem(< 6% over VMM

(Room Tem(< 6% over

Page 26: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Benchmarking NeuroCTTsCTT is very competitive for neural network apps.

– CMOS-compatible and ready at advanced node– Third terminal (access switch) is free– Acceptable variation (no dead cells like in RRAM [6])– Energy and area efficient both for the device and the periphery – Accurate programming (including retention)

26

Device Benchmark Using NeuroSim [1] (with Prof. Shimeng Yu @GaTech)PCM [2] STT-MRAM [3] RRAM Twin-CTT cell CTT vs. PCM

Tech.Node 32nm 32nm 32nm 22nm N/ACell bit 4bit 1bit 4bit 4bit 1x

RON 40K 14.8K 6K 41.67K ~1xChip area (mm2) 16.26 76.13 28.11 8.11 0.499x

Latency (ms) 2.75 8.42 16.77 1.63 0.593xEnergy efficiency (TOPS/W) 9.92 1.347 2.79 17.96 1.81x

Throughput (FPS) 363.02 118.65 59.61 612.87 1.69x

[1] Chen, Pai-Yu, Xiaochen Peng, and Shimeng Yu. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 37.12 (2018): 3067-3080. [2]. G. W. Burr et al., 2014 IEDM, pp. 29.5.1-29.5.4. [3]. Y. Kim et al., "Integration of 28nm MJT for 8 16Gb level MRAM with full investigation of thermal stability," 2011 Symposium on VLSI Technology - Digest of Technical Papers, Honolulu, HI, 2011, pp. 210-211. [4] G. Burr, et al., IEEE Journal on Emerging and Selected Topics in Circuits and Systems 2016 [5] X. Guo, et al., IEDM 2017[6] X. Zheng, et al., IEDM 2018

CTT/PCM[4]

RRAM[6]

Flash [5]

Accuracy of ResNet Using Analog Devices

t both fng (including rete

sing NeuroSim [1] (with [2] STT RAM [3] RRAM

2nm 32nm 32nm4bit 1bit 4bi

0K 14.8K6 76.13

8 4136

6b 44b

PCM

3 nm3

ProMMRAC

s 1]euin roN [i

gg (including reficient both for t

amming (including reten

k U f.

406.26

ce and tho

])e peeriphery

CTT/PC

y of esNet

ce and thon)

)e peeriphery

CTT/PC

y of ResNet

Twiin-CTTT

ngg Yu @@GaT

Fla

]

CTTT

Teecch))

sh

Page 27: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Unsupervised Learning Using CTT• Proof-of-concept network simulated using CTT characteristics to

cluster stylized letters z, v, n, and their noisy versions

• Perfect clustering achieved after on average 24 presentations

27

X. Gu and S. S. Iyer, IEEE EDL, 2017

hieved after on averahieved after on avg achieved after on averag 2424

Page 28: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Experimental Platform for Demonstration

Standalone CTT array (10WL X 8BL)Probe card to access half of the array at onceLargest hardware demonstration of WTA to our knowledge

tandalProbLargest ha

oneard t

ardw

CTT arto acces

rrays h

(10Walf

BLtandalProbLargest ha

onerd tardw

CTT ato acces

rrays h

(10Walf

BL

e ddeemoonnsttraf tthee aarra

)

strratatioonn oof Wayy at

Page 29: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Four years and going strong……..• We have developed of three core technologies (Si IF FlexTratetm and CTT) to

acceptable usability levels• They promise to significantly impact the packaging, medical electronics and in-

memory analog computing areas• Our focus going forward is to

– move these technologies into manufacturing entities with our consortium partners and helping them leverage these technologies

– Focus on further technology innovations such as PowerTherm, RF and Photonic interconnects, adaptive patterning, high resolution displays …..

– Leverage our technologies in novel architectural applications

• Turn out the best and most prepared students to continue in Industry and academia

rward is thnologies into m

leverage these techurther technology inncts, adaptive pattern

technologies in nove

est and most pst and most prd

ctur t gur technologies in nove

novningcts, adaptive patterningd

rther technology innoieverage these tecc nologies into

g forward is toe technologies into ma

hem leverage these technfurther technology innova

nects, adaptive patterning,o

bes

ufaologies

ons su

ent

ing

w

g, m

th

medical ele

and CT

ctronics a

ufaologies

ons su

ent

ing

w

g, m

h

medical ele

and CT

ctronics a

arigrchgh h

rresesoh

oluasts PPoow

ouur r ccoonsso

uruttion d

weiserrThheermm

soortiuumm

Page 30: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Acknowledgements

https://chips.ucla.edu/students

https://www.chips.ucla.edu/faculty

https://www.chips.ucla.edu/alumnia.edu/alumni/ ula.edu/edu/alumni/

/

s.uc

Page 31: 2019 UCLA CHIPS Report 22019 UC CLAA CH HIIPPSS PS RReep€¦ · 2019 UCLA CHIPS Report Subramanian Iyer s.s.iyer@ucla.edu 22019 UC CL SSub AA CH HIIPPSS ammma PS RReep. UCLA CHIPS

Our Government and Industrial Sponsors