max - darpa | eri summit

Post on 18-Mar-2022

12 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

DISTRIBUTION STATEMENT A. Approved for public release

MAX

ASSISTANT PROFESSOR

SHULAKER

MIT

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

1001010101010101010101010110010100101010101010011010101010101001110100110001010101010110010100011100101001010101011000101110101010101010100110100101010101010101011010100110010101100101010101011010010110101010101010011111001111101110100100101110101011010101101010101010101010101110010010010101010101001010100011101010001001101010100110010101100001110101010101010101010001101001010101010100

Abundant-dataInternet of Everything

Smart Cities

Military Science

Finance

Security

Government

Genomics

Health Care

WORLD RELIES ON ELECTRONICS

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

1001010101010101010101010110010100101010101010011010101010101001110100110001010101010110010100011100101001010101011000101110101010101010100110100101010101010101011010100110010101100101010101011010010110101010101010011111001111101110100100101110101011010101101010101010101010101110010010010101010101001010100011101010001001101010100110010101100001110101010101010101010001101001010101010100

Abundant-dataInternet of Everything

Smart Cities

Military Science

Finance

Security

Government

Genomics

Health Care

WORLD RELIES ON ELECTRONICS

STOP

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

MANY CHALLENGES SIMULTANEOUSLY

Power Wall Memory WallExecution time

Compute

Memory access

Also:Communication wall, interconnect wall,

complexity wall, resilience wall…

10M

Hz

5G

Hz

1980 2010

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

HOW TO IMPROVE COMPUTING?sy

stem

inte

grat

ion

device performance

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

OPTION 1: BETTER DEVICESsy

stem

inte

grat

ion

device performance

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

OPTION 2: DESIGN TRICKSsy

stem

inte

grat

ion

device performance

Multi-cores

Power management

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

HOW TO IMPROVE COMPUTINGsy

stem

inte

grat

ion

device performance

Multi-cores

Power management

New innovations required!

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

COMPUTING TODAY

memory

compute

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

3DSOC: OUR GOAL

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

3DSOC

Compute Memory

Conventional BEOL nano-scale vias

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

3DSOC

Compute Memory

Conventional BEOL nano-scale vias

BUT…

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• Low-temperature fabrication: <400 °C• Major obstacle for silicon CMOS

REALIZING MONOLITHIC 3D

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• Low-temperature fabrication: <400 °C• Major obstacle for silicon CMOS

REALIZING MONOLITHIC 3D

Carbon Nanotubes Resistive RAM

<200 °C <200 °C

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• Combine device + architectural benefits

REALIZING MONOLITHIC 3D

+ +Emerging

LogicEmerging Memory

Monolithic 3D Integration

Naturally enabled

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

MAJOR BENEFITS

>50× energy efficiencyvs. state-of-the-art 7 nm Si-CMOS

for challenging abundant-data applications

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

CARBON NANOTUBE (CNT)

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

CNT FET (CNFET)

d~1nmCNT

high-kgate oxide

100 nm

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

CNFET CMOS

Metal gates

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

CNFET CMOS

High-k dielectric

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

CNFET CMOS

Low temperature

Solution CNT deposition

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

CNFET CMOS

Channel definition

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

CNFET CMOS

Source/ drain metal

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

Work function engineering

CNFET CMOS

Source/ drain metal

NMOS PMOS

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

Work function engineeringWork function engineering + electrostatic doping

CNFET CMOS

Encapsulation

NMOS PMOS

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

-1.8 -0.9 0 0.9 1.8

VDS

(V)

0

1

2

3

4

5

6

7

I D (

A)

PMOS NMOS

Work function engineeringWork function engineering + electrostatic doping

CNFET CMOS

NMOS PMOS

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

MAJOR BENEFITS

• Enables new monolithic 3D architectures• Unique low-temperature fabrication

• >10× energy efficiency • vs. silicon CMOS

• Future opportunities• Sub-3nm node experimental demo

energy efficiency: energy-delay product (EDP)

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

Robust doping; CNFET CMOS

materials processing design manufacturing

VIN

V OU

T

Si-compatible;>99.99% purity

Commercial facilities

Immune to metallic CNTs

CNFETS: FROM “LAB” TO “FAB”

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public releaseMIT, ISSCC ‘18

WORLD’S FIRST: CNFET CMOS MIXED-SIGNAL

SAR ADC (~100s CNFETs)

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

~100s CNFETs

MIT, VLSI ‘19light

ComputeLogicImagerPre-processing

ComputingHighly-processed information

WORLD’S FIRST: MONOLITHIC 3D IMAGER

Transforms Raw Data Information (~2800 CNFETs)

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public releaseMIT, VLSI ‘19

WORLD’S FIRST: CNFET CMOS SRAM

Kbit 6T SRAM (6144 CNFETs)

DISTRIBUTION STATEMENT A. Approved for public release

Monolithic 3D integration

Non-volatile

\

Multiple bits-per-cell

Low Resistance State

High ResistanceState

SetReset

RESISTIVE RAM (RRAM) TECHNOLOGY

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release[Le, IEEE TED 19; Wu, ISSCC 19] Collaborator: CEA-LETI

Bits-per-cell RRAM2 3 6

Cherry-picked cells

Full system

RR

AM

cel

ls

Existing methods ad hoc

First multiple bits-per-cell RRAM systemNew RRAM read & write algorithms key

NEW RESULT: MULTIPLE BITS-PER-CELL RRAM

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

Multiple bits-per-cell RRAM array

• Flash techniques don’t work for RRAM

• Correct read & write despite RRAM variations

Cel

l cou

nt

Measured resistance (kOhm)

3 bits-per-cell RRAM arrays with new algorithms

3 4 5 7 10 16 35

7 6 5 4 3 2 1 01500

1000

500

0

NEW RRAM READ & WRITE ALGORITHMS KEY

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

Non-volatile IoT edge microcontroller

Deep learningOptimized weight encoding

RRAM arrays multiple bits-per-cell

Cross-layer optimization

Bigger neural net on same hardware

2.3× improved inference accuracy

FIRST MULTIPLE BITS-PER-CELL RRAM SYSTEM

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

Without ENDURER

10-year lifetime with ENDURER(measured)

Mea

sure

d de

ep le

arni

ng

infe

renc

e ac

cura

cy

100%

50%

0%1 hour 1 day 1 month 10 years1 minute1 second

Non-volatile IoT edge microcontroller

System test setup

Continuous deep learning inference

ENDURER: 10-YEAR SYSTEM LIFETIME ACHIEVED

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

TECHNOLOGY TRANSFER: U.S. FOUNDRY

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

TECHNOLOGY TRANSFER: U.S. FOUNDRY

• 90 nm technology node• Enables relaxed technology requirements

• All process modules

• CNFET CMOS + RRAM + monolithic 3D

• Foundry MPW capabilities• End-to-end design infrastructure

MPW = multi-project wafer

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• 1st foundry integration• Beyond-Si nanotech

FOUNDRY CNFET CMOS

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

FOUNDRY CNFET CMOS

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• RRAM integration in back-end-of-line

• 2 bits-per-cell RRAM feasibility demonstrated

• Smart writes for yield boost demonstrated

FOUNDRY RRAM

Automated test setup Packaged DUTRRAM array

RRAM Cell

RRAM cell

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

FOUNDRY DESIGN INFRASTRUCTURE

Models + Calibration

PMOS

I D I DVDS VDS

experimentalmodel

NMOSexperimentalmodel

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

FOUNDRY DESIGN INFRASTRUCTURE

Models + Calibration Process Design Kit

rr1_met1rr1_via

rr1_cell

rr1_viarr1_met2

rr1_via2rr2_met1

rr2_via

rr2_cell

rr2_viarr2_met2

rr2_via2cnt1_gate

cnt1_activecnt1_sd

cnt1_pp

cnt1_met1cnt1_via

cnt1_met2cnt1_via2

met1via

met2via2

met3via3

met4via4

met5

!(cnt1_goxcut)

cnt1_sdcnt1_sdp

rr2_via2cnt1_gate

rr2_via2cnt1_gate

cnt1_activecnt1_sd

!(cnt1_goxcut)

cnt1_sd!(cnt1_dopecut)!(cnt1_dopecut)

cnt1_con

!(cnt1_dopecut)

cnt1_pp1cnt1_pp !(cnt1_dopecut)

!cnt1_dopecut

cnt2_gate

cnt2_activecnt2_sd

cnt2_pp

!(cnt2_goxcut)

cnt2_sdcnt2_sd

cnt2_gate cnt2_gate

cnt2_activecnt2_sd

!(cnt2_goxcut)

cnt2_sd!(cnt2_dopecut)!(cnt2_dopecut)

cnt2_con

!(cnt2_dopecut)

cnt2_ppcnt2_pp !(cnt2_dopecut)

!cnt2_dopecut

cnt1_via2 cnt1_via2

• Commercial-grade process design kit:• 3D Design rule checker (DRC)• 3D Layout-versus-schematic (LVS)• 3D Parasitic extraction (PEX)• Electromigration + voltage drop (EMIR)• 3D Thermal analysis• 3D Place-and-route (P&R)

• Monolithic 3D standard cell libraries• >1000 standard cells• Experimentally validated

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

FOUNDRY DESIGN INFRASTRUCTURE

Models + Calibration Process Design Kit Design Flow

Logic synthesisClock tree generation

Metal + via fillOptical proximity correction

Mask generation

Industry standard design tools

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• Layer 1: 90 nm node CNFET CMOS logic• compute

3DSOC: FABRICATION

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• Conventional interconnects• dense nano-scale vias

3DSOC: FABRICATION

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• Layer 2: CNFET CMOS logic• compute

3DSOC: FABRICATION

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• Conventional interconnects• dense nano-scale vias

3DSOC: FABRICATION

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• Layer 3: CNFET CMOS logic• memory access

3DSOC: FABRICATION

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• Conventional interconnects• dense nano-scale vias

3DSOC: FABRICATION

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

• Layer 4+: >4GB RRAM on-chip memory

3DSOC: FABRICATION

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

FOUNDRY MONOLITHIC 3D

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

3DSOC TEAM

DISTRIBUTION STATEMENT A. Approved for public releaseDISTRIBUTION STATEMENT A. Approved for public release

3DSOC: TRANSFORMING IDEAS TO REALITY

top related