high density asynchronous lut based on non...

19
High Density Asynchronous LUT Based on Non-Volatile MRAM Technology Sumanta Chaudhuri, Weisheng Zhao, Jacques-Oliver Klein , Claude Chappert, Institut dʼElectronique Fondamentale CNRS UMR8622 Université Paris-Sud 11 France ST-Microelectronics Pascale Mazoyer 20th International Conference on Field Programmable Logic and Applications Milano, ITALY, Aug. 31st - Sep. 2nd, 2010 1

Upload: truongthu

Post on 06-Mar-2019

255 views

Category:

Documents


0 download

TRANSCRIPT

High Density Asynchronous LUT

Based on Non-Volatile MRAM Technology

Sumanta Chaudhuri, Weisheng Zhao,Jacques-Oliver Klein, Claude Chappert,

Institut dʼElectronique FondamentaleCNRS UMR8622

Université Paris-Sud 11France

ST-Microelectronics

Pascale Mazoyer

20th International Conferenceon Field Programmable Logic and ApplicationsMilano, ITALY, Aug. 31st - Sep. 2nd, 2010

1

Outline

Introduction and motivation

‣ Asynchronous FPGA‣ Magnetic RAM (MRAM)‣ MRAM based synchronous FPGA

MRAM based Asynchronous LUT

‣ Operation and Configuration‣ Hybrid Simulation

Conclusion

2

Why Asynchronous FPGA ?

R. Manohar, HPEC 2009 Asynchronous FPGA

• Higher throughput• Lower power • (Smaller die area)

SRAM based synchronous FPGA

Look Up Table

SRAM

Input

Flip-Flop

Output

Configurable logic block (CLB)

Clk

3

Why Magnetic RAM (MRAM)?

Leakage issue : Configuration memory and FF !

SRAM based synchronous FPGA

Conventional CMOS technology

Non-volatile memory is required

Look Up Table

SRAM

Input

Flip-Flop

Output

Configurable logic block (CLB)

Clk

4

Zerostandbypower

PowerON Power

OFFPowerOFF

ON ON

W/O loss of data !

Why Magnetic RAM (MRAM)?

‣ High Write/Read Access speed‣ High density and 3D integration‣ Infinite endurance‣ Low Write/Read power ‣ Radiation hardness‣……

Storage based on the spin property of electron

1st Commercial MRAM product(2006) ‏

2009‏

2008‏

Thermally Assisted Switching (TAS) Approach: 2011

Spin Transfer Torque (STT) Approach: 2012

Field induced Magnetic Switching (FIMS):

High switching power and errors!!5

MRAM based Synchronous FPGA

MRAM based synchronous FPGAWS. ZHAO et al., FPT, 2007

WS. ZHAO et al., ACM TECS, 2009

Y. Guillemenet, et al., FPL, 2008

Configurable logic block (CLB)

Look Up Table

MRAM

Input

MagneticFlip-Flop

OutputClk

6SENSEN

Q

Qbar

Q

Qbar

MRAM based Synchronous FPGA

• Zero standby power• Dynamic Reconfiguration • Multi-context configuration• instant on/off• Radiation hardness•……

3D integration: high density and high access speed

High dynamic power, in particular for the Magnetic Flip-Flop

WS. ZHAO et al., IEEE-ICSICT, 2006

N. SAKIMURA et al., IEEE-CICC, 20087

Need to be reduced ...

MRAM based Asynchronous FPGA

•Higher throughput•Lower power

Lower dynamic powerZero standby power (MRAM)

•Smaller die area or high densityWithout global clockWithout external memory (MRAM)

•Dynamic configuration and multi-context configuration (MRAM) •Instant on/off capability (MRAM)•Radiation hardness (MRAM)

What’s the circuits, architectures, control protocol … ?

8

Outline

Introduction and motivation

‣ Asynchronous FPGA‣ Magnetic RAM (MRAM)‣ MRAM based synchronous FPGA

MRAM based Asynchronous LUT

‣ Operation and Configuration‣ Hybrid Simulation

Conclusion

9

Pre-Charge Sense Amplifier(PCSA) for MRAM:

WS. ZHAO et al., IEEE TMAG, 2009

Sensing delay< 200ps

• High sensing speed• High reliability • Easy to realize multi-context • Small die area• ……

10

WS. ZHAO et al., FPT, 2007

WS. ZHAO et al., ACM TECS, 2009

Y. Guillemenet, et al., FPL, 2008

Q

Qbar

Q

PCSA based Magnetic LUT (MLUT):

PCSA based Magnetic LUT (MLUT)

Decoder

PCSAInput

Output

Example of 1-input MLUT

Less transistors than 6-T SRAM LUT

11

Asynchronous Configurable Logic Block (CLB)

PCSA MRAM circuits and Pre-Charge Principle based protocol 12

A0A1

A0A1

"0" + req

"1" + req

ACK

D0D1

inputs valid

evaluate /precharge

(Re)configuration of asynchronous MRAM LUT:

Thermally Assisted Switching (TAS) Approach

JP. Nozieres et al., US 7411817, 2006

Configuration circuits

The drivers can be shared globally

13

Multi-context of asynchronous MRAM LUT:

Multi-context can be easily implemented and only one selection transistor is used for one bit in each context

14

CMOS 130nm (STMicro)MRAM 120nm

M. Elbaraji et al., JAP, 2010

Whole configuration delay is about 25ns

Hybrid MRAM/CMOS Simulations: MALUT Configuration

15

Hybrid MRAM/CMOS Simulations: MALUT Operation

The 2-input asynchronous CLB has been configured with a bit-stream “1010”.

CMOS 130nm (STMicro)MRAM 120nm

Operation speed: ~1GHz

M. Elbaraji et al., JAP, 2010

16

Prototype Development: layout examples

Hybrid design kitCMOS 130nm (STMicro)

MRAM 120nm

Layout of a 4-input MALUTWith 8 contexts

17

Conclusion

• Higher throughput (~1GHz)• Lower power• Smaller die area or high density• Dynamic configuration and multi-context configuration • Instant on/off capability • Radiation hardness

1st MRAM based Asynchronous LUT

Circuits: Pre-Charge Sense Amplifier

Architectures: Pre-Charge Asynchronous protocol based

Simulations: TAS-MRAM + CMOS 130nm and 65nm

Prototypes: 8 contexts asynchronous LUT (130nm technology) Innovative routing architecture

18

Thank you for your attention !

20th International Conferenceon Field Programmable Logic and ApplicationsMilano, ITALY, Aug. 31st - Sep. 2nd, 2010

19