risc-v meets 22fdx: an open source ultra-low power ... · benchmark rf performance (>400ghz),...

15
2 Integrated Systems Laboratory 1 Department of Electrical, Electronic and Information Engineering RISC-V Meets 22FDX: an Open Source Ultra-low Power Microcontroller Platform for Advanced FDSOI Techonologies 09.05.2018 Pasquale Davide Schiavone 1 , Sanjay Charagulla 3 , Gerd Teepe 3 , Luca Benini 1,2 and the PULP team 1 Integrated System laboratory, ETH, Zurich, Switzerland 2 Energy Efficient Embedded Systems Laboratory, University Of Bologna, Bologna, Italy 3 GLOBALFOUNDRIES

Upload: others

Post on 03-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

2Integrated Systems Laboratory

1Department of Electrical, Electronic

and Information Engineering

RISC-V Meets 22FDX: an Open Source Ultra-low Power

Microcontroller Platform for Advanced FDSOI Techonologies

09.05.2018

Pasquale Davide Schiavone1, Sanjay Charagulla3, Gerd Teepe3, Luca Benini1,2 and the PULP team 1Integrated System laboratory, ETH, Zurich, Switzerland 2Energy Efficient Embedded Systems Laboratory, University Of Bologna, Bologna, Italy 3GLOBALFOUNDRIES

Page 2: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

| |

Near Sensor (aka Edge) Processing

Davide Schiavone – ETH Zurich 2

1 ÷ 3 GOPS

1 ÷ 30 mW

Idle: ~1µW

Active: ~ 50mW 100 uW ÷ ~10 mW

Smart Architecture

Parallel Processing

Power-saving Design

Near-Threshold

Low Power Technology

Page 3: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

| |

PULP is Open-Source & Free

3

1 February 2016 First release of PULPino, our single-core

microcontroller

2

3

5

May 2016 Toolchain and compiler for our RISC-V

implementation (RI5CY), DSP extensions

August 2017 PULPino updates, new cores Zero-riscy and Micro-

riscy, FPU, toolchain updates

Begin February 2018 PULPissimo

ARIANE: 64-bit RISC-V core

End February 2018 OPEN-PULP

1

2

4

September 2017 Porting of ARM CMSIS to PULPino

https://github.com/misaleh/CMSIS-DSP-PULPino

June 2017 Porting of Verilator and BEEBS benchmarks to

PULPino

https://github.com/embecosm/ri5cy

Releases Community Contributions

4

November 2017 Numerous Bug fixes to RiscV in PULPino

https://github.com/pulp-platform/riscv

3

December 2017 STING: Open-Source Verification Environment for

PULPino

http://valtrix.in/programming/running-sting-on-pulpino

Davide Schiavone – ETH Zurich

Download PULP @ https://github.com/pulp-platform

Page 4: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

| |

Quentin: GF22FDX PULPissimo Implementation

4

• RISC-V based advanced microcontroller – 512kB of L2 Memory

– 16kB of energy efficient latch-based memory (L2 SCM BANK)

• Rich set of peripherals: – QSPI (up to 280 Mbps)

– HyperRam + HyperFlash (up to 100 MB/s)

– Camera Interface (up to 320x240@60fps)

– I2C, I2S (up to 4 digital microphones)

– JTAG (Debug), GPIOs,

– Interrupt controller, Bootup ROM

• Autonomous IO DMA Subsystem (µDMA)

• Power management – 2 low-power FLLs (IO, SoC, …)

Davide Schiavone – ETH Zurich

Page 5: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

| |

Efficient I/O subsystem: the µDMA

5

0

1000

2000

3000

4000

5000

6000

7000

8000

9000

0 1000 2000 3000 4000 5000 6000

Ban

dw

idth

Lim

it [

(Mb

it/s

]

Buffer Size [bytes] (a)

50Mhz

100Mhz

150Mhz

0

500

1000

1500

2000

2500

3000

1RX 2RX 3RX 1RX+1TX 2RX+2TX 3RX+3TX

• Transfers bandwidth close to the physical limit of the architecture

• A single channel can saturate the memory port (can work at full BW)

• Bandwidth is not affected by peripherals buffer size

Architectural limit for

single channel

@50Mhz(1.6Gbit/s)

Architectural limit for

dual channel

@50Mhz(3.2Gbit/s)

Davide Schiavone – ETH Zurich

Page 6: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

| |

Efficient Interconnect

6

D

Memory

Cut

Memory Bank

Memory

Cut Memory

Cut

Memory

Cut

Memory Bank

Memory

Cut Memory

Cut

Memory

Cut

Memory Bank

Memory

Cut Memory

Cut

RX

Channels

TX

Channels Peripheral

Peripheral

Peripheral uDMA Subsystem CPU Subsystem APB Subsystem

APB Bridge

L2 multiport w/interleaving support

PW

M

CL

K

SO

C

Davide Schiavone – ETH Zurich

Low Latecy interconnect

SCM Bank

RegFile

I

… Interleaving to reduce

contention: 4 Banks

2 Banks non interleaved

for private data and code

1 SCM Bank for NTC

Page 7: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

| |

RI5CY Processor

7

• 4-stage pipeline

– RV32IMFCXpulp

– 70K GF22 nand2 equivalent

gate (GE) + 30KGE for FPU

– Coremark/MHz 3.19

• Includes various extensions

– pSIMD

– Fixed point

– Bit manipulations

– HW loops

• Floating Point Unit: – IEEE 754 single precision

• Iterative DIV/SQRT (7 cycles)

• Pipeline MAC, MUL, ADD, SUB, Cast

• Single cycle load, store, min, max, cmp etc

Davide Schiavone – ETH Zurich

Thanks for all the external users bugs report!

Page 8: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

| |

Implementation Results

8

Quentin SoC layout

Floorplan Area: 2,31 mm2

Effective Area: 1,22 mm2

Area Breakdown

Peak Density: 90%

Davide Schiavone – ETH Zurich

Page 9: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

| |

Estimated Energy Efficiency

9

Total Power @ 350 MHz, 0.65V = 8,5 mW

Leakage Power @ 0.65V = 1,2 mW

Energy Efficiency:

40 MOPS/mW @ 350 MOPS

TT, 25°C SS, -40°C, ocv

1.4x better performance (350MHz vs. 250 MHz)

4x better energy efficiency (40 vs. 14 MOPS/mW)

Than our previous design in 40nm technology

Davide Schiavone – ETH Zurich

Page 10: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

| |

Execution on Energy-Efficient SCM

10

Energy Efficiency:

65 MOPS/mW @ 350 MOPS

TT, 25°C SS, -40°C, ocv

Total Power @ 350 MHz, 0.65V = 5,5 mW

1.5x better energy efficiency than execution from SRAM

Davide Schiavone – ETH Zurich

Page 11: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

Up to 10M Wafers / Year 200mm equivalents

TECHNOLOGY NODES

CAPACITY (WAFERS / MONTH)

GF – Global Presence in Semiconductor Manufacturing

11 © 2018 GLOBALFOUNDRIES

14, 12, 7nm

Up to 60k

(300mm)

350–90nm

40k

(200mm)

180–40nm

68k (300mm)

93k (200mm)

28, 22,12nm

Up to 80k

(300mm)

180-22nm

Up to 85k

(300mm)

90–22nm

Up to 20k

(300mm)

Malta, New York Burlington, Vermont Singapore Dresden, Germany Chengdu, China East Fishkill, New York

Page 12: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

FinFET FD-SOI

eMRAM

eMRAM

GF Has the Industry’s Best Dual-Track Roadmap

© 2018 GLOBALFOUNDRIES

7LP

12LP

14LPP 22FDX®

40/55nm

12FDX™

Power Optimized Performance Optimized

28nm

12LP/14LPP Applications - Mid-Low Tier AP

- Networking Switches

- 5G DFE, RF Connectivity

- High Perf. Compute

7LP Applications - Servers, CPU, AI/ML

- Graphics

- High-end AP

- Networking ASIC

- Auto high-end ADAS

22FDX Applications - IoT: BLE, GPS, NB-IoT

- High-speed ADC

- 5G / mmWave RF

- Auto and Industrial MCU

- WiFi Connectivity

12FDX Applications - AI/ML Inferencing

- Next Gen 5G Infrastructure

- Level 2-3 ADAS/Vision

- Embedded MPU

- <6GHz FEM Transceiver

Page 13: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

22FDX® for IoT, Automotive and RF

© 2018 GLOBALFOUNDRIES

Compelling

FinFET-like performance for customer who still want to use planar devices

Ultra-low voltage (0.4V)

Ultra-low leakage (1pA/μm)

MRAM integration for IoT and Auto-MCU

Automotive grade 2 and 1, mmWave / Radar

Benchmark RF performance (>400GHz), and PA integration

Relevant

IoT - ARM, RISC-V processors for NB-IoT and AI – ML functions

Automotive - ADAS /Vision, Infotainment, Body Electronics MCU, Radar

RF - <6GHz: Connectivity (BLE, Wi-Fi, Zigbee), Cellular (3G, 4G LTE, 5G)

RF – mmWave >26GHz, 5G infrastructure, ADC/DAC integration

Page 14: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

Accelerating RISC-V Developers and Partners

© 2017 GLOBALFOUNDRIES 14

Enabling Universities driving RISC-V innovations – ETH-Zurich / University of Bologna – PULP Architecture

– Berkeley Labs, IIT Chennai

SiFive Core IP targeting Data Center, ML,

Automotive and Embedded applications – E31, E51 Configurable RISC-V Cores

Reduced Energy Microsystems - Drones, Robotics,

Camera Applications – RISC-V IP hardware validated on 22FDX platforms

– 32bit, 64bit Cores with Neural network IP solutions

ANDES Cores for IoT, RF Connectivity applications – 32bit IP cores for 22FDX platform

– Power efficient, smaller foot print designs

Reduce time to market and facilitate FDX™ SoC product design

Page 15: RISC-V Meets 22FDX: an Open Source Ultra-low Power ... · Benchmark RF performance (>400GHz), and PA integration Relevant IoT - ARM, RISC-V processors for NB-IoT and AI – ML

| |

Challenges for next generations IoT End-Nodes: Energy Efficiency Near Threshold Computing

Performance FDX 22nm technology offers HP at low VDD

PVT Variations Body Biasing

We presented Quentin: FDX 22 Implementation of PULPissimo open-source architecture Optimized IO subsystem for efficient data transfers

Processor optimized for energy efficient near-sensor data analytics

Up to 500 MOPS @ 0.8V

45 MOPS/mW @350 MOPS, 0.65V

First step towards silicon-qualified free-open-source RISC-V IPs on GF FDX22 process.

Conclusion

15 Davide Schiavone – ETH Zurich