system implications of integrated photonics · technology sep tem ber 11, 20 08 4 pin-ordering,...

56
System Implications of Integrated Photonics © 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Norman P. Jouppi and Parthasarathy Ranganathan

Upload: others

Post on 11-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

System Implications ofIntegrated Photonics

© 2008 Hewlett-Packard Development Company, L.P.The information contained herein is subject to change without notice

Integrated Photonics

Norman P. Jouppi and Parthasarathy Ranganathan

Page 2: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Today’s talk

• Introduction (Partha)

• Nanophotonics and Capabilities (Norm)• Nanophotonics and Capabilities (Norm)

• Potential Impact and Early Results (Norm)

• Some System Implications (Partha)

2 May 14, 2008 OIHPC – HP Laboratories

Page 3: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Low Power important in all markets

September11,2008 3

• from processors to data centers

• from handhelds to supercomputers

Page 4: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Large body of prior work

• Voltage scaling/islands

• Clock gating/routingClock-tree distribution, half-swing clocks

• Redesigned latches/flip-flops

• Voltage/freq scaling

• Gating

Pipeline, clock, functional units,branch prediction, data path

• Split instrucn windows

• Switching controlRegister relabeling, operand swapping,instruction scheduling

• Memory access reduceLocality optimizations, register allocation

Average power, peak power, power density, energy-delay, …

CIRCUITS ARCHITECTURE COMPILER, OS, APPEnergy-efficienttechnology

September11,2008 4

• Redesigned latches/flip-flopspin-ordering, gate restructuring, topologyrestructuring, balanced delay paths, optimized bittransactions

• Redesigned memory cellsLow-power SRAM cells, reduced bit-line swing,multi-Vt, bit line/word line isolation/segmentation

• Other optimizationsTransistor resizing, GALS, low-power logic

• Split instrucn windows

• SMT thread throttling• Bank partitioning

• Cache redesignSequential, MRU, hash-rehash, column-associative, filter cache, sub-banking,divided word line, block buffers, multi-divided module, scratch

• Low-power states

• DRAM refresh-control• Switching controlGray, bus-invert, address-increment

• Code compression

• Data packing/buffering

• Power-mode-control• CPU/resource schedule

• Memory/disk controlDisk spinning, page allocation, memorymapping, memory bank control

• NetworkingPower-aware routing, proximity-basedrouting, balancing hop count, …

• Distributed computingMobile agents placement, network-drivencomputation• Fidelity control• Dynamic data types• Power API

Energy-efficientresource mgmt

Page 5: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Today’s talk

Integratedphotonics

Disaggregateddatacenters

Page 6: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Nanophotonics

© 2008 Hewlett-Packard Development Company, L.P.The information contained herein is subject to change without notice

Nanophotonics

Page 7: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Current Trends

Page 8: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Cores Per Die

Past

Present

8 August 12, 2008 ISLPED Keynote

Present

Future

Page 9: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Speed Bumps on the Road to 2017

• Off-chip bandwidth requirements will scale geometrically

− (Up to 10 TB/s)

• ITRS pin counts increase from a max of 3072 pins today to:

− 3072 pins in 2017!

• On-chip bandwidths scale geometrically too

9 August 12, 2008 ISLPED Keynote

• On-chip bandwidths scale geometrically too

− Interconnect power is a tougher constraint at each generation

− Mesh and ring bandwidth and latency vary based on data placement

• Non-uniform latencies & bandwidth complicate programming

− Programmer has to worry about placement of data & threads

− Placement needs to change with each new chip

We need a disruptive technology

Page 10: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Capabilities of EmergingIntegrated Photonics

Page 11: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

What are Integrated Photonics?

• The 2000 telecom bubble based on discrete optics

−Think pre-Noyce/Kilby era in electronics

−Components are measured in mm

−Hand alignment

−Expensive and not scalable

Source: Newport Corp.,

Assembly Magazine,

11 August 12, 2008 ISLPED Keynote

−Expensive and not scalable

• Recent research is on integrated photonics

−Think post-Noyce/Kilby era in electronics

−Components are measured in a few mm

−Manufacture many thousands per die

−Advances in lithography yield better devices

September 2001

Page 12: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Important Technology Characteristics

• Several things are important for a successfultechnology, including:

−Gain (leading to fan-in and fan-out)

−Power efficiency

• Reference:

12 August 12, 2008 ISLPED Keynote

• Reference:

“The Physics of VLSI Systems” byRobert W. Keyes, 1987.

Page 13: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Gain

• Transistors have good gain

−Electronics is good for computation

• Photons don’t like to interact

−Photonics is good for long distance communication

−How long is long?

13 August 12, 2008 ISLPED Keynote

−How long is long?

• Depends on size -> capacitance -> power of device

• mm devices: ~30 meters

• mm devices: ~30 millimeters

Page 14: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Fan-in and Fan-out

• Important for efficient system design

−Not economically feasible at signaling rates >2Gbs inelectrical systems due to stub problems

−Possible in optics by using splitters and combiners

• Electrical point-to-point links do not scale well

14 August 12, 2008 ISLPED Keynote

• Electrical point-to-point links do not scale well

−Adds to pin bandwidth limitations

−Repeated buffering of signals adds delay & power

• FBDIMM example

Page 15: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

FBDIMM Memory System

MemoryController AMB AMB AMB AMB

Control, address,& write data10 southbound

15 August 12, 2008 ISLPED Keynote

14 northboundRead data

24 differential signals@ 4.8Gbps ≈ 13GB/s

• Latency: Multi-hops

• Power: re-transmission

• Cost

Page 16: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Example: 5 cascaded microring

resonators, slightly different radii ~

1.5 mm.

High Q of 9,000 (BW ~ 20 GHz)

and high extinction ratio of 16 dB.

Si Microrings

16 August 12, 2008 ISLPED Keynote

b

0.17 nm

Q. Xu, D. Fattal, and RGB, Opt. Express 16, 4309-4315 (2008) — World Record!

Page 17: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Si Ring Resonator in Action

17 August 12, 2008 ISLPED Keynote

Page 18: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Ring ResonatorsOne basic structure, 3 applications

SiGe Doped

18 August 12, 2008 ISLPED Keynote

• A modulator – move in and out of resonance to modulatelight on adjacent waveguide

• A switch – transfers light between waveguides only whenthe resonator is tuned

• A wavelength specific detector - add a doped junction toperform the receive function

Page 19: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Power Efficiency

• Hybrid actively mode-locked lasers or comb lasers

− Produce all wavelengths from a single source

− Track with temperature

• Si microring modulators

− Parallel buses with clock forwarding (no SERDES)

19 August 12, 2008 ISLPED Keynote

− Parallel buses with clock forwarding (no SERDES)

− DWDM: 256 waveguides × 64 wavelengths each = 256 × 64 Xbar

− Analog drivers for both modulators and detectors (no A/D)

− Femtofarad-class low-power receiverless detectors

=> Low power 10 Gb/s signaling

Page 20: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Potential Impact in 2017

Page 21: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

The Corona Manifesto

•Take full advantage of nanophotonics−Don’t just replace today’s wires with optics

−Redesign the multi-core processor from the ground up

−No off-chip or cross-chip electrical wires

21 August 12, 2008 ISLPED Keynote

−Restore balance: memory bandwidth scales with cores

−All memory readily reachable from all cores

Page 22: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

10 teraflops compute performance

10 terabytes/s memory bandwidth

20 terabytes/s on-chip interconnect

All off-socket and cross-socket

Corona System Overview

OCM

OCM

OCM

OCM

OCM

OCM

OCM

OCM

OCM

22 August 12, 2008 ISLPED Keynote

Optical connections to I/O and other Corona sockets

All off-socket and cross-socketcommunication is optical

OCM OCM OCM

Corona compute socket

Page 23: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Optically Connected Memory (OCM)

OCM

OCM

OCM

OCM

OCM

OCM

OCM

OCM

OCM

23 August 12, 2008 ISLPED Keynote

OCM OCM OCM

Corona compute socket

Page 24: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Optically Connected Memory

• Master/slave bus on waveguide loop− Optical power from processor

− Processor modulates for data out

− OCM modulates for return data

• Multiple optical interfaces per chip stack− Eliminates electronic global wiring

24 August 12, 2008 ISLPED Keynote

− Eliminates electronic global wiring

• OCMs communicate via DWDM− High bandwidth

• Accessed in parallel, no receive and retransmit like FBDIMM− Large capacities with low latency and power

• OCM only activates one DRAM mat per cache line fill/write− Less overfetching (in conventional DIMM 128X) much lower power

• High bandwidth at low power

Page 25: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

OCM Chip Stack

25 August 12, 2008 ISLPED Keynote

Optical Die

Control & Interface Silicon

Through Silicon Vias

forming vertical data buses

Package

DRAM

Fiber: from processorFiber to other OCM

Page 26: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Corona Compute Socket

Core

Share

dL2

Cache

MemoryController

OCM

OCM

OCM

OCM

OCM

OCM

OCM

OCM

OCM

OCM

26 August 12, 2008 ISLPED Keynote

S

Share

dL2

Cache

Hub Directory

NetworkInterface

Crossbar

Core

Core

Core

OCM OCM OCM

Corona compute socket

Cluster0

Cluster1

Cluster63

Optical Crossbar

Page 27: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Corona Cluster Parameters

• Per each of 64 clusters:− Cores: 4

− Memory controllers: 1

− L2 cache: 4 MB, 16-way, 64B lines

• Per-core:− Frequency: 5 GHz

S

Core

Share

dL2

Cache

Hub Directory

MemoryController

OCM

Core

Core

27 August 12, 2008 ISLPED Keynote

− Frequency: 5 GHz

− Threads: 4

− L1 I-Cache: 16 KB, 4-way, 64B lines

− L1 D-Cache: 32 KB, 4-way, 64B lines

− Issue: 2-wide in-order

− 64 b SIMD FP width 4 + Fused FP operations

Share

dL2

Cache

NetworkInterface

Crossbar

Core

Page 28: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Corona Chip Stack

28 August 12, 2008 ISLPED Keynote

Page 29: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

On-chip Interconnect

OCM

OCM

OCM

OCM

OCM

OCM

OCM

OCM

OCM

29 August 12, 2008 ISLPED Keynote

OCM OCM OCM

Corona compute socket

Cluster0

Cluster1

Cluster63

Optical Crossbar

Page 30: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

The Optical Crossbar

30 August 12, 2008 ISLPED Keynote

Optical

hub

Page 31: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

All-optical Arbitration

•A single micro-ring bothasserts request and detectssuccess or failure

•Requester tries to divert onewavelength

•Detected power:success/failure

grant

request

Theimageca

request

Theimageca

optical “available” signal

grant

31 August 12, 2008 ISLPED Keynote

success/failure

•Off resonance micro-rings addno delay and negligible loss –> highly scalable

•Arbitration time is lightpropagation time

•DWDM –> many concurrentarbitrations

arbitration propagation

equivalent electronic circuit

grantrequest grantrequest

Page 32: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Performance

• Compare 5 systems using:

−Three different on-chip interconnects

• Electrical 2D on-chip mesh, 0.64 TB/s and 5 cycle hops (LMesh)

• Electrical 2D on-chip mesh, 1.28 TB/s and 5 cycle hops (HMesh)

• Optical crossbar, 20.48 TB/s and 8 cycles total

32 August 12, 2008 ISLPED Keynote

• Optical crossbar, 20.48 TB/s and 8 cycles total

−Two different memory subsystems

• Electrical 0.96 TB/s, 1536 signal pins, memory latency is 20 ns

• Optical 10.24 TB/s, 256 fibers, memory latency is 20 ns

• Simulate using COTSon + M5

−4 synthetic benchmarks

−SPLASH-2

Page 33: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Performance (LMesh/ECM = 1)

33 August 12, 2008 ISLPED Keynote

Applications that don’t fit in cache show 4-6X improvements with Xbar

Page 34: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

On-chip Network Power

34 August 12, 2008 ISLPED Keynote

Optics can reduce network power of aps that don’t fit in cache by 6X

Page 35: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Optics Can Remove the Bottlenecks

• Bandwidth scales to 1,000 threads

−10 TB/s off-chip bandwidth

−20 TB/s bandwidth between cores

−Modest power requirements

Low, uniform latencies between cores & memory

35 August 12, 2008 ISLPED Keynote

• Low, uniform latencies between cores & memory

• Coherent shared memory still possible

Page 36: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Near-term Technologies

Page 37: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Optical Buses

• Preview of upcoming Hot Interconnects presentation

−“A High-Speed Optical Multi-drop Bus for ComputerInterconnections,” Mike Tan et. al.

37 August 12, 2008 ISLPED Keynote

Page 38: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Optical Multidrop Bus – A Master Slave BusMaster Module A Module B Module C Module D

M

Tx Rx Rx Tx Rx Tx Rx Tx Rx Tx

BS1

BS1

M BS2

BS2

BS3

BS3 BS4

BS4 12

12

38 August 12, 2008 ISLPED Keynote

• Replace electrical transmission line with optical waveguides

• Replace electrical stubs with optical taps

• Two Unidirectional buses: 12 bit wide @ 10Gb/s = 30GB/s

− Master broadcasts to each module on the bus;

− Distribute optical power equally among modules

− Each module sends data back to the master at full bus bandwidth

• Lower latency with reduced power

Page 39: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Optical Waveguide

• Hollow Metal Waveguides(1) (HMWG)

− Low propagation loss – light rays travel at near grazingangle to metal walls

− Low numerical aperture

− Prop delay 33psec/cm

39 August 12, 2008 ISLPED Keynote

EH11 mode

− Prop delay 33psec/cm

Air core

Ag clad (n, k) = (0.15+i 5.68)

w =150µm, h =150µm

a = 0.0015 dB/cm

neff ~ 1

NA ~ 0.01w

h

(1) E. Marcatili et al., Bell Syst. Tech. J. 43, 1783 (1964).

Page 40: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Optical Taps

• Non-Polarizing Pellicle Beam Splitters

− Low cost VCSELs randomly polarized

−Negligible beam-walk off

Optical coatingNPPBS Optical coatingNPPBS Optical coatingOptical coatingNPPBS

40 August 12, 2008 ISLPED Keynote

11% NPPBS – 15 layers ofSiO2/TiO2 on 250nm SiN

NPPBS

gap

135o

HMWG

Optical coatingon SiN film

Etched holeSi frame

NPPBS

NPPBS

gap

135o

HMWG NPPBS

gap

135o

HMWG

Optical coatingon SiN film

Etched holeSi frame

NPPBS Optical coatingon SiN film

Etched holeSi frame

Optical coatingon SiN film

Etched holeSi frame

NPPBS

Insertion into HWMG @ 45o

Page 41: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

1x8 Fanout

12345678 M

3cm

VCSEL drivenfrom BERT thrubias-tee

30 cm

41 August 12, 2008 ISLPED Keynote

Light beams from taps.

busLast tap exitsbus end IR camera image

Light input

Page 42: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

1x8 Fanout @ 10.3125Gbps (L=30cm)

Tap 2Tap 1 Tap 320ps/div

42 August 12, 2008 ISLPED Keynote

Tap 4 Tap 5 Tap 6

Tap 7 Tap 8

Page 43: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Optical Bus Summary

• Can build today

• Provides good fan-in and fan-out (>8)

• Distance not an issue

• Composite structures (e.g., crossbars) possible

43 August 12, 2008 ISLPED Keynote

• Composite structures (e.g., crossbars) possible

Page 44: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Conclusions From Norm’s Section

• Integrated photonics has the potential to:

−Dramatically improve memory bandwidth

−Significantly improve many-core performance

−Reduce power

−Simplify programming

44 August 12, 2008 ISLPED Keynote

−Simplify programming

−All at the same time!

• Near term applications such as optical buses

−Add significant system flexibility

−Save latency and power

Page 45: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Acknowledgements

• This includes contributions from many people:

−All my ISCA 2008 coauthors

−All my 2008 Hot Interconnects coauthors

• Special thanks for slide materials:

−Mike Tan, Ray Beausoleil, Moray McLaren, Nathan

45 August 12, 2008 ISLPED Keynote

−Mike Tan, Ray Beausoleil, Moray McLaren, NathanBinkert, Jung Ho Ahn, Qianfan Xu

Page 46: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

System Implications

© 2008 Hewlett-Packard Development Company, L.P.The information contained herein is subject to change without notice

System Implications

Page 47: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Confluence of Optics andOther Systems Trends

multicores, virtualization, fabric convergence, non-volatile storage,manageability, power, resilience trends, volume/value blurring, web2.0

datacenters,SMB/BRIC, costs, flexibility, commodity…

=

47 May 14, 2008 OIHPC – HP Laboratories

=Interesting opportunity to

rethink system arch & mgmt

Page 48: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Designing Future Servers & Datacenters

Proposal:

power-efficient building blocks

co-designed across hardware/software

dynamically shared & configured as ensembles

48 11 September 2008

dynamically shared & configured as ensembles

as needed, when needed

Why?

One design: address power, m’gbility, scale, costs

Page 49: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Designing Future Servers & Datacenters

49 11 September 2008

cost-efficient building blocks across hardware/software,

dynamically shared and configured at datacenter level

Page 50: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Several Interesting Research Directions

•Rethink architecture[Beyond the “box” to the datacenter]

•Rethink management[Beyond the “platform” to the solution]

50 May 14, 2008 OIHPC – HP Laboratories

Page 51: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

E.g., Disaggregated systems

51 May 14, 2008 OIHPC – HP Laboratories

• Reduce memory power

• Enable non-volatile storage

Page 52: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

E.g., “Dematerialized” systems

• Improved cable management

• Improved packaging efficiencies

52 May 14, 2008 OIHPC – HP Laboratories

[Chandrakant Patel, Dematerializing the Ecosystem, Usenix08]

Page 53: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Early results (for web2.0) [isca2008]

400%

500%

600%

700%

N1 N2

1.5XImprovement

2.0XImprovement

Perfo

rmance

per

dolla

r

53 11 September 2008

0%

100%

200%

300%

400%

websearch webmail ytube mapred-wc mapred-wr Hmean

Improvement

Perfo

rmance

per

dolla

r

Even higher results possible with photonics…

Page 54: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Closing Remarks

• Integrated photonics had disruptive potential

−Energy efficiency

− Improved bandwidth

−Simpler programming

• Future systems implications

−New architectures & flexibility (e.g., optical buses)

−Disaggregation and dematerialization enablement

54 May 14, 2008 OIHPC – HP Laboratories

Page 55: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

Closing Remarks

Integratedphotonics

Disaggregateddatacenters

Page 56: System Implications of Integrated Photonics · technology Sep tem ber 11, 20 08 4 pin-ordering, gate restructuring, topology restructuring, balanced delay paths, optimized bit transactions

56 August 12, 2008 ISLPED Keynote