ece/cs 757: advanced computer architecture ii interconnectsece/cs 757: advanced computer...

28
ECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison Lecture notes created by Natalie Enright Jerger

Upload: others

Post on 09-Oct-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

ECE/CS 757: Advanced Computer Architecture II

Interconnects

Instructor:Mikko H Lipasti

Spring 2017

University of Wisconsin-Madison

Lecture notes created by Natalie Enright Jerger

Page 2: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Lecture Outline

• Introduction to Networks

• Network Topologies

• Network Routing

• Network Flow Control

• Router Microarchitecture

2Mikko Lipasti-University of Wisconsin

Page 3: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

What is an Interconnection Network?

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)3

Logic Mem Logic Mem

Logic Mem Logic Mem

Communication

ApplicationAlgorithm

Programming LanguageOperating System

Instruction Set Architecture

Microarchitecture

Register Transfer Level

CircuitsDevices

Technology

Com

pute

r A

rchit

ectu

re

Page 4: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

What is an Interconnection Network?

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)4

• Application: Ideally wants low-latency, high-bandwidth, dedicated channels between logic and memory

• Technology: Dedicated channels too expensive in terms of area and power

ApplicationAlgorithm

Programming LanguageOperating System

Instruction Set Architecture

Microarchitecture

Register Transfer Level

CircuitsDevices

Technology

Logic Mem Logic Mem

Logic Mem Logic MemCom

pute

r A

rchit

ectu

re

Dedicated

Channels

Page 5: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

What is an Interconnection Network?

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)5

• An Interconnection Network is a programmable system that transports data between terminals

• Technology: Interconnection network helps efficiently utilize scarce resources

• Application: Managing communication can be critical to performance

Logic Mem Logic Mem

Logic Mem Logic Mem

Interconnection Network

ApplicationAlgorithm

Programming LanguageOperating System

Instruction Set Architecture

Microarchitecture

Register Transfer Level

CircuitsDevices

Technology

Com

pute

r A

rchit

ectu

re

Page 6: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Interconnection Networks Introduction

• Interconnection networks should be designed

• to transfer the maximum amount of information

• within the least amount of time (and cost, power

constraints)

• so as not to bottleneck the system

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)6

Page 7: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Types of Interconnection Networks

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)7

• Interconnection networks can be grouped into four domains– Depending on number and proximity of devices to be

connected

• On-Chip Networks (OCNs or NoCs)– Devices include microarchitectural elements (functional

units, register files), caches, directories, processors– Current/Future systems: dozens, hundreds of devices

• Ex: Intel TeraFLOPS research prototypes – 80 cores• Intel Single-chip Cloud Computer – 48 cores

– Proximity: millimeters

Page 8: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Types of Interconnection Networks (2)

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)8

• System/Storage Area Networks (SANs)– Multiprocessor and multicomputer systems

• Interprocessor and processor-memory interconnections

– Server and data center environments• Storage and I/O components

– Hundreds to thousands of devices interconnected • IBM Blue Gene/L supercomputer (64K nodes, each with 2

processors)

– Maximum interconnect distance: tens of meters (typical) to a few hundred meters

– Examples (standards and proprietary)• InfiniBand, Myrinet, Quadrics, Advanced Switching Interconnect

Page 9: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Types of Interconnection Networks (3)

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)9

• Local Area Networks (LANs)– Interconnect autonomous computer systems

• Machine room or throughout a building or campus

– Hundreds of devices interconnected (1,000s with bridging)– Maximum interconnect distance

• few kilometers to few tens of kilometers

– Example (most popular): Ethernet, with 10 Gbps over 40Km

• Wide Area Networks (WANs)– Interconnect systems distributed across globe

• Internetworking support required

– Many millions of devices interconnected– Max distance: many thousands of kilometers– Example: ATM (asynchronous transfer mode)

Page 10: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Interconnection Network Domains

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)10

Local Area

Networks

Dis

tan

ce (

met

ers)

10-3

100

103

106

Number of devices interconnected

1 10 100 1,000 10,000 >100,000

System/Storage

Area Networks

Wide Area

Networks

Slide courtesy Timothy Mark Pinkston and José Duato

Metropolitan Area

Networks

Page 11: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Interconnection Network Domains

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)11

Local Area

Networks

Dis

tan

ce (

met

ers)

10-3

100

103

106

Number of devices interconnected

1 10 100 1,000 10,000 >100,000

System/Storage

Area Networks

Wide Area

Networks

Slide courtesy Timothy Mark Pinkston and José Duato

Metropolitan Area Networks

On-Chip Interconnection

Networks/Networks-on-

Chip

Page 12: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Why Study Networks on Chip?

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)12

Page 13: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Why Study On-Chip Networks?

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)13

Page 14: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Example of Multi- and Many-Core Architectures

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)14

Page 15: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Why study interconnects?

• They provide external connectivity from system to outside world– Also, connectivity within a single computer system at many

levels• I/O units, boards, chips, modules and blocks inside chips

• Trends: high demand on communication bandwidth – increased computing power and storage capacity– switched networks are replacing buses

• Computer architects/engineers must understand interconnect problems and solutions in order to more effectively design and evaluate systems

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)15

Slide courtesy Timothy Mark Pinkston and José Duato

Page 16: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

On-Chip Networks (OCN or NoCs)

• Why On-Chip Network?– Ad-hoc wiring does not

scale beyond a small number of cores• Prohibitive area

• Long latency

• OCN offers – scalability

– efficient multiplexing of communication

– often modular in nature (ease verification)

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)16

Page 17: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Differences between on-chip and off-chip networks

• Significant research in multi-chassis interconnection networks (off-chip)– Supercomputers

– Clusters of workstations

– Internet routers

• Leverage research and insight but…– Constraints are different

– New opportunities

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)17

Page 18: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Off-chip vs. on-chip• Off-chip: I/O bottlenecks

– Pin-limited bandwidth

– Inherent overheads of off-chip I/O transmission

• On-chip– Wiring constraints

• Metal layer limitations

• Horizontal and vertical layout

• Short, fixed length

• Repeater insertion limits routing of wires– Avoid routing over dense logic

– Impact wiring density

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)18

Page 19: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Off-Chip vs On-Chip

• On-Chip

– Power

• Consume 10-15% or more of die power budget

– Latency

• Different order of magnitude

• Routers consume significant fraction of latency

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)19

Page 20: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

New opportunities• Abundant wiring

– Change in relative cost of wires and buffers

– Many short flits

• Tightly integrated into system– Not commodity – fully customized design

– Allows for optimization with uncore• Cache coherence

• Emerging technology– Optics

– 3D

Spring 2014 20ECE 1749H: Interconnection

Networks (Enright Jerger)

Page 21: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

On-Chip Network Evolution

• Ad hoc wiring– Small number of nodes

• Buses and Crossbars– Simplest variant of on-chip networks

– Low core counts

– Like traditional multiprocessors• Bus traffic quickly saturates with a modest number of cores

– Crossbars: higher bandwidth• Poor area and power scaling

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)21

Page 22: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Multicore Examples (1)

• Niagara 2: 8x9 crossbar (area ~= core)• Rock: Hierarchical crossbar (5x5 crossbar

connecting clusters of 4 cores)Spring 2014

ECE 1749H: Interconnection Networks (Enright Jerger)

22

Sun Niagara

XBAR

0

1

2

3

0 1 2 3

4

5

4 5

Page 23: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Multicore Examples (2)

• IBM Cell

• Element Interconnect Bus

– 12 elements

– 4 unidirectional rings• 16 Bytes wide

• Operates at 1.6 GHz

Spring 2014ECE 1749H: Interconnection

Networks (Enright Jerger)23

IBM Cell

RING

Page 24: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Many Core Example

• Intel TeraFLOPS

– 80 core prototype

– 5 GHz

– Each tile:• Processing engine +

on-chip network router

Spring 2014 ECE 1749H: Interconnection Networks (Enright Jerger)

2D MESH

Page 25: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Many-Core Example (2): Intel SCC

Spring 2014ECE 1749H: Interconnection Networks (Enright

Jerger)

• Intel’s Single-chip Cloud Computer (SCC) uses a 2D mesh with state of the art routers

25

Page 26: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Performance and Cost

• Performance: latency and throughput• Cost: area and powerSpring 2014

ECE 1749H: Interconnection Networks (Enright Jerger)

26

Lat

ency

(se

c)

Offered Traffic (bits/sec)

Zero load latency

Sat

ura

tion

thro

ughp

ut

Page 27: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Lecture Outline

• Introduction to Networks

• Network Topologies

• Network Routing

• Network Flow Control

• Router Microarchitecture

27Mikko Lipasti-University of Wisconsin

Page 28: ECE/CS 757: Advanced Computer Architecture II InterconnectsECE/CS 757: Advanced Computer Architecture II Interconnects Instructor:Mikko H Lipasti Spring 2017 University of Wisconsin-Madison

Readings• Read: [18] N. Enright Jerger, L.-S. Pei, “On-Chip

Networks,” Synthesis Lectures on Computer Architecture, http://www.morganclaypool.com/doi/abs/10.2200/S00209ED1V01Y200907CAC008

• Review: [19] D. Wentzlaff, P. Griffin, H. Hoffmann, L. Bao, B. Edwards, C. Ramey, M. Mattina, C.-C. Miao, J. F. B. III, and A. Agarwal. On-Chip Interconnection Architecture of the Tile Processor. IEEE Micro, vol. 27, no. 5, pp. 15-31, 2007

Mikko Lipasti-University of Wisconsin 28