indirect (dynamic) networks
DESCRIPTION
Indirect (dynamic) Networks. Communication between any two nodes has to be carried through some switches . Classified into: Crossbar network Multistage interconnection networks (MIN) Nonblocking networks Rearrangeable nonblocking networks Blocking networks - PowerPoint PPT PresentationTRANSCRIPT
NC 論2 (No.2) 1
Indirect (dynamic) Networks
• Communication between any two nodes has to be carried through some switches.
• Classified into:– Crossbar network
– Multistage interconnection networks (MIN)• Nonblocking networks
• Rearrangeable nonblocking networks
• Blocking networks
– Single-stage interconnection networks
NC 論2 (No.2) 2
Crossbar (nonblocking, 1-hop)
An N x M crossbar
The cost increases as N x M.
Switch point
(crosspoint)
M outputs
N
inputs
N:1 multiplexer
NC 論2 (No.2) 3
A large crossbar0
1
2
3
NC 論2 (No.2) 4
MINs
Three-stage MIN
first stage
second stage
third stage
N
inputs
M outputs
NC 論2 (No.2) 5
Nonblocking MIN
• Any input port can be connected to any free output port without affecting the existing connections.
• Same functionality as a crossbar.
• The Clos network is proposed to reduce the cost of crossbar by decreasing the number of switches (1953).
NC 論2 (No.2) 6
History (telephone net.)
• Early telephone networks were built from elector-mechanical crossbars.
• Key developments in telephone switching include the Clos network in 1953 and the Benes network in 1962.
• Many large telephone switches today are still built from Clos or Clos-like networks.
NC 論2 (No.2) 7
3-stage Clos network (m,n,r)
Nonblocking: m >= 2n-1
Rearrangeable: 2n-1 > m >= n
N
inputsN
outputs
1
2
r
1
n
1
n
1
n
1
2
m
1
2
r
1
n
1
n
1
n
n×m r×r m×n
NC 論2 (No.2) 8
Rearrangeable nonblocking MIN
• Any input port can be connected to any free output port with rearrangement of paths for the existing connections.
• Less stages or smaller switches than a nonblocking networks.
• The best known example is the Benes network.
NC 論2 (No.2) 9
Benes network
An 8x8 Benes network (Eσ 3-1 E σ 2
- 1 Eσ 2 Eσ 3 E)
N(2log2N - 1) / 2 switches, 2log2N - 1 hops, E: stage
0
1
2
3
4
5
6
7
0
1
2
3
4
5
6
7
σ 3-1 σ 2
-1 σ 2 σ 3
010
001
(2,2,2) clos
(2,2,2) clos
(2,4,2) Clos
NC 論2 (No.2) 10
2x2 switch (radix: 2)
straight exchange lowerbroadcast
upperbroadcast
NC 論2 (No.2) 11
Permutation • Permutations are traffic patterns in which all traffic
from each source is directed to one destination.• It can be compactly represented by a permutation
function that maps source to destination.• Bit (digit) permutation are those in which each bit
(digit) of the destination address is a function of one bit (digit) of the source address.– di = sj or dx = f(sx)
NC 論2 (No.2) 12
Permutation functions (1/2)Communication from node address x(an,an-1,an-2,…, a1) to
• Perfect shuffle: σ(x)=(an-1,an-2,…, a1,an)
• k sub-shuffule: σk(x)=(an,an-1,…, ak+1,ak-1, … , a1,ak)
• k supershuffule: σk(x)=(an-1,an-2,…, an-k+1, an, an-k, … , a1)
• Inverse perfect shuffle: σ-1(x)=(a1 , an , an-1,an-2,…, a2)
• k sub-inverse shuffule: σk-1
(x)=(an , …, ak+1 , a1,ak,…, a2)
• Butterfly: β(x)=(a1 , an-1,an-2,…, a2 , an)
• k sub-butterfly: βk (x)=(an,…, ak+1 , a1,ak-1, … , a2,ak)
• k superbutterfly: βk (x)=(an-k+1, an-1,…, an-k+2 , an,an-k, … , a1)
NC 論2 (No.2) 13
Permutation functions (2/2)Communication from node address x(an,an-1,an-2,…, a1) to
• Bit reversal: ρ(x)=(a1,a2,…, an-1, an)
• k sub-bit reversal : ρk(x)=(an,an-1,…, ak+1,a1, a2,… , ak-1,ak)
• k super-bit reversal: ρk(x)=(an-k+1,an-k+2,…, an-1, an, an-k, … , a1)
• shift: α(x)= | x+1| (2n), where modulo by 2n
• k sub-shift: αk(x)= | x+1| (2n) +floor(x/2k)2k
• k super-shift: αk (x)= | x+ 2n-k | (2n)
• Exchange: Ei(x)=(an,an-1, …, ~ai,…, a1)
NC 論2 (No.2) 14
Perfect shuffle (1/2)000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
Perfect shuffle
σ(x)=(an-1,an-2,…, a1,an)Inverse perfect shuffle
σ-1(x)=(a1,an an-1,an-2,…, a2)
NC 論2 (No.2) 15
Perfect shuffle (2/2)000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
3 sub-inverse shuffule
σ3-1
(x)=(a1 , a3 , a2)2 sub-inverse shuffle
σ2-1
(x)=(a3, a1, a2)
NC 論2 (No.2) 16
Butterfly000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
butterfly (bit reversal)
β3(x)=(a1 , a2 , a3)
= ρ 3 (x)
2 sub-butterfly
β2(x)=(a3, a1, a2)
NC 論2 (No.2) 17
shift000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
shift
α(x)= | x+1 | (23)2 sub-shift
α 2(x)=| x+1 | (22)+floor(x/ 22) 22
NC 論2 (No.2) 18
exchange000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
E3(x) = (~a3, a2, a1) E2(x) = (a3, ~a2, a1)
NC 論2 (No.2) 19
Blocking MIN
• A free input/output pair is not always possible because of conflicts with the existing connections.
• Minimizing the number of switches and stages.
• Unidirectional MIN (Cenju-3, IBM PR3)
• Bidirectional MIN (IBM SP, TMC CM5)
NC 論2 (No.2) 20
Unidirectional MINs• Channels and switches are unidirectional.• At least ceiling(logkN) stages are required for a MIN
of N input and output ports with k×k switches.• Every path through the MIN crosses all the stages
(same length).• Examples: baseline, butterfly, indirect cube
logN stages for N inputs/outputs with 2x2 switches( binary logN-fly in the form of k-ary n-fly)the number of communication patterns are (√N)N
NC 論2 (No.2) 21
Baseline networks (1/2)(B = Eσn
-1 Eσn-1-1 E ・・・ Eσ 2
- 1 E)
000
001
010
011
100
101
110
111
000
001 (UUL)
010 (ULU)
011
100
101
110
111
abc cab cbaσ 2
- 1σ 3- 1
001
NC 論2 (No.2) 22
Baseline networks (2/2)
• To route (an,an-1,an-2,…, a1) to (bn,bn-1,bn-2,…, b1),
select upper link when bi=0, otherwise select lower link on (n-i+1) stage (MSB to LSB).
(path is determined only by the destination address)
switch
upper link
lower linkinputs
NC 論2 (No.2) 23
Omega networks (1/2) Ω= (σE)n
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
σσσabc bca abccab
NC 論2 (No.2) 24
Omega networks (2/2)
• To route (an,an-1,an-2,…, a1) to (bn,bn-1,bn-2,…, b1),
set the switch as straight when ai^bi=0, otherwise set it as exchange on (n-i+1) stage (MSB to LSB).
straight exchange
NC 論2 (No.2) 25
Indirect binary n-cube (1/2)(C = Eβ 2 Eβ 3 ・・・ E β n Eσ - 1)
000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
σ-1β 3β 2
000001
100101
010
110
011
111
abc acb abcbca
NC 論2 (No.2) 26
Indirect binary n-cube (2/2)
• To route (an,an-1,an-2,…, a1) to (bn,bn-1,bn-2,…, b1),
set the switch as straight when ai^bi=0, otherwise set it as exchange on i-th stage (LSB to MSB).
• A k-ary n-fly network consists of kn source/destination terminals, n stages of kn-1 k×k crossbar switches.– Degree of the switches: 2k
– Diameter: n+1 or logkN +1 where N = kn
NC 論2 (No.2) 27
Relation among blocking networks
• For omega, indirect n-cube and Banyan(Y):Ω-1=C=Yσ
So they are almost functionally equal (isomorphic).
• The communication patterns are different, but same number of combinations ((√N)N).
• Smaller number of communication patterns than nonblocking networks (N!).
NC 論2 (No.2) 28
Bitonic sort
8 inputs sorter4 inputs sorter2 inputs sorter
A L
B H
NC 論2 (No.2) 29
Batcher-banyan network
Bitonic
Sorter
(Batcher stage)
Ω
(banyan
stage)
Non-blocking
5
∞
3
1
∞
・・∞
1
3
5
∞
∞
・・∞
1
3
5
receiver Ordered addresses
NC 論2 (No.2) 30
Tree saturation (e.g. on Omega)000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
Combining: combine two messages
hotspot
NC 論2 (No.2) 31
Fault-tolerant MINs
• Blocking MINs provide a single path between any input/output pair.
• If a link becomes congested or fails, the unique path property can easily disrupt the communication (no path diversity).
• A solution is to provide multiple routing paths by adding extra stages or channels.
NC 論2 (No.2) 32
Fault-tolerant MINs (Omega’s case)000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
σσσAdditional
stage
NC 論2 (No.2) 33
Fault-tolerant MINs (Baseline’s case)000
001
010
011
100
101
110
111
000
001
010
011
100
101
110
111
σ 2- 1σ 3
- 1
NC 論2 (No.2) 34
Bidirectional MINs (BMINs)
• Channels and switches are bidirectional.• Information can be transmitted simultaneously in
opposite directions between neighboring switches.• Paths are established in BMINs by crossing stages
in forward, then turnaround, and finally backward.
• Several paths are possible in forward direction, but a single path is available in backward direction.
NC 論2 (No.2) 35
Bidirectional Banyan networks000
001
010
011
100
101
110
111
β 3β 2
000001
100101
010
110
011
111
000001
100101
010
110
011
111
forward
backward
turnaround
NC 論2 (No.2) 36
A fat tree BMIN
forwardbackward
NC 論2 (No.2) 37
Hybrid networks
• Combine mechanisms from shared-medium networks and direct or indirect networks.
• Increase bandwidth and reduce the distance between nodes.
• Examples:– Multiple backplane buses– Hierarchical buses– Cluster-based networks
NC 論2 (No.2) 38
Multiple backplane buses
device device device
bus
bus
bus
NC 論2 (No.2) 39
Hierarchical buses
Global bus
Cluster bus
bridge
NC 論2 (No.2) 40
Cluster-based networks
Cluster bus
ClusterCluster Cluster
ClusterCluster Cluster
router
NC 論2 (No.2) 41
History (Clos networks, etc.)
• Clos introduced non-blocking networks in1953.• Benes discovered rearrangeable non-blocking
networks in 1965.• Bachter introduced the bitonic sort and the
Batcher sorting network in 1968.• Knuth’s book in 1998 treats sorting networks in
detail.• Multicast in Clos networks was first considered by
Masson and Jordan(1972), and has since been studied by many researchers.
NC 論2 (No.2) 42
History (indirect networks)
• Processor-memory networks emerged in the late 1960s in the form of indirect netwoks.
• The smallest machine employed crossbar, whereas larger sytems used butterfly, etc..
• Since the early 1990s, a variant of the Clos and Benes networks of telephony has emerged in SMP (MPP) networks in the form of the fat tree.
• Today, the high pin bandwidth of router chips relative to message length motivates the use of networks of higher node degree, such as Clos and fat free.