parallel computer architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf ·...

53
Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0 Parallel Computer Architectures Chapter 8

Upload: trinhnhi

Post on 26-May-2018

227 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Parallel Computer Architectures

Chapter 8

Page 2: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Parallel Computer Architectures

(a) On-chip parallelism. (b) A coprocessor. (c) A multiprocessor.

(d) A multicomputer. (e) A grid.

Page 3: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Instruction-Level Parallelism

(a) A CPU pipeline. (b) A sequence of VLIW instructions. (c) An instruction stream with bundles marked.

Page 4: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

The TriMedia VLIW CPU (1)

A typical TriMedia instruction, showing five possible operations.

Page 5: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

The TriMedia VLIW CPU (2)

The TM3260 functional units, their quantity, latency, and which instruction slots they can use.

Page 6: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

The TriMedia VLIW CPU (3)

The major groups of TriMedia custom operations.

Page 7: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

The TriMedia VLIW CPU (4)

(a) An array of 8-bit elements. (b) The transposed array.

(c) The original array fetched into four registers.

(d) The transposed array in four registers.

Page 8: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

On-Chip Multithreading (1)

(a) – (c) Three threads. The empty boxes indicated that the thread

has stalled waiting for memory. (d) Fine-grained multithreading.

(e) Coarse-grained multithreading.

Page 9: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

On-Chip Multithreading (2)

Multithreading with a dual-issue superscalar CPU. (a) Fine-grained multithreading.

(b) Coarse-grained multithreading. (c) Simultaneous multithreading.

Page 10: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Hyperthreading on the Pentium 4

Resource sharing between threads in the

Pentium 4 NetBurst microarchitecture.

Page 11: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Homogeneous Multiprocessors on a Chip

Single-chip multiprocessors.

(a) A dual-pipeline chip. (b) A chip with two cores.

Page 12: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Heterogeneous Multiprocessors on a Chip (1)

The logical structure of a simple DVD player contains a heterogeneous

multiprocessor containing multiple cores for different functions.

Page 13: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Heterogeneous Multiprocessors on a Chip (2)

An example of the IBM CoreConnect architecture.

Page 14: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Introduction to Networking (1)

How users are connected to servers on the Internet.

Page 15: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Introduction to Networking (2)

A packet as it appears on the Ethernet.

Page 16: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Introduction to Network Processors

A typical network processor board and chip.

Page 17: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

The Nexperia Media Processor

The Nexperia heterogeneous multiprocessor on a chip.

Page 18: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Multiprocessors

(a) A multiprocessor with 16 CPUs sharing a common memory.

(b) An image partitioned into 16 sections, each being analyzed by a different CPU.

Page 19: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Multicomputers (1)

(a) A multicomputer with 16 CPUs, each with its own private memory.

(b) The bit-map image of Fig. 8-17 split up among the 16 memories.

Page 20: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Multicomputers (2)

Various layers where shared memory can be implemented. (a) Thehardware. (b) The operating system. (c) The language runtime system.

Page 21: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Taxonomy of Parallel Computers (1)

Flynn’s taxonomy of parallel computers.

Page 22: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Taxonomy of Parallel Computers (2)

A taxonomy of parallel computers.

Page 23: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Sequential Consistency

(a) Two CPUs writing and two CPUs reading a common memoryword. (b) - (d) Three possible ways the two writes and four

reads might be interleaved in time.

Page 24: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Weak Consistency

Weakly consistent memory uses synchronization operations todivide time into sequential epochs.

Page 25: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

UMA Symmetric Multiprocessor Architectures

Three bus-based multiprocessors. (a) Without caching. (b) With

caching. (c) With caching and private memories.

Page 26: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Snooping Caches

The write through cache coherence protocol. The empty boxes indicate that no action is taken.

Page 27: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

The MESI Cache Coherence Protocol

The MESI cache coherence protocol.

Page 28: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

UMA Multiprocessors Using Crossbar Switches

(a) An 8 × 8 crossbar switch. (b) An open crosspoint. (c) A closed crosspoint.

Page 29: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

UMA Multiprocessors Using Multistage Switching Networks (1)

(a) A 2 × 2 switch.

(b) A message format.

Page 30: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

UMA Multiprocessors Using Multistage Switching Networks (2)

An omega switching network.

Page 31: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

NUMA Multiprocessors

A NUMA machine based on two levels of buses. The Cm* was

the first multiprocessor to use this design.

Page 32: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Cache Coherent NUMA Multiprocessors

(a) A 256-node directory-based multiprocessor. (b) Division of a 32-bit

memory address into fields. (c) The directory at node 36.

Page 33: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

The Sun Fire E25K NUMA Multiprocessor (1)

The Sun Microsystems E25K multiprocessor.

Page 34: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

The Sun Fire E25K NUMA Multiprocessor (2)

The SunFire E25K uses a four-level interconnect. Dashed linesare address paths. Solid lines are data paths.

Page 35: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Message-Passing Multicomputers

A generic multicomputer.

Page 36: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Topology

Various topologies. The heavy dots represent switches. The CPUsand memories are not shown. (a) A star. (b) A complete interconnect.

(c) A tree. (d) A ring. (e) A grid. (f) A double torus. (g) A cube. (h) A 4D hypercube.

Page 37: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

BlueGene (1)

The BlueGene/L custom processor chip.

Page 38: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

BlueGene (2)

The BlueGene/L. (a) Chip. (b) Card. (c) Board. (d) Cabinet. (e) System.

Page 39: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Red Storm (1)

Packaging of the Red Storm components.

Page 40: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Red Storm (2)

The Red Storm system as viewed from above.

Page 41: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

A Comparison of BlueGene/L and Red Storm

A comparison of

BlueGene/L and Red Storm.

Page 42: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Google (1)

Processing of a Google query.

Page 43: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Google (2)

A typical Google

cluster.

Page 44: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Scheduling

Scheduling a cluster. (a) FIFO. (b) Without head-of-line blocking. (c) Tiling. The shaded areas indicate idle CPUs.

Page 45: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Distributed Shared Memory (1)

A virtual address space consisting of 16 pages

spread over four nodes of a multicomputer.

(a) The initial situation. ….

Page 46: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Distributed Shared Memory (2)

A virtual address space consisting of 16 pages

spread over four nodes of a multicomputer. …

(b) After CPU 0 references page 10. …

Page 47: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Distributed Shared Memory (3)

A virtual address space consisting of 16 pages

spread over four nodes of a multicomputer. …

(c) After CPU 1 references page 10, here assumed to be a read-only page.

Page 48: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Linda

Three Linda tuples.

Page 49: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Orca

A simplified ORCA stack object, with internal data and two operations.

Page 50: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Software Metrics (1)

Real programs achieve less than the perfect speedup indicated by the dotted line.

Page 51: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Software Metrics (2)

(a) A program has a sequential part and a parallelizable part. (b) Effect of running part of the program in parallel.

Page 52: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Achieving High Performance

(a) A 4-CPU bus-based system. (b) A 16-CPU bus-based system.

(c) A 4-CPU grid-based system. (d) A 16-CPU grid-based system.

Page 53: Parallel Computer Architectures - pdplab.it.uom.grpdplab.it.uom.gr/teaching/tanenbaum/8.pdf · Parallel Computer Architectures ... Taxonomy of Parallel Computers (1) Flynn’s taxonomy

Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved. 0-13-148521-0

Grid Computing

The grid layers.