computer architecture: a quantitative approach - cap4 - section 3

14
Multiprocessors and Thread-Level Parallelism Performance of SSM Multiprocessors “The overall cache performance is a combination of the behavior of uniprocessor cache miss traffic and the traffic caused by communication, which results in invalidations and subsequent cache misses.” Hennessy and Patterson

Upload: marcelo-arbore

Post on 25-May-2015

1.048 views

Category:

Technology


1 download

TRANSCRIPT

Page 1: Computer Architecture: A quantitative approach - Cap4 - Section 3

Multiprocessors and Thread-Level Parallelism

Performance of SSM Multiprocessors

“The overall cache performance is a combination of the behavior of uniprocessor

cache miss traffic and the traffic caused by communication, which results in invalidations

and subsequent cache misses.”

Hennessy and Patterson

Page 2: Computer Architecture: A quantitative approach - Cap4 - Section 3

True sharing miss X False sharing miss

Page 3: Computer Architecture: A quantitative approach - Cap4 - Section 3

A Commercial Workload

AlphaServer 4100

300MHz, 3 level caches

3 Benchmarks

Page 4: Computer Architecture: A quantitative approach - Cap4 - Section 3

Performance of the Commercial Workload

Page 5: Computer Architecture: A quantitative approach - Cap4 - Section 3

Performance of the Commercial Workload

Page 6: Computer Architecture: A quantitative approach - Cap4 - Section 3

Performance of the Commercial Workload

Page 7: Computer Architecture: A quantitative approach - Cap4 - Section 3

Performance of the Commercial Workload

Page 8: Computer Architecture: A quantitative approach - Cap4 - Section 3

Performance of the Commercial Workload

Page 9: Computer Architecture: A quantitative approach - Cap4 - Section 3

A Multiprogramming and OS Workload

2 level caches

2 copies of Andrew benchmark

Execution time is broken into 4 components: Idle, User, Synchronization and Kernel

Page 10: Computer Architecture: A quantitative approach - Cap4 - Section 3

Performance of the Multiprog. And OS Workload

Page 11: Computer Architecture: A quantitative approach - Cap4 - Section 3

Performance of the Multiprog. And OS Workload

Page 12: Computer Architecture: A quantitative approach - Cap4 - Section 3

Performance of the Multiprog. And OS Workload

Page 13: Computer Architecture: A quantitative approach - Cap4 - Section 3

Performance of the Multiprog. And OS Workload

Page 14: Computer Architecture: A quantitative approach - Cap4 - Section 3

Thank you!

Author: Prof. Sergio Takeo, Marcelo Arbore.

Bibliography: Patterson, D. A.; Hennessy, J. L. Computer Architecture: A quantitative Approach, 4th Ed. Morgan Kaufmann Publishers.

“The overall cache performance is a combination of the behavior of uniprocessor

cache miss traffic and the traffic caused by communication, which results in invalidations

and subsequent cache misses.”

Hennessy and Patterson