computer architecture: a quantitative approach - cap4 - section 3
TRANSCRIPT
![Page 1: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/1.jpg)
Multiprocessors and Thread-Level Parallelism
Performance of SSM Multiprocessors
“The overall cache performance is a combination of the behavior of uniprocessor
cache miss traffic and the traffic caused by communication, which results in invalidations
and subsequent cache misses.”
Hennessy and Patterson
![Page 2: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/2.jpg)
True sharing miss X False sharing miss
![Page 3: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/3.jpg)
A Commercial Workload
AlphaServer 4100
300MHz, 3 level caches
3 Benchmarks
![Page 4: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/4.jpg)
Performance of the Commercial Workload
![Page 5: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/5.jpg)
Performance of the Commercial Workload
![Page 6: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/6.jpg)
Performance of the Commercial Workload
![Page 7: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/7.jpg)
Performance of the Commercial Workload
![Page 8: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/8.jpg)
Performance of the Commercial Workload
![Page 9: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/9.jpg)
A Multiprogramming and OS Workload
2 level caches
2 copies of Andrew benchmark
Execution time is broken into 4 components: Idle, User, Synchronization and Kernel
![Page 10: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/10.jpg)
Performance of the Multiprog. And OS Workload
![Page 11: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/11.jpg)
Performance of the Multiprog. And OS Workload
![Page 12: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/12.jpg)
Performance of the Multiprog. And OS Workload
![Page 13: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/13.jpg)
Performance of the Multiprog. And OS Workload
![Page 14: Computer Architecture: A quantitative approach - Cap4 - Section 3](https://reader036.vdocument.in/reader036/viewer/2022082921/5562cd94d8b42a49398b48f7/html5/thumbnails/14.jpg)
Thank you!
Author: Prof. Sergio Takeo, Marcelo Arbore.
Bibliography: Patterson, D. A.; Hennessy, J. L. Computer Architecture: A quantitative Approach, 4th Ed. Morgan Kaufmann Publishers.
“The overall cache performance is a combination of the behavior of uniprocessor
cache miss traffic and the traffic caused by communication, which results in invalidations
and subsequent cache misses.”
Hennessy and Patterson