Download - HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea
![Page 1: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/1.jpg)
Get Performance on Intel® Xeon Phi™ with Allinea MAP and Allinea DDT
Discovering bottlenecks without pain
![Page 2: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/2.jpg)
… we develop new antibiotics faster than bacteria develop resistance
... every household can prototype and evolve their own 3D-printed designs
… accurate simulation of the natural world is taken for granted
In my Parallel Universe…
![Page 3: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/3.jpg)
… create parallel development tools for scientists:
So I decided to…
We’re accelerating the pace of scientific progress
![Page 4: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/4.jpg)
HPC on the critical path to progress
Single Core Era Multi-Core Era Many-Core Era
Constraints :- Power- Complexity of algorithms
Constraints :- Power- Parallel software
availability- Scalability
Constraints :- Programming models
Per
form
ance
Time(years)
![Page 5: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/5.jpg)
• Parallel profiler designed for:
‒ C/C++, Fortran
‒ MPI code
Interdependent or independent processes
‒ Multithreaded code
Monitor the main threads for each process
‒ Accelerated codes
GPUs, Intel® Xeon Phi™
• Improve productivity :‒ Helps you detect performance issues quickly and easily
‒ Tells you immediately where your time is spent in your source code
‒ Helps you to optimize your application efficiently
Allinea MAPIncrease application performance
![Page 6: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/6.jpg)
• Support for I/O metrics
‒ I/O can be a major bottleneck in HPC systems
‒ Find the optimal configuration for your file system.
Benefit : Broader profiling and analysis capabilities to solve
even more performance issues.
• Support for Intel® Xeon Phi™
‒ Already supported on Allinea DDT
‒ Officially extended to profiling
Benefit : Ensure you are getting the best performance from
new technology.
Allinea MAP 4.2New features in 2013
NEW
![Page 7: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/7.jpg)
Optimizing for Intel® Xeon Phi™Where do you start?
“Code that’s well-optimized for the host usually performs pretty well on the cards”
- Almost everybody
![Page 8: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/8.jpg)
Optimizing for Intel® Xeon Phi™But what matters?
Vectorization
Other stuff
Performance
![Page 9: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/9.jpg)
Optimizing for Intel® Xeon Phi™Is my code well-vectorized?
… maybe?
![Page 10: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/10.jpg)
Allinea Performance ReportsIs my code well-vectorized?
![Page 11: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/11.jpg)
Optimizing for Intel® Xeon Phi™Is my code well-vectorized?
… maybe?
![Page 12: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/12.jpg)
Optimizing for Intel® Xeon Phi™Is my code well-vectorized?
… maybe?
Not in this loop
(16.5% of total time)
![Page 13: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/13.jpg)
• Full, graphical debugger designed for :
‒ C/C++, Fortran, Intel® Xeon Phi™, UPC, …
‒ MPI, OpenMP and mixed-mode code
• Unified interface with Allinea MAP :
‒ Just what you need when you’ve added
OpenMP and now everything segfaults!
‒ One interface eliminates learning curve
‒ Spend more time on your results
• Slash your time to develop :
‒ Reproduces and triggers your bugs instantly
‒ Helps you easily understand where issues come from quickly
‒ Helps you to fix them as swiftly as possible
Allinea DDTUnified interface for debugging
![Page 14: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/14.jpg)
Allinea at the forefront of science with COSMOS and Intel® Xeon Phi™
“While I was porting CAMB to offload certain parts of it to Intel® Xeon
Phi™, I wasted weeks debugging it because the offloads were basically
opaque. I only had print statements to help me.”
![Page 15: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/15.jpg)
Allinea at the forefront of science with COSMOS and Intel® Xeon Phi™
“Using DDT's new offload debugging I can now look at the offload
code and look at the state of the array on the Intel® Xeon Phi™ side
before it is manipulated”
![Page 16: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/16.jpg)
Allinea at the forefront of science with COSMOS and Intel® Xeon Phi™
Fix is easy - either set NOCOPY->IN or just set the thing
to zero on the MIC side which is probably cheaper.”
![Page 17: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/17.jpg)
Allinea at the forefront of science with COSMOS and Intel® Xeon Phi™
“I’m now using MAP – it shows that the code is fairly well vectorised at 70%.
This will have to be improved a bit to get the most out of the coprocessors.”
![Page 18: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/18.jpg)
• Ten years of high-quality development tools
‒ Leading in HPC software tools market worldwide
‒ Global customer base
• Making parallel programming accessible to the widest range of
scientists and programmers
‒ Design an unrivaled productive and easy-to-use development
environment…
‒ … To help you reach the highest level of performance and scalability
‒ Define a new standard of customer support
Allinea Software
![Page 19: HPC Performance & Development Tuning tools for scientists to go parallel faster with allinea](https://reader037.vdocument.in/reader037/viewer/2022110115/54c691af4a7959c2788b45a2/html5/thumbnails/19.jpg)
Summary
The premier Intel® Xeon Phi™ development environment from Allinea
– Is your code ready for Intel® Xeon Phi™? Run a Performance Report!
– See which loops are important to vectorize with Allinea MAP
– Stay productive with full profiling and debugging on both host and
coprocessor
– Powerful unified interface with industry-leading technical support to help
you get the job finished faster
Visit us at our booth #1719 to see this in action!
Enter our Performance Reports competition to win a Kindle Fire every day!