robert bell, allen d. malony, sameer shende {bertie,malony,shende}@cs.uoregon.edu department of...

20
Robert Bell , Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science Institute / NeuroInformatics Center University of Oregon ParaProf: A Portable, Extensible, and Scalable Tool for Parallel Performance Profile Analysis

Post on 19-Dec-2015

224 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu

Department of Computer and Information Science

Computational Science Institute / NeuroInformatics Center

University of Oregon

ParaProf: A Portable, Extensible, and Scalable Tool for Parallel Performance

Profile Analysis

Page 2: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

Outline

Motivation ParaProf Objectives Related Work ParaProf Features and Functionality Examples

512-processor SAMRAI execution Interactive demonstration

Software engineering of ParaProf Recent advancements Future work Concluding remarks

Page 3: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

Motivation

Profiling is well-known and broadly applied technique Profiling tools are not the same

Different profile instrumentation and measurement Sequential vs. parallel profiling System-specific, proprietary, and incompatible Complicates cross-platform performance studies Slows development of portable, robust profile analysis

Increased detail and complexity of profile data Hardware performance counters Integration of system and application performance data Parallel profile data / analysis and large-scale parallelism

Page 4: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

ParaProf Objectives

Portable, extensible, and scalable tool for profile analysis Offer “best of breed” capabilities to performance analysts Build as profile analysis framework for extensibility Work with different (most) types of profile data Support input of profile data from different sources Universal performance profile analysis capabilities Large-scale analysis and display support Multi-profile (multi-experiment) Programmable analysis Modular, object-oriented software engineering Broadly applied

Page 5: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

Related Work Rich history of sequential and parallel profiling tools Sequential profilers

prof and gprof Unix profiling of execution time using sampling method gprof includes callgraph profiling (parent-child distribution)

cxperf and ssrun (SGI) Hardware performance counter profiling

vprof (Visual Profiler) DynaProf

PAPI-based profiling using dynamic instrumentation HPCView

Support for multiple profile analysis

Page 6: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

Related Work (continued)

Parallel profilers GuideView and VGV

OpenMP applications (VGV also supports MPI profiling) Proprietary

Aksum Targeted to Linux systems with multiple experiment support

SvPablo Cross-platform with source-based views

Expert Trace-generated profile data Performance property/problem analysis and display

HPM Toolkit

Page 7: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

ParaProf Features

Parallel profile data “Experiment” gives profile for every thread of execution Multiple performance metrics (time, HPC, …) Based on TAU performance system Event-based profiles Support for callpath profiles

Profile data input Post-mortem from raw files Post-mortem from performance database Online from running program (in progress) Multiple experiment profiles active simultaneously

Page 8: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

ParaProf Features (continued)

Profile analysis Statistical analysis per thread and across threads Individual events and event groups Value-based and percent-based analysis Derived statistics and distribution statistics for scalability Experiment profile integration

Profile performance displays Bargraph displays Hyperlink navigation

Page 9: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

TAU Performance System Framework

Tuning and Analysis Utilities (aka Tools Are Us) Performance system framework for scalable parallel and

distributed high-performance computing Targets a general complex system computation model

nodes / contexts / threads Multi-level: system / software / parallelism Measurement and analysis abstraction

Integrated toolkit for performance instrumentation, measurement, analysis, and visualization Portable performance profiling/tracing facility Open software approach

Page 10: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

TAU Performance System Architecture

EPILOG

Paraver

Page 11: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

ParaProf Architecture

Page 12: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

ParaProfile Manager

Page 13: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

ParaProf Profile Display (VTF)

Page 14: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

Full Profile Display (SAMRAI)

512

proc

esse

s

Page 15: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

Profile Statistics Histogram (SAMRAI)

Need to address profile display scalability Statistical analysis to show performance distributions

Value histogramming showing # threads in value range Define # bins and value distribution function

Execution time (wallclock) Floating point operations

Page 16: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

Recent ParaProf Enhancements

Integration of ParaProf with DynaProf Convert DynaProf profile data to TAU format

Page 17: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

Future Work

Profile translators Sequential: prof/grof (vprof), cxperf/ssrun Parallel: SvPablo, Aksum, HPM Toolkit

Cross-experiment analysis Generalized programmable analysis engine Integration with online performance profiling in TAU

Online profile monitor in TAU currently available Analysis of profiles generated from trace phase analysis

Trace-based phase profile tool in development More sophisticated performance display graphics

Use 3D performance visualization library (in progress)

Page 18: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

Concluding Remarks

ParaProf is a portable parallel profile analysis tool ParaProf provides broad, integrated functionality Designed to analyze and display large-scale profile Designed for multi-experiment performance studies Intended to serve as a universal profile analysis system Robust design and software engineering Future work on extended analysis and visualization Future work on performance database integration

Page 19: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool

More Information

TAU performance systemwww.cs.uoregon.edu/research/paraducks/tau

Acknowledgements DOE project, “Performance Technology for Tera-Class

Parallel Computers: Evolution of the TAU Performance System,” 2001-2004.

Page 20: Robert Bell, Allen D. Malony, Sameer Shende {bertie,malony,shende}@cs.uoregon.edu Department of Computer and Information Science Computational Science

EuroPar, August 2003 00:53ParaProf: A Portable, Extensible Profile Analysis Tool