tau performance system tau-team@cs.uoregon.edu

Post on 15-Jan-2016

236 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

TAU Performance Systemhttp://tau.uoregon.edu

tau-team@cs.uoregon.edu

TAU Performance SystemIBM Blue Gene Consortium 2

TAU Parallel Performance System

http://tau.uoregon.edu/ Multi-level performance instrumentation

Multi-language automatic source instrumentation Flexible and configurable performance measurement Widely-ported parallel performance profiling system

Computer system architectures and operating systems Different programming languages and compilers

Support for multiple parallel programming paradigms Multi-threading, message passing, mixed-mode, hybrid

TAU Performance SystemIBM Blue Gene Consortium 3

TAU Port to IBM BG/P Supports automatic instrumentation at:

Source level (PDT, tau_instrumentor; KOJAK, opari) MPI

Flexible and configurable performance measurement Support for profiling and tracing Support for PAPI counters on BG/P Uses bgxlC_r, bgxlc_r, bgxlf90_r as compilers

To configure TAU: ./installtau -arch=bgp -mpi -pdt=<dir> -pdt_c++=xlC -papi=<dir> ./tau_validate --html --build bgp >& results.html

Parallel Profile Analysis: Paraprof profile browser PerfDMF profile database Perfexplorer cross-experiment data analysis toolkit

TAU Performance SystemIBM Blue Gene Consortium 4

Using TAU on IBM BGP (surveyor.alcf.anl.gov) Choose measurement configuration

% ls /soft/apps/tau/tau_latest/bgp/lib/Makefile.*Makefile.tau-mpi-pdtMakefile.tau-mpi-pdt-traceMakefile.tau-callpath-mpi-pdtMakefile.tau-callpath-mpi-compensate-pdtMakefile.tau-depthlimit-mpi-pdtMakefile.tau-mpi-compensate-pdtMakefile.tau-multiplecounters-mpi-papi-pdtMakefile.tau-multiplecounters-mpi-papi-pdt-traceMakefile.tau-multiplecounters-papi-pdtMakefile.tau-multiplecounters-pthread-papi-pdtMakefile.tau-pdtMakefile.tau-phase-multiplecounters-mpi-compensate-papi-pdtMakefile.tau-phase-multiplecounters-mpi-papi-pdtMakefile.tau-pthread-pdt …

% setenv TAU_MAKEFILE /soft/apps/tau/tau-2.17/bgp/lib/Makefile.tau-mpi-pdt

% set path=(/soft/apps/tau/tau-2.17/ppc64/bin $path) # Front-end binaries

Replace mpixlf90_r with tau_f90.sh and compile your application Use tau_cxx.sh and tau_cc.sh for C++ and C compilers respectively

TAU Performance SystemIBM Blue Gene Consortium 5

Using TAU on IBM BGP (surveyor.alcf.anl.gov) Choose measurement configuration

% ls /soft/apps/tau/tau_latest/bgp/lib/Makefile.*Makefile.tau-mpi-pdtMakefile.tau-mpi-pdt-traceMakefile.tau-callpath-mpi-pdtMakefile.tau-callpath-mpi-compensate-pdtMakefile.tau-depthlimit-mpi-pdtMakefile.tau-mpi-compensate-pdtMakefile.tau-multiplecounters-mpi-papi-pdtMakefile.tau-multiplecounters-mpi-papi-pdt-traceMakefile.tau-multiplecounters-papi-pdtMakefile.tau-multiplecounters-pthread-papi-pdtMakefile.tau-pdtMakefile.tau-phase-multiplecounters-mpi-compensate-papi-pdtMakefile.tau-phase-multiplecounters-mpi-papi-pdtMakefile.tau-pthread-pdt …

% setenv TAU_MAKEFILE /soft/apps/tau/tau-2.17/bgp/lib/Makefile.tau-mpi-pdt% set path=(/soft/apps/tau/tau-2.17/ppc64/bin $path) # Front-end binaries

Replace mpixlf90_r with tau_f90.sh and compile your application Use tau_cxx.sh and tau_cc.sh for C++ and C compilers respectively

Visualize performance data with paraprof, pprof, vampir, jumpshot

TAU Performance SystemIBM Blue Gene Consortium 6

TAU’s ParaProf 3D Profile Browser: Matmult

TAU Performance SystemIBM Blue Gene Consortium 7

Profiling FLASH3 on IBM BG/P

TAU Performance SystemIBM Blue Gene Consortium 8

Sedov 2D Auto

• Initial test run did not include a load balanced problem• Small problem: too little work for 1024 processor• Proof of concept to validate porting of tools

TAU Performance SystemIBM Blue Gene Consortium 9

PerfExplorer: Cross Experiment Analysis

TAU Performance SystemIBM Blue Gene Consortium 10

TAU PerfExplorer: Runtime Breakdown

MPI_Barrier

IO_OUTPUT

TAU Performance SystemIBM Blue Gene Consortium 11

Relative Efficiency

TAU Performance SystemIBM Blue Gene Consortium 12

Relative Speedup for One Event

TAU Performance SystemIBM Blue Gene Consortium 13

TAU’s PerfExplorer: IBM BG/P

TAU Performance SystemIBM Blue Gene Consortium 14

TAU Portal

TAU portal supports the FLASH regression testing Allows groups to share profiling data in a secure way Allows users to launch TAU performance tools (paraprof,

perfexplorer) Nightly regression testcases uploaded to the database

automatically SVN checkout each night TAU: http://www.cs.uoregon.edu/research/tau TAU Portal: http://tau.nic.uoregon.edu

TAU Performance SystemIBM Blue Gene Consortium 15

Portal: Nightly Performance Regression Testing

TAU Performance SystemIBM Blue Gene Consortium 16

TAU Portal: Launch ParaProf/PerfExplorer

TAU Performance SystemIBM Blue Gene Consortium 17

PerfExplorer: Regression Testing

TAU Performance SystemIBM Blue Gene Consortium 18

PerfExplorer: Limiting Events (> 3% ), Oct 2007

TAU Performance SystemIBM Blue Gene Consortium 19

PerfExplorer: Exclusive Time for Events (2007)

TAU Performance SystemIBM Blue Gene Consortium 20

ParaProf: 3D Visualization

TAU Performance SystemIBM Blue Gene Consortium 21

Support Acknowledgements

Department of Energy (DOE)

Office of Science LLNL, LANL, ASC Argonne National Laboratory University of Chicago

Department of Defense NSF

top related