mpisee: MPI Profiling for Communication and Communicator Structure

Vardas, Ioannis; Hunold, Sascha; Ajanohoun, Jordy I.; Traff, Jesper Larsson

doi:10.1109/IPDPSW55747.2022.00092

Record link:

http://hdl.handle.net/20.500.12708/136174

Title:

mpisee: MPI Profiling for Communication and Communicator Structure

Citation:

Vardas, I., Hunold, S., Ajanohoun, J. I., & Traff, J. L. (2022). mpisee: MPI Profiling for Communication and Communicator Structure. In 2022 IEEE 36th International Parallel and Distributed Processing Symposium Workshops (IPDPSW 2022) (pp. 520–529). IEEE. https://doi.org/10.1109/IPDPSW55747.2022.00092

Publisher DOI:

10.1109/IPDPSW55747.2022.00092

Publication Type:

Inproceedings - Full-Paper Contribution

Language:

English

Authors:

Vardas, Ioannis
Hunold, Sascha
Ajanohoun, Jordy I.
Traff, Jesper Larsson

Organisational Unit:

E191-04 - Forschungsbereich Parallel Computing

Published in:

2022 IEEE 36th International Parallel and Distributed Processing Symposium Workshops (IPDPSW 2022)

ISBN:

978-1-6654-9747-3

DOI of the book:

10.1109/IPDPSW55747.2022

Date (published):

2022

Event name:

27th Workshop on High-level Parallel Programming Models and Supportive Environments (HIPS 2022) in conjunction with IEEE IPDPS 2022

Event date:

30-May-2022 - 3-Jun-2022

Event place:

Lyon, France

Number of Pages:

10

Publisher:

IEEE

Peer reviewed:

Yes

Keywords:

MPI Profiling

Abstract:

Cumulative performance profiling is a fast and lightweight method for gaining summary information about where and how communication time in parallel MPI applications is spent. MPI provides mechanisms for implementing such profilers that can be transparently used with applications. Existing profilers typically profile on a per-process basis and record the frequency, total time, and volume of MPI operations for each process. This can lead to grossly misleading cumulative information for applications that use MPI features to partition the processes into different communicators. We present a novel MPI profiler, mpisee, for communicator-centric profiling that separates and records collective and point-to-point communication information per communicator in the application. We discuss the implementation of mpisee, which makes significant use of the MPI attribute mechanism. We evaluate our tool by measuring its overhead and profiling a number of standard applications. Our measurements with thirteen MPI applications show that the overhead of mpisee is less than 3%. Moreover, using mpisee, we investigate in detail two particular MPI applications, SPLATT and GROMACS, to obtain information on the various MPI operations for the different communicators of these applications. Such information is not available from other state-of-the-art profilers. We use the communicator-centric information to improve the performance of SPLATT, resulting in a significant runtime decrease when run with 1024 processes.
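
As a rough illustration of why the aggregation key matters, the following Python sketch contrasts per-process and per-communicator accumulation of the same call records. It does not use MPI and is not mpisee's implementation: the event records, the communicator names (row_comm, col_comm), and the aggregate helper are hypothetical, and the timings and byte counts are invented.

```python
from collections import defaultdict

# Hypothetical call records: (rank, communicator, operation, seconds, bytes).
# All values are invented for illustration; they are not measurements.
EVENTS = [
    (0, "world", "MPI_Allreduce", 0.5, 1024),
    (1, "world", "MPI_Allreduce", 0.5, 1024),
    (0, "row_comm", "MPI_Allreduce", 2.0, 4096),
    (1, "row_comm", "MPI_Allreduce", 2.0, 4096),
    (0, "col_comm", "MPI_Bcast", 0.25, 512),
]


def aggregate(events, key):
    """Accumulate call count, total time, and volume under key(event)."""
    totals = defaultdict(lambda: {"calls": 0, "seconds": 0.0, "bytes": 0})
    for event in events:
        _rank, _comm, _op, seconds, nbytes = event
        entry = totals[key(event)]
        entry["calls"] += 1
        entry["seconds"] += seconds
        entry["bytes"] += nbytes
    return dict(totals)


# Process-centric view: the communicator is discarded, so all
# MPI_Allreduce calls of a rank collapse into a single entry.
per_process = aggregate(EVENTS, key=lambda e: (e[0], e[2]))

# Communicator-centric view: the same calls are kept apart per
# communicator, which shows where the time is actually spent.
per_communicator = aggregate(EVENTS, key=lambda e: (e[1], e[2]))

if __name__ == "__main__":
    for name, table in (("per process", per_process),
                        ("per communicator", per_communicator)):
        print(name)
        for k, v in sorted(table.items(), key=str):
            print("  ", k, v)
```

In the process-centric table, rank 0 shows one combined MPI_Allreduce entry, whereas the communicator-centric table attributes most of that time to the hypothetical row_comm communicator.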

Project title:

Algorithm Engineering für Prozess Mapping: P31763-N31 (Fonds zur Förderung der wissenschaftlichen Forschung (FWF))
Offline- und Online-Autotuning von Parallelen Programmen: P33884-N (Fonds zur Förderung der wissenschaftlichen Forschung (FWF))

Research Areas:

Computer Engineering and Software-Intensive Systems: 90%
Computer Science Foundations: 10%

Science Branch:

1020 - Informatik: 100%

Appears in Collections:

Conference Paper
