<div class="csl-bib-body">
<div class="csl-entry">Hunold, S. (2022). <i>Performance Tuning of MPI Collectives - Status Quo and Open Problems</i> [Presentation]. CaSToRC HPC National Competence Center Fall Seminar Series 2022, Unknown. http://hdl.handle.net/20.500.12708/153709</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/153709
-
dc.description.abstract
MPI collective operations such as MPI_Allreduce are fundamental basic blocks of large-scale applications in High Performance Computing. Since the MPI standard only defines the semantics of MPI communication operations, MPI implementations (e.g., Open MPI or MPICH) are free to implement the collective operations the best way possible. For important collective operations, e.g. MPI_Allreduce and MPI_Bcast, MPI libraries provide several algorithms for each operation.
In this talk, we investigate the problem of tuning MPI collective operations on a given supercomputer, i.e., selecting the best algorithm for a communication problem. For example, we would like to answer the following question: what is the fastest algorithm to execute an MPI_Bcast with 100 Bytes of data using 16 compute nodes and 32 processes per compute node. We also need to discuss the accuracy of methods that support the analysis of MPI applications, such as profiling or tracing. In addition, we show that basic methods from statistics and machine learning can help us to find efficient algorithms for collective operations in a practical setting.
en
dc.description.sponsorship
Fonds zur Förderung der wissenschaftlichen Forschung (FWF)
-
dc.language.iso
en
-
dc.subject
MPI Collectives
en
dc.title
Performance Tuning of MPI Collectives - Status Quo and Open Problems
en
dc.type
Presentation
en
dc.type
Vortrag
de
dc.relation.grantno
P33884-N
-
dc.type.category
Presentation
-
tuw.publication.invited
invited
-
tuw.project.title
Offline- und Online-Autotuning von Parallelen Programmen
-
tuw.researchTopic.id
I2
-
tuw.researchTopic.id
C5
-
tuw.researchTopic.name
Computer Engineering and Software-Intensive Systems
-
tuw.researchTopic.name
Computer Science Foundations
-
tuw.researchTopic.value
90
-
tuw.researchTopic.value
10
-
tuw.publication.orgunit
E191-04 - Forschungsbereich Parallel Computing
-
tuw.event.name
CaSToRC HPC National Competence Center Fall Seminar Series 2022