<div class="csl-bib-body">
<div class="csl-entry">Schuchart, J., Hunold, S., & Bosilca, G. (2023). Synchronizing MPI Processes in Space and Time. In <i>EuroMPI '23: Proceedings of the 30th European MPI Users' Group Meeting</i> (pp. 1–11). ACM. https://doi.org/10.1145/3615318.3615325</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/188980
-
dc.description.abstract
Performance benchmarks are an integral part of the development and evaluation of parallel algorithms, both in distributed applications and in MPI implementations themselves. The initial step of the benchmark process is to obtain a common timestamp that marks the start of an operation across all involved processes; the state of the art in many applications and widely used MPI benchmark suites is to use MPI barriers for this purpose. In this paper, we show that the synchronization in space provided by MPI_Barrier is insufficient for obtaining proper benchmark results for parallel distributed algorithms, using MPI collective operations as examples. The resulting lack of a global start timestamp for an operation leads to skewed results, on which the choice of barrier algorithm has a significant impact. To mitigate these issues, we propose and discuss the implementation of MPIX_Harmonize, which extends the synchronization in space provided by MPI_Barrier with a time synchronization to guarantee a common starting timestamp across all involved processes. By replacing MPI_Barrier with MPIX_Harmonize, benchmark implementors can eliminate skews resulting from barrier algorithms and achieve stable performance benchmark results. We show that proper time synchronization can have a significant impact on the benchmark results for various implementations of MPI_Allreduce, MPI_Reduce, and MPI_Bcast.
en
dc.description.sponsorship
FWF Fonds zur Förderung der wissenschaftlichen Forschung (FWF)
-
dc.language.iso
en
-
dc.subject
MPI
en
dc.subject
collective communication
en
dc.subject
process synchronization
en
dc.subject
clock synchronization
en
dc.subject
OSU benchmarks
en
dc.subject
reduce
en
dc.subject
allreduce
en
dc.subject
broadcast
en
dc.subject
barrier
en
dc.title
Synchronizing MPI Processes in Space and Time
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.contributor.affiliation
University of Tennessee at Knoxville, United States of America
-
dc.contributor.affiliation
University of Tennessee at Knoxville, United States of America
-
dc.relation.isbn
9798400709135
-
dc.relation.doi
10.1145/3615318
-
dc.description.startpage
1
-
dc.description.endpage
11
-
dc.relation.grantno
P33884-N
-
dc.type.category
Full-Paper Contribution
-
tuw.booktitle
EuroMPI '23: Proceedings of the 30th European MPI Users' Group Meeting
-
tuw.relation.publisher
ACM
-
tuw.relation.publisherplace
New York
-
tuw.book.chapter
7
-
tuw.project.title
Offline and Online Autotuning of Parallel Programs
-
tuw.researchTopic.id
I2
-
tuw.researchTopic.id
C5
-
tuw.researchTopic.name
Computer Engineering and Software-Intensive Systems
-
tuw.researchTopic.name
Computer Science Foundations
-
tuw.researchTopic.value
90
-
tuw.researchTopic.value
10
-
tuw.publication.orgunit
E191-04 - Forschungsbereich Parallel Computing
-
tuw.publisher.doi
10.1145/3615318.3615325
-
dc.description.numberOfPages
11
-
tuw.author.orcid
0000-0003-2041-7877
-
tuw.author.orcid
0000-0002-5280-3855
-
tuw.author.orcid
0000-0003-2411-8495
-
tuw.event.name
EuroMPI '23: 30th European MPI Users' Group Meeting