pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations

Laso Rodriguez, Ruben; Krupitza, Diego; Hunold, Sascha

doi:10.48550/arXiv.2402.06384

Datensatz Zitierlink:

http://hdl.handle.net/20.500.12708/198793

Titel:

pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations

Zitat:

Laso Rodriguez, R., Krupitza, D., & Hunold, S. (2024). pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations. arXiv. https://doi.org/10.48550/arXiv.2402.06384

CatalogPlus:

AC17228368

Verlags-DOI:

10.48550/arXiv.2402.06384

Publikationstyp:

Preprint

Sprache:

Englisch

Autor_innen:

Laso Rodriguez, Ruben
Krupitza, Diego
Hunold, Sascha

Organisationseinheit:

E191-04 - Forschungsbereich Parallel Computing

ArXiv-ID:

2402.06384

Datum (veröffentlicht):

9-Feb-2024

Umfang:

Preprint-Server:

arXiv

Keywords:

Performance Portability; C++; Standard Template Library; Threading Building Blocks; OpenMP; CUDA

Abstract:

Since the advent of parallel algorithms in the C++17 Standard Template Library (STL), the STL has become a viable framework for creating performance-portable applications. Given multiple existing implementations of the parallel algorithms, a systematic, quantitative performance comparison is essential for choosing the appropriate implementation for a particular hardware configuration. In this work, we introduce a specialized set of micro-benchmarks to assess the scalability of the parallel algorithms in the STL. By selecting different backends, our micro-benchmarks can be used on multi-core systems and GPUs. Using the suite, in a case study on AMD and Intel CPUs and NVIDIA GPUs, we were able to identify substantial performance disparities among different implementations, including GCC+TBB, GCC+HPX, Intel's compiler with TBB, or NVIDIA's compiler with OpenMP and CUDA.

Forschungsschwerpunkte:

Computer Engineering and Software-Intensive Systems: 90%
Computer Science Foundations: 10%

Wissenschaftszweig:

1020 - Informatik: 100%

Lizenz:

CC BY 4.0