Exploring Scalability in C++ Parallel STL Implementations

Laso Rodriguez, Ruben; Krupitza, Diego; Hunold, Sascha

doi:10.1145/3673038.3673065

DC Element

Value

Language

dc.contributor.author

Laso Rodriguez, Ruben

dc.contributor.author

Krupitza, Diego

dc.contributor.author

Hunold, Sascha

dc.date.accessioned

2024-11-19T11:51:08Z

dc.date.available

2024-11-19T11:51:08Z

dc.date.issued

2024

dc.identifier.citation

Laso Rodriguez, R., Krupitza, D., & Hunold, S. (2024). Exploring Scalability in C++ Parallel STL Implementations. In ICPP ’24: Proceedings of the 53rd International Conference on Parallel Processing (pp. 284–293). ACM. https://doi.org/10.1145/3673038.3673065

dc.identifier.uri

http://hdl.handle.net/20.500.12708/204481

dc.description.abstract

Since the advent of parallel algorithms in the C++17 Standard Template Library (STL), the STL has become a viable framework for creating performance-portable applications. Given multiple existing implementations of the parallel algorithms, a systematic, quantitative performance comparison is essential for choosing the appropriate implementation for a particular hardware configuration. In this work, we introduce a specialized set of micro-benchmarks to assess the scalability of the parallel algorithms in the STL. By selecting different backends, our micro-benchmarks can be used on multi-core systems and GPUs. Using the suite, in a case study on AMD and Intel CPUs and NVIDIA GPUs, we were able to identify substantial performance disparities among different implementations, including GCC+TBB, GCC+HPX, Intel's compiler with TBB, or NVIDIA's compiler with OpenMP and CUDA.

dc.description.sponsorship

FWF - Österr. Wissenschaftsfonds

dc.language.iso

dc.rights.uri

https://creativecommons.org/licenses/by-nc-sa/4.0/

dc.subject

C++

dc.subject

CUDA

dc.subject

OpenMP

dc.subject

Performance Portability

dc.subject

Standard Template Library

dc.subject

Threading Building Blocks

dc.title

Exploring Scalability in C++ Parallel STL Implementations

dc.type

Inproceedings

dc.type

Conference contribution

dc.rights.license

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International

dc.relation.isbn

9798400717932

dc.description.startpage

284

dc.description.endpage

293

dc.relation.grantno

P 33884-N

dc.rights.holder

dc.type.category

Full-Paper Contribution

tuw.booktitle

ICPP '24: Proceedings of the 53rd International Conference on Parallel Processing

tuw.peerreviewed

true

tuw.relation.publisher

ACM

tuw.relation.publisherplace

New York, NY, United States

tuw.project.title

Offline- und Online-Autotuning von Parallelen Programmen

tuw.researchTopic.id

tuw.researchTopic.name

Computer Engineering and Software-Intensive Systems

tuw.researchTopic.name

Computer Science Foundations

tuw.researchTopic.value

tuw.publication.orgunit

E191-04 - Forschungsbereich Parallel Computing

tuw.publisher.doi

10.1145/3673038.3673065

dc.identifier.libraryid

AC17364268

dc.description.numberOfPages

tuw.author.orcid

0000-0003-2574-4025

tuw.author.orcid

0009-0007-2123-733X

tuw.author.orcid

0000-0002-5280-3855

dc.rights.identifier

CC BY-NC-SA 4.0

tuw.event.name

53rd International Conference on Parallel Processing (ICPP 2024)

tuw.event.startdate

12-08-2024

tuw.event.enddate

15-08-2024

tuw.event.online

On Site

tuw.event.type

Event for scientific audience

tuw.event.place

Gotland

tuw.event.country

tuw.event.presenter

Laso Rodriguez, Ruben

wb.sciencebranch

Computer Science

wb.sciencebranch.oefos

1020

wb.sciencebranch.value

100

item.openaccessfulltext

Open Access

item.openairecristype

http://purl.org/coar/resource_type/c_5794

item.mimetype

application/pdf

item.fulltext

with Fulltext

item.cerifentitytype

Publications

item.grantfulltext

open

item.openairetype

conference paper

item.languageiso639-1

crisitem.author.dept

E191-04 - Forschungsbereich Parallel Computing

crisitem.author.dept

E191-04 - Forschungsbereich Parallel Computing

crisitem.author.orcid

0000-0003-2574-4025

crisitem.author.orcid

0009-0007-2123-733X

crisitem.author.orcid

0000-0002-5280-3855

crisitem.author.parentorg

E191 - Institut für Computer Engineering

crisitem.author.parentorg

E191 - Institut für Computer Engineering

crisitem.project.funder

FWF - Österr. Wissenschaftsfonds

crisitem.project.grantno

P 33884-N

Appears in collections:

Conference Paper

Full text (Version of Record (published version))

Adobe PDF

(995.99 kB)

CC BY-NC-SA 4.0
