Efficient process mapping for cartesian topologies

Lehr, Markus

doi:10.34726/hss.2019.65323

DC Field

Value

Language

dc.contributor.advisor

Träff, Jesper Larsson

dc.contributor.author

Lehr, Markus

dc.date.accessioned

2020-06-27T20:52:05Z

dc.date.issued

2019

dc.date.submitted

2020-01

dc.identifier.citation

<div class="csl-bib-body"> <div class="csl-entry">Lehr, M. (2019). <i>Efficient process mapping for cartesian topologies</i> [Diploma Thesis, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2019.65323</div> </div>

dc.identifier.uri

https://doi.org/10.34726/hss.2019.65323

dc.identifier.uri

http://hdl.handle.net/20.500.12708/1480

dc.description.abstract

In this thesis we introduce different algorithms for mapping physical processes to Cartesian grids. We assume that processes within this grid communicate with certain neighboring processes as defined by a given stencil. We show that this mapping problem is already NP-complete for two dimensional grids and a very simple isomorphic neighborhood. We compare the current state of solutions in the field of High Performance Computing, specifically the Message Passing Interface (MPI). With his algorithm from 2018, W. D. Gropp showed promising performance, which is why we compare our approaches to his work, as well as MPI's standard behaviour. For qualitatively comparing concrete mappings, we define fitting optimality criteria, based on the core concept of minimizing inter-computation-node communication. We benchmarked using MPI_Irecv and MPI_Isend and show that our algorithms can find mappings, where Gropp cannot and that we could match or improve upon his resulting mappings in terms of quality and runtime. The first algorithm, which we present is similar to other graph partitioning approaches, since it utilizes recursive splitting in order to guarantee a logarithmic runtime wrt. the number of vertices in the Cartesian grid. We also guarantee a quality bound for this algorithm, which becomes better with increasing number of dimensions. In another implementation-variant, we adapted this algorithm for accommodating differently shaped stencils by weighting dimensions before the recursive splitting depending on how much communication happens across them. The third approach attempts to find hyperrectangular strips within the Cartesian grid, which are then filled similar to MPI's default row-major rank assignment. Although the theoretical bound of this approach is not as good as the first, this assignment strategy yielded the most compact mappings most of the time. Its main shortcoming, however, is its inability to adapt these strips, depending on different stencils.

dc.language

English

dc.language.iso

dc.rights.uri

http://rightsstatements.org/vocab/InC/1.0/

dc.subject

Parallel, distributed memory computing

dc.subject

High-Performance computing

dc.subject

Process mapping

dc.subject

Communication patterns

dc.subject

Grids

dc.subject

Stencils

dc.subject

MPI

dc.subject

Parallel, distributed memory computing

dc.subject

High-Performance computing

dc.subject

Process mapping

dc.subject

Communication patterns

dc.subject

Grids

dc.subject

Stencils

dc.subject

MPI

dc.title

Efficient process mapping for cartesian topologies

dc.type

Thesis

dc.type

Hochschulschrift

dc.rights.license

In Copyright

dc.rights.license

Urheberrechtsschutz

dc.identifier.doi

10.34726/hss.2019.65323

dc.contributor.affiliation

TU Wien, Österreich

dc.rights.holder

Markus Lehr

dc.publisher.place

Wien

tuw.version

vor

tuw.thesisinformation

Technische Universität Wien

tuw.publication.orgunit

E191 - Institut für Computer Engineering

dc.type.qualificationlevel

Diploma

dc.identifier.libraryid

AC15563558

dc.description.numberOfPages

dc.identifier.urn

urn:nbn:at:at-ubtuw:1-134088

dc.thesistype

Diplomarbeit

dc.thesistype

Diploma Thesis

dc.rights.identifier

In Copyright

dc.rights.identifier

Urheberrechtsschutz

tuw.advisor.staffStatus

staff

tuw.advisor.orcid

0000-0002-4864-9226

item.languageiso639-1

item.openairetype

master thesis

item.grantfulltext

open

item.fulltext

with Fulltext

item.cerifentitytype

Publications

item.mimetype

application/pdf

item.openairecristype

http://purl.org/coar/resource_type/c_bdcc

item.openaccessfulltext

Open Access

crisitem.author.dept

E191-04 - Forschungsbereich Parallel Computing

crisitem.author.parentorg

E191 - Institut für Computer Engineering

Appears in Collections:

Thesis

Fulltext (Version of Record (published version))

Adobe PDF

(7.01 MB)

In Copyright

Show simple item record

Page view(s)

438

checked on Nov 19, 2023

Download(s)

191

checked on Nov 19, 2023

Google Scholar^TM

Check

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM