<div class="csl-bib-body">
<div class="csl-entry">Lackinger, A., Frangoudis, P., Cilic, I., Furutanpey, A., Murturi, I., Podnar Zarko, I., & Dustdar, S. (2024). <i>Inference Load-Aware Orchestration for Hierarchical Federated Learning</i>. arXiv. https://doi.org/10.34726/8212</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/208559
-
dc.identifier.uri
https://doi.org/10.34726/8212
-
dc.description.abstract
Hierarchical federated learning (HFL) designs introduce intermediate aggregator nodes between clients and the global federated learning server in order to reduce communication costs and distribute server load. One side effect is that machine learning model replication at scale comes "for free" as part of the HFL process: model replicas are hosted at the client end, intermediate nodes, and the global server level and are readily available for serving inference requests. This creates opportunities for efficient model serving but simultaneously couples the training and serving processes and calls for their joint orchestration. This is particularly important for continual learning, where serving a model while (re)training it periodically, upon specific triggers, or continuously, takes place over shared infrastructure spanning the computing continuum. Consequently, training and inference workloads can interfere with each other, with detrimental effects on performance. To address this issue, we propose an inference load-aware HFL orchestration scheme, which makes informed decisions on HFL configuration, considering knowledge about inference workloads and the respective processing capacity. Applying our scheme to a continual learning use case in the transportation domain, we demonstrate that by optimizing aggregator node placement and device-aggregator association, significant inference latency savings can be achieved while communication costs are drastically reduced compared to flat centralized federated learning.
en
dc.description.sponsorship
European Commission
-
dc.description.sponsorship
European Commission
-
dc.language.iso
en
-
dc.rights.uri
http://creativecommons.org/licenses/by/4.0/
-
dc.subject
Federated learning
en
dc.subject
service orchestration
en
dc.subject
continual learning
en
dc.subject
edge computing
en
dc.title
Inference Load-Aware Orchestration for Hierarchical Federated Learning
en
dc.type
Preprint
en
dc.type
Preprint
de
dc.rights.license
Creative Commons Namensnennung 4.0 International
de
dc.rights.license
Creative Commons Attribution 4.0 International
en
dc.identifier.doi
10.34726/8212
-
dc.identifier.arxiv
2407.16836
-
dc.contributor.affiliation
University of Zagreb, Croatia
-
dc.contributor.affiliation
Faculty of Electrical Engineering and Computing in Zagreb, Croatia
-
dc.relation.grantno
101079214
-
dc.relation.grantno
101135576
-
tuw.project.title
Twinning action for spreading excellence in Artificial Intelligence of Things
-
tuw.project.title
Intent-based data operation in the computing continuum