<div class="csl-bib-body">
<div class="csl-entry">May, D., Tundo, A., Ilager, S., & Brandic, I. (2024). <i>DynaSplit: A Hardware-Software Co-Design Framework for Energy-Aware Inference on Edge</i>. arXiv. https://doi.org/10.48550/arXiv.2410.23881</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/226151
-
dc.description.abstract
The deployment of ML models on edge devices is challenged by limited computational resources and energy availability. While split computing enables the decomposition of large neural networks (NNs) and allows partial computation on both edge and cloud devices, identifying the most suitable split layer and hardware configuration is a non-trivial task. This process is hindered by the large configuration space, the non-linear dependencies between software and hardware parameters, the heterogeneous hardware and energy characteristics, and the dynamic workload conditions. To overcome this challenge, we propose DynaSplit, a two-phase framework that dynamically configures parameters across both software (i.e., split layer) and hardware (e.g., accelerator usage, CPU frequency). During the Offline Phase, we solve a multi-objective optimization problem with a meta-heuristic approach to discover optimal settings. During the Online Phase, a scheduling algorithm identifies the most suitable settings for an incoming inference request and configures the system accordingly. We evaluate DynaSplit using popular pre-trained NNs on a real-world testbed. Experimental results show a reduction in energy consumption of up to 72% compared to cloud-only computation, while meeting the latency thresholds of approximately 90% of user requests, outperforming the baselines.
en
dc.language.iso
en
-
dc.subject
Edge AI
en
dc.subject
Split Computing
en
dc.subject
Hardware-Software Co-Design
en
dc.subject
Multi-Objective Optimization
en
dc.title
DynaSplit: A Hardware-Software Co-Design Framework for Energy-Aware Inference on Edge