Cost-Aware Neural Network Splitting and Dynamic Rescheduling for Edge Intelligence

Luger, Daniel; Aral, Atakan; Brandic, Ivona

doi:10.1145/3578354.3592871

DC Field

Value

Language

dc.contributor.author

Luger, Daniel

dc.contributor.author

Aral, Atakan

dc.contributor.author

Brandic, Ivona

dc.date.accessioned

2023-12-01T11:36:24Z

dc.date.available

2023-12-01T11:36:24Z

dc.date.issued

2023-05

dc.identifier.citation

Luger, D., Aral, A., & Brandic, I. (2023). Cost-Aware Neural Network Splitting and Dynamic Rescheduling for Edge Intelligence. ACM Journal on Experimental Algorithmics, 42–47. https://doi.org/10.1145/3578354.3592871

dc.identifier.isbn

9798400700828

dc.identifier.uri

http://hdl.handle.net/20.500.12708/190014

dc.description

EdgeSys '23: Proceedings of the 6th International Workshop on Edge Systems, Analytics and Networking

dc.description.abstract

With the rise of IoT devices and the growing need for intelligent applications, inference tasks are often offloaded to the cloud because of the limited computational capacity of end devices. Yet requests to the cloud are costly in terms of latency, and therefore a shift of the computation from the cloud to the network's edge is unavoidable. This shift is called edge intelligence and promises lower latency, among other advantages. However, some algorithms, such as deep neural networks (DNNs), are computationally intensive even for local edge servers (ES). To keep latency low, such DNNs can be split into two parts and distributed between the ES and the cloud. We present a dynamic scheduling algorithm that takes real-time parameters such as the clock speed of the ES, bandwidth, and latency into account and predicts the optimal splitting point with respect to latency. Furthermore, we estimate the overall costs for the ES and cloud during run-time and integrate them into our prediction and decision models. We present a cost-aware prediction of the splitting point, which can be tuned with a parameter toward faster response or lower costs.
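
The abstract describes choosing a splitting point from run-time parameters and a tunable latency/cost trade-off. The following is a minimal sketch of that idea only, not the authors' algorithm: the `Layer` type, the function `choose_split`, the linear latency and cost models, the max-normalised weighted score, and the parameter name `alpha` are all assumptions made for illustration, and the time to return the result to the client is ignored.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Layer:
    flops: float      # compute work of the layer (hypothetical unit)
    out_bytes: float  # size of the layer's output if the DNN is cut right after it


def choose_split(layers: List[Layer], input_bytes: float,
                 es_flops_per_s: float, cloud_flops_per_s: float,
                 bandwidth_bytes_per_s: float, link_latency_s: float,
                 es_price_per_s: float, cloud_price_per_s: float,
                 alpha: float) -> int:
    """Return k such that layers[:k] run on the ES and layers[k:] in the cloud.

    alpha = 1.0 weights latency only, alpha = 0.0 weights cost only
    (an assumed weighting, not taken from the paper).
    """
    n = len(layers)
    latencies, costs = [], []
    for k in range(n + 1):
        es_time = sum(l.flops for l in layers[:k]) / es_flops_per_s
        cloud_time = sum(l.flops for l in layers[k:]) / cloud_flops_per_s
        if k == n:
            transfer = 0.0  # everything stays on the ES, nothing is uploaded
        else:
            payload = input_bytes if k == 0 else layers[k - 1].out_bytes
            transfer = payload / bandwidth_bytes_per_s + link_latency_s
        latencies.append(es_time + transfer + cloud_time)
        costs.append(es_time * es_price_per_s + cloud_time * cloud_price_per_s)

    # Normalise both terms so that they are comparable before weighting.
    max_lat = max(latencies) or 1.0
    max_cost = max(costs) or 1.0
    scores = [alpha * lat / max_lat + (1 - alpha) * c / max_cost
              for lat, c in zip(latencies, costs)]
    return min(range(n + 1), key=scores.__getitem__)


if __name__ == "__main__":
    toy = [Layer(10, 100), Layer(10, 1), Layer(10, 100)]  # made-up numbers
    for a in (1.0, 0.5, 0.0):
        k = choose_split(toy, 100, 10, 100, 100, 0.1, 1.0, 100.0, a)
        print(f"alpha={a}: split after layer {k}")
```

Under these assumptions, dynamic rescheduling would amount to calling `choose_split` again whenever the measured ES speed, bandwidth, or link latency changes.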

dc.language.iso

dc.publisher

Association for Computing Machinery (ACM)

dc.relation.ispartof

ACM Journal on Experimental Algorithmics

dc.subject

cost-awareness

dc.subject

DNN splitting

dc.subject

edge computing

dc.subject

edge intelligence

dc.title

Cost-Aware Neural Network Splitting and Dynamic Rescheduling for Edge Intelligence

dc.type

Article

dc.type

Artikel

dc.identifier.scopus

2-s2.0-85159359631

dc.identifier.url

https://api.elsevier.com/content/abstract/scopus_id/85159359631

dc.contributor.affiliation

University of Vienna, Austria

dc.contributor.affiliation

Umeå University, Sweden

dc.description.startpage

dc.description.endpage

dc.type.category

Original Research Article

tuw.journal.peerreviewed

true

tuw.peerreviewed

true

wb.publication.intCoWork

International Co-publication

tuw.researchinfrastructure

TRIGA Mark II-Nuklearreaktor

tuw.researchinfrastructure

Universitäre Service-Einrichtung für Transmissionselektronenmikroskopie

tuw.researchinfrastructure

Vienna Scientific Cluster

tuw.researchinfrastructure

Zentrum für Kernspinresonanzspektroskopie

tuw.researchinfrastructure

Zentrum für Mikro & Nanostrukturen

tuw.researchTopic.id

tuw.researchTopic.name

Logic and Computation

tuw.researchTopic.name

Computer Engineering and Software-Intensive Systems

tuw.researchTopic.name

Computer Science Foundations

tuw.researchTopic.value

dcterms.isPartOf.title

ACM Journal on Experimental Algorithmics

tuw.publication.orgunit

E194-04 - Forschungsbereich Data Science

tuw.publisher.doi

10.1145/3578354.3592871

dc.identifier.eissn

1084-6654

dc.description.numberOfPages

tuw.author.orcid

0009-0002-1926-5087

tuw.author.orcid

0000-0002-2281-8183

tuw.author.orcid

0000-0001-7424-0208

wb.sciencebranch

Computer Sciences (Informatik)

wb.sciencebranch

Economics (Wirtschaftswissenschaften)

wb.sciencebranch.oefos

1020

wb.sciencebranch.oefos

5020

wb.sciencebranch.value

item.openairetype

research article

item.cerifentitytype

Publications

item.grantfulltext

none

item.languageiso639-1

item.openairecristype

http://purl.org/coar/resource_type/c_2df8fbb1

item.fulltext

no Fulltext

crisitem.author.dept

E194-04 - Forschungsbereich Data Science

crisitem.author.dept

E191-05 - Forschungsbereich Computational Sustainability

crisitem.author.orcid

0000-0001-7424-0208

crisitem.author.parentorg

E194 - Institut für Information Systems Engineering

crisitem.author.parentorg

E191 - Institut für Computer Engineering

Appears in Collections:

Article


