<div class="csl-bib-body">
<div class="csl-entry">Huang, Y., Qiao, X., Lai, W., Dustdar, S., Zhang, J., & Li, J. (2022). Enabling DNN Acceleration With Data and Model Parallelization Over Ubiquitous End Devices. <i>IEEE Internet of Things Journal</i>, <i>9</i>(16), 15053–15065. https://doi.org/10.1109/JIOT.2021.3112715</div>
</div>
-
dc.identifier.issn
2327-4662
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/81445
-
dc.description.abstract
Deep neural networks (DNNs) show great promise in providing more intelligence to ubiquitous end devices. However, existing partition-offloading schemes adopt data-parallel or model-parallel collaboration between devices and the cloud, which does not make full use of the resources of end devices for deep-level parallel execution. This article proposes eDDNN (i.e., enabling Distributed DNN), a collaborative inference scheme over heterogeneous end devices using cross-platform Web technology, moving the computation close to ubiquitous end devices, improving resource utilization, and reducing the computing pressure of data centers. eDDNN implements D2D communication and collaborative inference among heterogeneous end devices with the WebRTC protocol, divides the data and the corresponding DNN model into pieces simultaneously, and then executes inference almost independently by establishing a layer dependency table. In addition, eDDNN provides a dynamic allocation algorithm based on deep reinforcement learning to minimize latency. We conduct experiments on various data sets and DNNs and further deploy eDDNN in a mobile Web AR application to illustrate its effectiveness. The results show that eDDNN reduces latency by 2.98×, reduces mobile energy consumption by 1.8×, and relieves the computing pressure of the edge server by 2.57× compared with a typical partition-offloading approach.
en
dc.language.iso
en
-
dc.publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
-
dc.relation.ispartof
IEEE Internet of Things Journal
-
dc.subject
Collaborative inference
en
dc.subject
cross-platform
en
dc.subject
deep learning (DL)
en
dc.subject
distributed deep neural network (DNN)
en
dc.subject
ubiquitous end devices
en
dc.title
Enabling DNN Acceleration With Data and Model Parallelization Over Ubiquitous End Devices