An Off-Chip Memory Access Optimization for Embedded Deep Learning Systems

Putra, Rachmad Vidya Wicaksana; Hanif, Muhammad Abdullah; Shafique, Muhammad

doi:10.1007/978-3-031-19568-6_6

DC Field

Value

Language

dc.contributor.author

Putra, Rachmad Vidya Wicaksana

dc.contributor.author

Hanif, Muhammad Abdullah

dc.contributor.author

Shafique, Muhammad

dc.contributor.editor

Pasricha, Sudeep

dc.contributor.editor

Shafique, Muhammad

dc.date.accessioned

2024-01-15T15:49:33Z

dc.date.available

2024-01-15T15:49:33Z

dc.date.issued

2023-10-01

dc.identifier.citation

<div class="csl-bib-body"> <div class="csl-entry">Putra, R. V. W., Hanif, M. A., & Shafique, M. (2023). An Off-Chip Memory Access Optimization for Embedded Deep Learning Systems. In S. Pasricha & M. Shafique (Eds.), <i>Embedded Machine Learning for Cyber-Physical, IoT, and Edge Computing : Hardware Architectures</i> (pp. 175–198). Springer. https://doi.org/10.1007/978-3-031-19568-6_6</div> </div>

dc.identifier.uri

http://hdl.handle.net/20.500.12708/191922

dc.description.abstract

Implementations of Deep Neural Networks (DNNs) or Deep Learning (DL) for embedded applications may improve the users’ quality of life, as DL has become a prominent solution for many machine learning (ML) tasks, like personalized healthcare assistance. Such implementations require high energy efficiency since embedded applications usually have tight operational constraints, such as small memory and low operational power/energy. Therefore, specialized hardware accelerators are typically employed to expedite the DL inference. However, previous works have shown that DL accelerators still suffer from high energy consumption from the DRAM-based off-chip memory accesses, thereby hindering the embedded DL implementations. In this chapter, we discuss our design methodology for optimizing the energy consumption of DRAM accesses for the DL accelerators targeting embedded applications. Our design methodology employs an exploration technique to find the data partitioning and scheduling that offer minimum DRAM accesses for the given DNN model and exploits the low latency DRAMs to efficiently perform data accesses that incur minimum DRAM access energy.

dc.language.iso

dc.subject

deep learning

dc.subject

hardware accelerator

dc.subject

off-chip dram accesses

dc.subject

data partitioning and scheduling

dc.subject

energy efficiency

dc.subject

embedded systems

dc.title

An Off-Chip Memory Access Optimization for Embedded Deep Learning Systems

dc.type

Book Contribution

dc.type

Buchbeitrag

dc.contributor.affiliation

New York University Abu Dhabi, United Arab Emirates (the)

dc.contributor.affiliation

New York University Abu Dhabi, United Arab Emirates (the)

dc.relation.isbn

978-3-031-19568-6

dc.description.startpage

175

dc.description.endpage

198

dc.type.category

Edited Volume Contribution

tuw.booktitle

Embedded Machine Learning for Cyber-Physical, IoT, and Edge Computing : Hardware Architectures

tuw.relation.publisher

Springer

tuw.relation.publisherplace

Cham

tuw.researchTopic.id

tuw.researchTopic.name

Computer Engineering and Software-Intensive Systems

tuw.researchTopic.value

100

tuw.publication.orgunit

E191-02 - Forschungsbereich Embedded Computing Systems

tuw.publisher.doi

10.1007/978-3-031-19568-6_6

dc.description.numberOfPages

wb.sciencebranch

Informatik

wb.sciencebranch.oefos

1020

wb.sciencebranch.value

100

item.languageiso639-1

item.openairetype

book part

item.grantfulltext

restricted

item.fulltext

no Fulltext

item.cerifentitytype

Publications

item.openairecristype

http://purl.org/coar/resource_type/c_3248

crisitem.author.dept

E191-02 - Forschungsbereich Embedded Computing Systems

crisitem.author.dept

E191-02 - Forschungsbereich Embedded Computing Systems

crisitem.author.dept

E191-02 - Forschungsbereich Embedded Computing Systems

crisitem.author.parentorg

E191 - Institut für Computer Engineering

crisitem.author.parentorg

E191 - Institut für Computer Engineering

crisitem.author.parentorg

E191 - Institut für Computer Engineering

Appears in Collections:

Book Contribution

Show simple item record

Page view(s)

140

checked on Jan 15, 2024

Download(s)

checked on Jan 15, 2024

Google Scholar^TM

Check

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM