<div class="csl-bib-body">
<div class="csl-entry">Wess, M., Dinakarrao, S. M. P., & Jantsch, A. (2018). Weighted Quantization-Regularization in DNNs for Weight Memory Minimization Toward HW Implementation. <i>IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems</i>, <i>37</i>(11), 2929–2939. https://doi.org/10.1109/TCAD.2018.2857080</div>
</div>
-
dc.identifier.issn
0278-0070
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/191908
-
dc.description.abstract
Deployment of deep neural networks on hardware platforms is often constrained by limited on-chip memory and computational power. The proposed weight quantization optimizes weight memory while transforming the weights to hardware-friendly data types. We apply dynamic fixed point (DFP) and power-of-two (Po2) quantization in conjunction with layer-wise precision scaling to minimize the weight memory. To alleviate accuracy degradation due to precision scaling, we employ quantization-aware fine-tuning. For fine-tuning, quantization-regularization (QR) and weighted QR are introduced, which guide training toward the desired quantization by adding the distance of the weights to the desired quantization levels as a regularization term to the loss function. While DFP quantization performs better when allowing different bit-widths for each layer, Po2 quantization in combination with retraining allows higher compression rates for equal bit-width quantization. The techniques are verified on an all-convolutional network. With an accuracy degradation of 0.10 percentage points, DFP quantization with layer-wise precision scaling achieves compression ratios of 7.34 on CIFAR-10, 4.7 on CIFAR-100, and 9.33 on SVHN.
en
-
dc.description.sponsorship
Christian Doppler Forschungsgesellschaft
-
dc.language.iso
en
-
dc.publisher
IEEE (Institute of Electrical and Electronics Engineers, Inc.)
-
dc.relation.ispartof
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
-
dc.subject
Convolutional neural networks
en
-
dc.subject
memory minimization
en
-
dc.subject
quantization
en
-
dc.subject
regularization
en
-
dc.title
Weighted Quantization-Regularization in DNNs for Weight Memory Minimization Toward HW Implementation