Schnöll, D., Wess, M., Bittner, M., Götzinger, M., & Jantsch, A. (2023). Fast, Quantization Aware DNN Training for Efficient HW Implementation. In 2023 26th Euromicro Conference on Digital System Design (DSD) (pp. 700–707). https://doi.org/10.1109/DSD60849.2023.00100
Published in:
2023 26th Euromicro Conference on Digital System Design (DSD)
-
ISBN:
979-8-3503-4419-6
-
Date (published):
2023
-
Event name:
26th Euromicro Conference on Digital System Design (DSD 2023)
-
Event period:
6-Sep-2023 to 8-Sep-2023
-
Event location:
Golem, Durres, Albania
-
Extent:
8 pages
-
Peer Reviewed:
Yes
-
Keywords:
Convolution; hardware-friendly; Neural networks; Quantization (signal); Quantization Aware Training; Training
-
Abstract:
Quantization of Deep Neural Networks (DNNs) is a central technique for reducing the computational load on embedded devices. Even in quantized DNNs, the scaler/rescaler following a convolution or dense layer often requires a high-bit-width multiplication and a shift. Previous work has proposed removing the multiplier by restricting the quantization method. We propose a Quantization Aware Training (QAT) approach that explicitly models the rescaler during training, avoiding restrictions on the quantization function while achieving a 30-35% improvement in training time and a significant reduction in memory requirements compared to the state of the art. GitHub: https://github.com/embedded-machine-learning/FastQATforPOTRescaler
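To illustrate the rescaling step the abstract refers to, here is a minimal sketch (not the authors' implementation; function names, bit widths, and constants are illustrative assumptions): a generic integer rescaler needs a high-bit-width multiply plus a shift, whereas a power-of-two (POT) rescaler reduces to a shift alone, which is the hardware simplification the paper's QAT approach targets.

    # Minimal sketch of integer rescaling after a quantized conv/dense layer.
    # Illustrative only; not the paper's code. Bit widths are assumptions.
    import numpy as np

    def rescale_mult_shift(acc, multiplier, shift):
        # Generic rescaler: widen the 32-bit accumulator, multiply by a
        # fixed-point multiplier, then round and arithmetic-right-shift.
        prod = acc.astype(np.int64) * multiplier
        return ((prod + (1 << (shift - 1))) >> shift).astype(np.int32)

    def rescale_pot(acc, shift):
        # POT rescaler: the scale is 2**(-shift), so the multiplier
        # disappears and only the shift (plus rounding) remains.
        return ((acc.astype(np.int64) + (1 << (shift - 1))) >> shift).astype(np.int32)

    acc = np.array([51234, -7777], dtype=np.int32)  # example accumulator values
    print(rescale_mult_shift(acc, multiplier=1_431_655_765, shift=32))  # approx. acc / 3
    print(rescale_pot(acc, shift=7))                                    # acc / 128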
-
Project (external):
Christian Doppler Forschungsgesellschaft
-
Research focus areas:
Mathematical and Algorithmic Foundations: 60%; Computer Science Foundations: 30%; Computational System Design: 10%