<div class="csl-bib-body">
<div class="csl-entry">Ahmadifarsani, S., Müller-Gritschneder, D., & Schlichtmann, U. (2025). <i>A High-Level Compiler Integration Approach for Deep Learning Accelerators Supporting Abstraction and Optimization</i>. arXiv. https://doi.org/10.48550/arXiv.2507.04828</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/222310
-
dc.description.abstract
The growing adoption of domain-specific architectures in edge computing platforms for deep learning has highlighted the efficiency of hardware accelerators. However, integrating custom accelerators into modern machine learning (ML) compilers remains a complex challenge due to the need for significant modifications in compilation layers and specialized scheduling techniques. Existing frameworks offer partial solutions and require users to navigate intricate compiler internals.
In this paper, we introduce a TVM-based compilation integration approach that targets GEMM-based deep learning accelerators. Our approach abstracts the complexities of compiler integration, enabling seamless integration of accelerators without requiring in-depth knowledge of the underlying compiler. Furthermore, we extend and incorporate design space exploration tools, specifically CoSA, to automate efficient tensor scheduling, accounting for factors such as uneven mapping and double buffering. Our framework is benchmarked on the Gemmini accelerator, demonstrating performance comparable to its specialized manually implemented toolchain.
en
dc.language.iso
en
-
dc.subject
DL Accelerators
en
dc.subject
ML Compiler
en
dc.subject
Tensor Scheduling
en
dc.title
A High-Level Compiler Integration Approach for Deep Learning Accelerators Supporting Abstraction and Optimization
en
dc.type
Preprint
en
dc.type
Preprint
de
dc.identifier.arxiv
2507.04828
-
dc.contributor.affiliation
Technical University of Munich, Germany
-
dc.contributor.affiliation
Electrical and Computer Engineering - Technical University of Munich (München, DE)
-
tuw.researchTopic.id
I2
-
tuw.researchTopic.name
Computer Engineering and Software-Intensive Systems
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E191-02 - Forschungsbereich Embedded Computing Systems
-
tuw.publisher.doi
10.48550/arXiv.2507.04828
-
dc.description.numberOfPages
8
-
tuw.author.orcid
0009-0003-4517-168X
-
tuw.author.orcid
0000-0003-0903-631X
-
tuw.author.orcid
0000-0003-4431-7619
-
tuw.publisher.server
arXiv
-
wb.sciencebranch
Informatik
-
wb.sciencebranch
Elektrotechnik, Elektronik, Informationstechnik
-
wb.sciencebranch
Mathematik
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.oefos
2020
-
wb.sciencebranch.oefos
1010
-
wb.sciencebranch.value
50
-
wb.sciencebranch.value
40
-
wb.sciencebranch.value
10
-
item.openairecristype
http://purl.org/coar/resource_type/c_816b
-
item.fulltext
no Fulltext
-
item.cerifentitytype
Publications
-
item.grantfulltext
none
-
item.openairetype
preprint
-
item.languageiso639-1
en
-
crisitem.author.dept
Technical University of Munich, Germany
-
crisitem.author.dept
E191-02 - Forschungsbereich Embedded Computing Systems
-
crisitem.author.dept
Electrical and Computer Engineering - Technical University of Munich (München, DE)