Thoma, M., Aghajanzadeh, E., Balamuthu Sampath, S., Mori, P., Fasfous, N., Frickenstein, A., Vemparala, M.-R., Mueller-Gritschneder, D., & Schlichtmann, U. (2025). SuperFast: Fast Supernet Training Using Initial Knowledge. In 2025 62nd ACM/IEEE Design Automation Conference (DAC) (pp. 1–7). IEEE. https://doi.org/10.1109/DAC63849.2025.11132779
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/219945
-
dc.description.abstract
Once-for-all-based neural architecture search (NAS) proposes to train a supernet once and extract specialized subnets from it for efficient deployment. This decoupling of training and search enables easy multi-target deployment without retraining. Nevertheless, the initial training cost remains extremely high, with SOTA approaches such as ElasticViT and NASViT taking more than 72 and 83 GPU days, respectively. While other approaches have tried to accelerate the training by warming up the largest model in the search space, we argue that this is suboptimal and that knowledge is more easily scaled upward than downward. Hence, we propose SuperFast, a simple, plug-and-play workflow that (I) pretrains a subnet of the supernet search space and (II) distributes its knowledge within the supernet before training. SuperFast offers a substantial acceleration of supernet training, resulting in a significantly better accuracy vs. training-cost trade-off. Using SuperFast on both the ElasticViT and NASViT supernets achieves the baseline's accuracy 1.4× and 1.8× faster on the ImageNet dataset. Moreover, for a given time budget, SuperFast improves the accuracy vs. latency trade-off of subnets, gaining 4.0 p.p. in the 20–50 ms range on a Pixel 6. Code is available at https://github.com/MoritzTho/SuperFast.
en
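The two-step workflow described in the abstract can be illustrated with a toy weight-sharing example. The following is a minimal sketch in PyTorch, not the authors' implementation (see the linked repository for that): SliceableLinear, pretrain_subnet, and inject_subnet_knowledge are hypothetical names, and a single elastic-width layer stands in for the full ElasticViT/NASViT search spaces. Step I pretrains a standalone subnet-sized model; Step II copies its weights into the shared slice of the supernet before supernet training begins.

import torch
import torch.nn as nn
import torch.nn.functional as F

class SliceableLinear(nn.Module):
    """Weight-sharing linear layer: each subnet uses the leading slice of the shared weights."""
    def __init__(self, in_features, max_out_features):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(max_out_features, in_features) * 0.02)
        self.bias = nn.Parameter(torch.zeros(max_out_features))

    def forward(self, x, active_out):
        # Only the first `active_out` output units participate for the sampled subnet.
        return F.linear(x, self.weight[:active_out], self.bias[:active_out])

def pretrain_subnet(subnet, x, y_small, steps=200, lr=0.1):
    """Step I: pretrain a standalone subnet-sized model before supernet training starts."""
    opt = torch.optim.SGD(subnet.parameters(), lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss = F.mse_loss(subnet(x), y_small)
        loss.backward()
        opt.step()
    return subnet

def inject_subnet_knowledge(supernet_layer, subnet, active_out):
    """Step II: distribute the subnet's knowledge by copying its weights into the shared slice."""
    with torch.no_grad():
        supernet_layer.weight[:active_out].copy_(subnet.weight)
        supernet_layer.bias[:active_out].copy_(subnet.bias)

if __name__ == "__main__":
    torch.manual_seed(0)
    x = torch.randn(64, 32)   # toy inputs
    y = torch.randn(64, 64)   # toy regression targets at the full supernet width
    subnet_width = 16

    subnet = nn.Linear(32, subnet_width)             # standalone subnet-sized model
    pretrain_subnet(subnet, x, y[:, :subnet_width])  # Step I

    supernet = SliceableLinear(32, 64)
    inject_subnet_knowledge(supernet, subnet, subnet_width)  # Step II

    # Regular weight-sharing supernet training (sampling different widths per step)
    # would follow here, now starting from the subnet's knowledge instead of a
    # random initialization.
    for width in (16, 32, 64):
        print(width, supernet(x, active_out=width).shape)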
dc.language.iso
en
-
dc.subject
Neural Architecture Search
en
dc.subject
Embedded Machine Learning
en
dc.title
SuperFast: Fast Supernet Training Using Initial Knowledge