<div class="csl-bib-body">
<div class="csl-entry">Ilager, S., Briem, L. F., & Brandic, I. (2025). <i>GREEN-CODE: Learning to Optimize Energy Efficiency in LLM-based Code Generation</i>. arXiv. https://doi.org/10.48550/arXiv.2501.11006</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/222622
-
dc.description.abstract
Large Language Models (LLMs) are becoming integral to daily life, showcasing their vast potential across various Natural Language Processing (NLP) tasks. Beyond NLP, LLMs are increasingly used in software development tasks, such as code completion, modification, bug fixing, and code translation. Software engineers widely use tools like GitHub Copilot and Amazon Q, streamlining workflows and automating tasks with high accuracy. While the resource and energy intensity of LLM training is often highlighted, inference can be even more resource-intensive over time, as it's a continuous process with a high number of invocations. Therefore, developing resource-efficient alternatives for LLM inference is crucial for sustainability. This work proposes GREEN-CODE, a framework for energy-aware code generation in LLMs. GREEN-CODE performs dynamic early exit during LLM inference. We train a Reinforcement Learning (RL) agent that learns to balance the trade-offs between accuracy, latency, and energy consumption. Our approach is evaluated on two open-source LLMs, Llama 3.2 3B and OPT 2.7B, using the JavaCorpus and PY150 datasets. Results show that our method reduces the energy consumption between 23-50 % on average for code generation tasks without significantly affecting accuracy.
en
dc.language.iso
en
-
dc.subject
LLM
en
dc.subject
Quantum Computing
en
dc.subject
Green-Code
en
dc.title
GREEN-CODE: Learning to Optimize Energy Efficiency in LLM-based Code Generation
en
dc.type
Preprint
en
dc.type
Preprint
de
dc.identifier.arxiv
2501.11006
-
dc.contributor.affiliation
University of Amsterdam, Netherlands (the)
-
dc.contributor.affiliation
TU Wien, Austria
-
tuw.researchTopic.id
I1
-
tuw.researchTopic.name
Logic and Computation
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E056-23 - Fachbereich Innovative Combinations and Applications of AI and ML (iCAIML)