<div class="csl-bib-body">
<div class="csl-entry">Zebenholzer, M., Kasper, L., Schirrer, A., & Hofmann, R. (2025). Optimal Energy Scheduling for Battery and Hydrogen Storage Systems Using Reinforcement Learning. In J. F. M. Van Impe, G. Léonard, & S. S. Bhonsale (Eds.), <i>Proceedings of the 35th European Symposium on Computer Aided Process Engineering (ESCAPE 35)</i> (pp. 1201–1207). PSE Press. https://doi.org/10.69997/sct.134052</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/217230
-
dc.description.abstract
Optimal energy scheduling for sector-coupled multi-energy systems is becoming increasingly important as renewable energy sources such as wind and photovoltaics continue to expand. Their output is highly volatile and difficult to predict, creating a mismatch between generation and demand that can be compensated by energy storage technologies. For these, rule-based control is well established in industry, while mixed-integer model predictive control (MPC) is an active area of research that promises the best results, typically in terms of minimal cost. Drawbacks of MPC include the need for an adequate system model, which often entails high modeling effort, high computational cost for longer prediction horizons, and complications with stochastic variables. In this work, Reinforcement Learning is used in an attempt to overcome these difficulties without resorting to elaborate mixed-integer linear programming. The self-learning algorithm, which requires no explicit knowledge of the system behavior, can learn a control policy and the uncertainties of the variables purely through interaction with the (simulated) system model. It is demonstrated that Reinforcement Learning (exchange factor = 36.8 %) can learn complex system behavior with quality comparable to model predictive control (exchange factor = 32.4 %) and outperforms rule-based control (exchange factor = 41.8 %). This is shown in a case study whose goal is to minimize the exchange of energy with the grid, with a battery and a hydrogen system providing storage flexibility. These results were achieved even though the Reinforcement Learning agent has only instantaneous rather than predictive information, i.e., a very limited state of information compared to the MPC. The trained policy can then be deployed with significantly reduced computational effort.
en
dc.language.iso
en
-
dc.relation.ispartofseries
Systems and Control Transactions
-
dc.rights.uri
http://creativecommons.org/licenses/by-sa/4.0/
-
dc.subject
Optimal Energy Scheduling
en
dc.subject
reinforcement learning (RL)
en
dc.subject
model predictive control (MPC)
en
dc.title
Optimal Energy Scheduling for Battery and Hydrogen Storage Systems Using Reinforcement Learning
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.rights.license
Creative Commons Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International
de
dc.rights.license
Creative Commons Attribution-ShareAlike 4.0 International