<div class="csl-bib-body">
<div class="csl-entry">Eniser, H. F., Lin, S., Müller, N., Isychev, A., Wüstholz, V., Valera, I., Hoffmann, J., & Christakis, M. (2025). Using Action-Policy Testing in RL to Reduce the Number of Bugs. In M. Likhachev, H. Rudová, & E. Scala (Eds.), <i>Eighteenth International Symposium on Combinatorial Search</i> (pp. 181–185). AAAI Press. http://hdl.handle.net/20.500.12708/226143</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/226143
-
dc.description.abstract
Reinforcement learning is becoming ever more prominent in solving combinatorial search problems, in particular ones where states are images. Prior work has devised action-policy testing methodology, that identifies so-called bug states where policy performance is sub-optimal. Here we show how to leverage this methodology during the RL process, using action-policy testing to find bugs and injecting those as alternate start states for the training runs. Running experiments across six 2D games, we find that our testing-guided training often achieves similar expected reward while reducing the number of bugs.
en
dc.language.iso
en
-
dc.subject
action-policy testing
en
dc.subject
reinforcement learning
en
dc.subject
bug reduction
en
dc.title
Using Action-Policy Testing in RL to Reduce the Number of Bugs
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.contributor.affiliation
Max Planck Institute for Software Systems, Germany
-
dc.contributor.affiliation
Saarland University, Germany
-
dc.contributor.affiliation
German Research Centre for Artificial Intelligence, Germany
-
dc.contributor.affiliation
ConsenSys, Austria
-
dc.contributor.affiliation
Saarland University, Germany
-
dc.contributor.affiliation
Saarland University, Germany
-
dc.contributor.editoraffiliation
Faculty of Informatics - Masaryk University (Brno, CZ)
-
dc.contributor.editoraffiliation
University of Brescia (Brescia, IT)
-
dc.relation.isbn
978-1-57735-901-2
-
dc.relation.issn
2832-9171
-
dc.description.startpage
181
-
dc.description.endpage
185
-
dc.type.category
Full-Paper Contribution
-
dc.relation.eissn
2832-9163
-
tuw.booktitle
Eighteenth International Symposium on Combinatorial Search
-
tuw.peerreviewed
true
-
tuw.relation.publisher
AAAI Press
-
tuw.relation.publisherplace
Washington DC
-
tuw.researchTopic.id
I4
-
tuw.researchTopic.name
Information Systems Engineering
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E194-01 - Forschungsbereich Software Engineering
-
tuw.publication.orgunit
E056-26 - Fachbereich Automated Reasoning
-
dc.description.numberOfPages
5
-
tuw.author.orcid
0000-0001-6375-0421
-
tuw.author.orcid
0000-0002-6440-4376
-
tuw.author.orcid
0000-0002-2649-1958
-
tuw.editor.orcid
0000-0002-9539-2398
-
tuw.editor.orcid
0000-0002-6410-4250
-
tuw.editor.orcid
0000-0003-2274-875X
-
tuw.event.name
The 18th International Symposium on Combinatorial Search (SoCS 2025)
en
tuw.event.startdate
12-08-2025
-
tuw.event.enddate
15-08-2025
-
tuw.event.online
On Site
-
tuw.event.type
Event for scientific audience
-
tuw.event.place
Glasgow
-
tuw.event.country
GB
-
tuw.event.presenter
Eniser, Hasan Ferit
-
wb.sciencebranch
Informatik
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.value
100
-
item.openairecristype
http://purl.org/coar/resource_type/c_5794
-
item.fulltext
no Fulltext
-
item.languageiso639-1
en
-
item.grantfulltext
none
-
item.openairetype
conference paper
-
item.cerifentitytype
Publications
-
crisitem.author.dept
Max Planck Institute for Software Systems, Germany
-
crisitem.author.dept
Saarland University, Germany
-
crisitem.author.dept
German Research Centre for Artificial Intelligence, Germany
-
crisitem.author.dept
E194-01 - Forschungsbereich Software Engineering
-
crisitem.author.dept
ConsenSys, Austria
-
crisitem.author.dept
Saarland University, Germany
-
crisitem.author.dept
Saarland University, Germany
-
crisitem.author.dept
E194-01 - Forschungsbereich Software Engineering
-
crisitem.author.orcid
0000-0001-6375-0421
-
crisitem.author.orcid
0000-0002-6440-4376
-
crisitem.author.orcid
0000-0002-2649-1958
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering