<div class="csl-bib-body">
<div class="csl-entry">Engesser, T., Le Marre, T., Lorini, E., Schwarzentruber, F., & Zanuttini, B. (2025). A Simple Integration of Epistemic Logic and Reinforcement Learning. In <i>AAMAS ’25 : Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems</i> (pp. 686–694). International Foundation for Autonomous Agents and Multiagent Systems.</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/222929
-
dc.description.abstract
We propose an integration of epistemic logic with reinforcement learning via a semantics that uses the concept of belief bases. In our framework, an agent's subjective state is identified with their belief base, which captures the agent's personal representation of the environment. The agent's subjective state is distinguished from the global state, which captures the overall information about the environment and about the agent's belief base from an external perspective. We instantiate the concepts of global state and subjective state in Partially-Observable Markov Decision Process (POMDPs), defining so-called Belief Base POMDPs (BB-POMDPs).
We show that in our epistemic framework, we can use the beliefs of the learning agent to formalize and implement a natural form of shielding, which prevents agents from performing actions that are not known to be safe. Our implementation of shielding relies on a model-checking algorithm to automatically verify whether a given fact is deducible from the agent's belief base.
We perform a case study of model-free reinforcement learning on a simple wumpus scenario, using a variant of Q-learning on the agent's subjective states, using the agent's beliefs for reward shaping and shielding. In particular, our experiments show that our version of shielding can successfully protect the agent from harm while improving the utility of the learned policy.
en
dc.language.iso
en
-
dc.relation.ispartofseries
AAMAS
-
dc.subject
Epistemic logic
en
dc.subject
Reinforcement learning
en
dc.subject
doxastic logic
en
dc.subject
Markov Decision Process
en
dc.title
A Simple Integration of Epistemic Logic and Reinforcement Learning
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.contributor.affiliation
École Normale Supérieure de Lyon, France
-
dc.contributor.affiliation
Université Toulouse - Jean Jaurès, France
-
dc.contributor.affiliation
École Normale Supérieure de Lyon, France
-
dc.contributor.affiliation
Université de Caen Normandie, France
-
dc.relation.isbn
979-8-4007-1426-9
-
dc.description.startpage
686
-
dc.description.endpage
694
-
dc.type.category
Full-Paper Contribution
-
tuw.booktitle
AAMAS '25 : Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems
-
tuw.peerreviewed
true
-
tuw.book.ispartofseries
AAMAS
-
tuw.relation.publisher
International Foundation for Autonomous Agents and Multiagent Systems
-
tuw.relation.publisherplace
Richland, SC, USA
-
tuw.researchTopic.id
I1
-
tuw.researchTopic.name
Logic and Computation
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E192-05 - Forschungsbereich Theory and Logic
-
dc.description.numberOfPages
9
-
tuw.author.orcid
0009-0004-9763-0179
-
tuw.author.orcid
0000-0002-1228-4333
-
tuw.event.name
24th International Conference on Autonomous Agents and Multiagent System (AAMAS 2025)