<div class="csl-bib-body">
<div class="csl-entry">Neufeld, E., Ciabattoni, A., & Tulcan, R. F. (2025). Combining MORL with Restraining Bolts to Learn Normative Behaviour. In J. Kwok (Ed.), <i>Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence</i> (pp. 4615–4623). ACM. https://doi.org/10.24963/ijcai.2025/514</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/221680
-
dc.description.abstract
Normative Restraining Bolts (NRBs) adapt the restraining bolt technique (originally developed for safe reinforcement learning) to ensure compliance with social, legal, and ethical norms. While effective, NRBs rely on trial-and-error weight tuning, which hinders their ability to enforce hierarchical norms; moreover, norm updates require retraining. In this paper, we reformulate learning with NRBs as a multi-objective reinforcement learning (MORL) problem, where each norm is treated as a distinct objective. This enables the introduction of Ordered Normative Restraining Bolts (ONRBs), which support algorithmic weight selection, prioritized norms, norm updates, and provide formal guarantees on minimizing norm violations. Case studies show that ONRBs offer a robust and principled foundation for RL-agents to comply with a wide range of norms while achieving their goals.
en
dc.language.iso
en
-
dc.subject
Knowledge Representation and Reasoning
en
dc.subject
Learning and reasoning
en
dc.subject
Agent-based and Multi-agent Systems
en
dc.subject
Normative systems
en
dc.subject
AI Ethics
en
dc.subject
Trust
en
dc.subject
Fairness
en
dc.subject
Machine Learning
en
dc.subject
Reinforcement learning
en
dc.title
Combining MORL with Restraining Bolts to Learn Normative Behaviour
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.contributor.affiliation
TU Wien, Austria
-
dc.relation.isbn
978-1-956792-06-5
-
dc.relation.doi
10.24963/ijcai.2025
-
dc.description.startpage
4615
-
dc.description.endpage
4623
-
dc.type.category
Full-Paper Contribution
-
tuw.booktitle
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence
-
tuw.peerreviewed
true
-
tuw.relation.publisher
ACM
-
tuw.researchTopic.id
I1
-
tuw.researchTopic.name
Logic and Computation
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E192-05 - Forschungsbereich Theory and Logic
-
tuw.publication.orgunit
E056-13 - Fachbereich LogiCS
-
tuw.publication.orgunit
E056-23 - Fachbereich Innovative Combinations and Applications of AI and ML (iCAIML)
-
tuw.publisher.doi
10.24963/ijcai.2025/514
-
dc.description.numberOfPages
9
-
tuw.author.orcid
0000-0002-6966-1136
-
tuw.event.name
Thirty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2025))