<div class="csl-bib-body">
<div class="csl-entry">Recski, G., Iklodi, E., Lellmann, B., Kovács, Á., & Hanbury, A. (2024). BRISE-plandok: a German legal corpus of building regulations. <i>Language Resources and Evaluation</i>. https://doi.org/10.1007/s10579-024-09747-7</div>
</div>
-
dc.identifier.issn
1574-020X
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/199060
-
dc.description.abstract
We present the BRISE-Plandok corpus, a collection of 250 text documents with a total of over 7000 sentences from the Zoning Map of the City of Vienna, annotated manually with formal representations of the rules they convey. The generic rule format used by the corpus enables automated compliance checking of building plans, a process developed as part of the BRISE (https://smartcity.wien.gv.at/en/brise/) project. The format also allows for conversion to multiple logic formalisms, including dyadic deontic logic, enabling automated reasoning. Annotation guidelines were developed in collaboration with experts of the city’s building inspection office, describing nearly 100 domain-specific attributes with examples. Each document was annotated independently by two trained annotators and subsequently reviewed by the authors. A rule-based system for the automatic extraction of rules from text was developed and used in the annotation process to provide suggestions. The reviewed dataset was also used to train a set of baseline machine learning models for the task of attribute extraction, the main step in the rule extraction process. Both the rule-based system and the ML baselines are evaluated on the annotated dataset and released as open-source software. We also describe and release the framework used for generating and parsing the interactive xlsx spreadsheets used by annotators.
en
dc.description.sponsorship
European Commission
-
dc.language.iso
en
-
dc.publisher
SPRINGER
-
dc.relation.ispartof
Language Resources and Evaluation
-
dc.subject
Rule corpus
en
dc.subject
Rule extraction
en
dc.subject
Annotation
en
dc.subject
Rule-based methods
en
dc.subject
Deontic Logic
en
dc.subject
Automated Compliance Checking
en
dc.title
BRISE-plandok: a German legal corpus of building regulations
en
dc.type
Article
en
dc.type
Artikel
de
dc.relation.grantno
UIA04-081
-
dc.type.category
Original Research Article
-
tuw.journal.peerreviewed
true
-
tuw.peerreviewed
true
-
tuw.project.title
Digitalisierung der Bauvorschriften für die Einreichung in Wien
-
tuw.researchTopic.id
I1
-
tuw.researchTopic.id
X1
-
tuw.researchTopic.id
I4
-
tuw.researchTopic.name
Logic and Computation
-
tuw.researchTopic.name
Beyond TUW-research foci
-
tuw.researchTopic.name
Information Systems Engineering
-
tuw.researchTopic.value
35
-
tuw.researchTopic.value
30
-
tuw.researchTopic.value
35
-
dcterms.isPartOf.title
Language Resources and Evaluation
-
tuw.publication.orgunit
E194-04 - Forschungsbereich Data Science
-
tuw.publication.orgunit
E192-02 - Forschungsbereich Databases and Artificial Intelligence
-
tuw.publisher.doi
10.1007/s10579-024-09747-7
-
dc.date.onlinefirst
2024
-
dc.identifier.eissn
1574-0218
-
dc.description.numberOfPages
40
-
tuw.author.orcid
0000-0001-5551-3100
-
tuw.author.orcid
0000-0003-1475-1684
-
tuw.author.orcid
0000-0001-6132-7144
-
tuw.author.orcid
0000-0002-7149-5843
-
wb.sci
true
-
wb.sciencebranch
Sprach- und Literaturwissenschaften
-
wb.sciencebranch
Informatik
-
wb.sciencebranch.oefos
6020
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.value
30
-
wb.sciencebranch.value
70
-
item.languageiso639-1
en
-
item.grantfulltext
none
-
item.cerifentitytype
Publications
-
item.openairetype
research article
-
item.openairecristype
http://purl.org/coar/resource_type/c_2df8fbb1
-
item.fulltext
no Fulltext
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.dept
E192-02 - Forschungsbereich Databases and Artificial Intelligence
-
crisitem.author.dept
E192-05 - Forschungsbereich Theory and Logic
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.orcid
0000-0001-5551-3100
-
crisitem.author.orcid
0000-0003-1475-1684
-
crisitem.author.orcid
0000-0001-6132-7144
-
crisitem.author.orcid
0000-0002-7149-5843
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering
-
crisitem.author.parentorg
E192 - Institut für Logic and Computation
-
crisitem.author.parentorg
E192 - Institut für Logic and Computation
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering