<div class="csl-bib-body">
<div class="csl-entry">Ferranti, N., De Souza, J. F., Ahmetaj, S., & Polleres, A. (2024). Formalizing and Validating Wikidata’s Property Constraints using SHACL and SPARQL. <i>Semantic Web</i>, <i>15</i>(6), 2333–2380. https://doi.org/10.3233/SW-243611</div>
</div>
-
dc.identifier.issn
1570-0844
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/224960
-
dc.description.abstract
In this paper, we delve into the crucial role of constraints in maintaining data integrity in knowledge graphs with a specific focus on Wikidata, one of the most extensive collaboratively maintained open data knowledge graphs on the Web. The World Wide Web Consortium (W3C) recommends the Shapes Constraint Language (SHACL) as the constraint language for validating Knowledge Graphs, which comes in two different levels of expressivity, SHACL-Core, as well as SHACL-SPARQL. Despite the availability of SHACL, Wikidata currently represents its property constraints through its own RDF data model, which relies on Wikidata’s specific reification mechanism based on authoritative namespaces, and – partially ambiguous – natural language definitions. In the present paper, we investigate whether and how the semantics of Wikidata property constraints can be formalized using SHACL-Core, SHACL-SPARQL, as well as directly as SPARQL queries. While the expressivity of SHACL-Core turns out to be insufficient for expressing all Wikidata property constraint types, we present SPARQL queries to identify violations for all 32 current Wikidata constraint types. We compare the semantics of this unambiguous SPARQL formalization with Wikidata’s violation reporting system and discuss limitations in terms of evaluation via Wikidata’s public SPARQL query endpoint, due to its current scalability. Our study, on the one hand, sheds light on the unique characteristics of constraints defined by the Wikidata community, in order to improve the quality and accuracy of data in this collaborative knowledge graph. On the other hand, as a “byproduct”, our formalization extends existing benchmarks for both SHACL and SPARQL with a challenging, large-scale real-world use case.
en
dc.description.sponsorship
FWF - Österr. Wissenschaftsfonds
-
dc.language.iso
en
-
dc.publisher
IOS PRESS
-
dc.relation.ispartof
Semantic Web
-
dc.subject
Wikidata
en
dc.subject
Data quality
en
dc.subject
Knowledge Graphs
en
dc.subject
Constraints
en
dc.subject
Shapes Constraint Language
en
dc.subject
SPARQL
en
dc.title
Formalizing and Validating Wikidata’s Property Constraints using SHACL and SPARQL
en
dc.type
Article
en
dc.type
Artikel
de
dc.contributor.affiliation
Vienna University of Economics and Business, Austria
-
dc.contributor.affiliation
Universidade Federal de Juiz de Fora, Brazil
-
dc.contributor.affiliation
Vienna University of Economics and Business, Austria
-
dc.description.startpage
2333
-
dc.description.endpage
2380
-
dc.relation.grantno
T 1349-N
-
dc.type.category
Original Research Article
-
tuw.container.volume
15
-
tuw.container.issue
6
-
tuw.journal.peerreviewed
true
-
tuw.peerreviewed
true
-
wb.publication.intCoWork
International Co-publication
-
tuw.project.title
Grundlagen der Schlussfolgerungen in der Shape Constraint Language
-
tuw.researchTopic.id
I1
-
tuw.researchTopic.name
Logic and Computation
-
tuw.researchTopic.value
100
-
dcterms.isPartOf.title
Semantic Web
-
tuw.publication.orgunit
E192-03 - Forschungsbereich Knowledge Based Systems
-
tuw.publisher.doi
https://doi.org/10.3233/SW-243611
-
dc.date.onlinefirst
2024-08-23
-
dc.identifier.eissn
2210-4968
-
dc.description.numberOfPages
48
-
tuw.author.orcid
0000-0001-5670-1146
-
wb.sci
true
-
wb.sciencebranch
Informatik
-
wb.sciencebranch
Mathematik
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.oefos
1010
-
wb.sciencebranch.value
80
-
wb.sciencebranch.value
20
-
item.openairetype
research article
-
item.openairecristype
http://purl.org/coar/resource_type/c_2df8fbb1
-
item.cerifentitytype
Publications
-
item.languageiso639-1
en
-
item.grantfulltext
restricted
-
item.fulltext
no Fulltext
-
crisitem.author.dept
Vienna University of Economics and Business
-
crisitem.author.dept
Universidade Federal de Juiz de Fora
-
crisitem.author.dept
E192-03 - Forschungsbereich Knowledge Based Systems