<div class="csl-bib-body">
<div class="csl-entry">Rabbani, K., Lissandrini, M., & Hose, K. (2023). Extraction of validating shapes from very large knowledge graphs. <i>Proceedings of the VLDB Endowment</i>, <i>16</i>(5), 1023–1032. https://doi.org/10.14778/3579075.3579078</div>
</div>
-
dc.identifier.issn
2150-8097
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/193051
-
dc.description.abstract
Knowledge Graphs (KGs) represent heterogeneous domain knowledge on the Web and within organizations. There exist shapes constraint languages to define validating shapes to ensure the quality of the data in KGs. Existing techniques to extract validating shapes often fail to extract complete shapes, are not scalable, and are prone to produce spurious shapes. To address these shortcomings, we propose the Quality Shapes Extraction (QSE) approach to extract validating shapes in very large graphs, for which we devise both an exact and an approximate solution. QSE provides information about the reliability of shape constraints by computing their confidence and support within a KG and in doing so allows to identify shapes that are most informative and less likely to be affected by incomplete or incorrect data. To the best of our knowledge, QSE is the first approach to extract a complete set of validating shapes from WikiData. Moreover, QSE provides a 12x reduction in extraction time compared to existing approaches, while managing to filter out up to 93% of the invalid and spurious shapes, resulting in a reduction of up to 2 orders of magnitude in the number of constraints presented to the user, e.g., from 11,916 to 809 on DBpedia.
en
dc.language.iso
en
-
dc.publisher
ASSOC COMPUTING MACHINERY
-
dc.relation.ispartof
Proceedings of the VLDB Endowment
-
dc.rights.uri
http://creativecommons.org/licenses/by-nc-nd/4.0/
-
dc.subject
Knowledge Graphs
en
dc.subject
Heterogeneous
en
dc.subject
Validating Shapes
en
dc.subject
Data
en
dc.subject
Approaches
en
dc.subject
Quality Shapes Extraction (QSE)
en
dc.subject
Spurious Shapes
en
dc.title
Extraction of validating shapes from very large knowledge graphs
en
dc.type
Article
en
dc.type
Artikel
de
dc.rights.license
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International
en
dc.rights.license
Creative Commons Namensnennung - Nicht kommerziell - Keine Bearbeitungen 4.0 International