<div class="csl-bib-body">
<div class="csl-entry">Arenas, M., Merkl, T. C., Pichler, R., & Riveros, C. (2024). Towards Tractability of the Diversity of Query Answers: Ultrametrics to the Rescue. <i>Proceedings of the ACM on Management of Data (PACMMOD)</i>, <i>2</i>(5), Article 215. https://doi.org/10.1145/3695833</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/209312
-
dc.description.abstract
The set of answers to a query may be very large, potentially overwhelming users when presented with the entire set. In such cases, presenting only a small subset of the answers to the user may be preferable. A natural requirement for this subset is that it should be as diverse as possible to reflect the variety of the entire population. To achieve this, the diversity of a subset is measured using a metric that determines how different two solutions are and a diversity function that extends this metric from pairs to sets. In the past, several studies have shown that finding a diverse subset from an explicitly given set is intractable even for simple metrics (like Hamming distance) and simple diversity functions (like summing all pairwise distances). This complexity barrier becomes even more challenging when trying to output a diverse subset from a set that is only implicitly given (such as the query answers for a given query and a database). Until now, tractable cases have been found only for restricted problems and particular diversity functions.
To overcome these limitations, we focus in this work on the notion of ultrametrics, which have been widely studied and used in many applications. Starting from any ultrametric d and a diversity function δ extending d, we provide sufficient conditions over δ for having polynomial-time algorithms to construct diverse answers. To the best of our knowledge, these conditions are satisfied by all the diversity functions considered in the literature. Moreover, we complement these results with lower bounds that show specific cases when these conditions are not satisfied and finding diverse subsets becomes intractable. We conclude by applying these results to the evaluation of conjunctive queries, demonstrating efficient algorithms for finding a diverse subset of solutions for acyclic conjunctive queries when the attribute order is used to measure diversity.
en
dc.description.sponsorship
WWTF Wiener Wissenschafts-, Forschu und Technologiefonds
-
dc.language.iso
en
-
dc.publisher
ACM
-
dc.relation.ispartof
Proceedings of the ACM on Management of Data (PACMMOD)
-
dc.subject
Query evaluation
en
dc.subject
diversity
en
dc.subject
conjunctive queries
en
dc.title
Towards Tractability of the Diversity of Query Answers: Ultrametrics to the Rescue
en
dc.type
Article
en
dc.type
Artikel
de
dc.relation.grantno
ICT22-011
-
dc.type.category
Original Research Article
-
tuw.container.volume
2
-
tuw.container.issue
5
-
tuw.journal.peerreviewed
true
-
tuw.peerreviewed
true
-
wb.publication.intCoWork
International Co-publication
-
tuw.project.title
Decompose and Conquer: Fast Query Processing via Decomposition
-
tuw.researchTopic.id
I1
-
tuw.researchTopic.name
Logic and Computation
-
tuw.researchTopic.value
100
-
dcterms.isPartOf.title
Proceedings of the ACM on Management of Data (PACMMOD)
-
tuw.publication.orgunit
E192-02 - Forschungsbereich Databases and Artificial Intelligence
-
tuw.publisher.doi
10.1145/3695833
-
dc.identifier.articleid
215
-
dc.identifier.eissn
2836-6573
-
dc.description.numberOfPages
26
-
tuw.author.orcid
0000-0002-1760-122X
-
tuw.author.orcid
0000-0003-0832-116X
-
wb.sciencebranch
Informatik
-
wb.sciencebranch
Mathematik
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.oefos
1010
-
wb.sciencebranch.value
80
-
wb.sciencebranch.value
20
-
item.languageiso639-1
en
-
item.openairetype
research article
-
item.grantfulltext
none
-
item.fulltext
no Fulltext
-
item.cerifentitytype
Publications
-
item.openairecristype
http://purl.org/coar/resource_type/c_2df8fbb1
-
crisitem.author.dept
E404 - CSLEARN
-
crisitem.author.dept
E192-02 - Forschungsbereich Databases and Artificial Intelligence
-
crisitem.author.dept
E192-02 - Forschungsbereich Databases and Artificial Intelligence
-
crisitem.author.orcid
0000-0002-1760-122X
-
crisitem.author.orcid
0000-0003-0832-116X
-
crisitem.author.parentorg
E192 - Institut für Logic and Computation
-
crisitem.author.parentorg
E192 - Institut für Logic and Computation
-
crisitem.project.funder
WWTF Wiener Wissenschafts-, Forschu und Technologiefonds