<div class="csl-bib-body">
<div class="csl-entry">Atzeni, P., Baldazzi, T., Bellomarini, L., Laurenza, E., & Sallinger, E. (2026). Semantic-aware query answering with Large Language Models. <i>DATA & KNOWLEDGE ENGINEERING</i>, <i>161</i>, Article 102494. https://doi.org/10.1016/j.datak.2025.102494</div>
</div>
-
dc.identifier.issn
0169-023X
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/221002
-
dc.description.abstract
In the modern data-driven world, answering queries over heterogeneous and semantically inconsistent data remains a significant challenge. Modern datasets originate from diverse sources, such as relational databases, semi-structured repositories, and unstructured documents, leading to substantial variability in schemas, terminologies, and data formats. Traditional systems, constrained by rigid syntactic matching and strict data binding, struggle to capture critical semantic connections and schema ambiguities, failing to meet the growing demand among data scientists for advanced forms of flexibility and context-awareness in query answering. In parallel, the advent of Large Language Models (LLMs) has introduced new capabilities in natural language interpretation, making them highly promising for addressing such challenges. However, LLMs alone lack the systematic rigor and explainability required for robust query processing and decision-making in high-stakes domains. In this paper, we propose Soft Query Answering (Soft QA), a novel hybrid approach that integrates LLMs as an intermediate semantic layer within the query processing pipeline. Soft QA enhances query answering adaptability and flexibility by injecting semantic understanding through context-aware, schema-informed prompts, and leverages LLMs to semantically link entities, resolve ambiguities, and deliver accurate query results in complex settings. We demonstrate its practical effectiveness through real-world examples, highlighting its ability to resolve semantic mismatches and improve query outcomes without requiring extensive data cleaning or restructuring.
en
dc.description.sponsorship
WWTF Wiener Wissenschafts-, Forschu und Technologiefonds
-
dc.language.iso
en
-
dc.publisher
ELSEVIER
-
dc.relation.ispartof
DATA & KNOWLEDGE ENGINEERING
-
dc.subject
Large language models
en
dc.subject
Neurosymbolic AI
en
dc.subject
Query answering
en
dc.title
Semantic-aware query answering with Large Language Models