<div class="csl-bib-body">
<div class="csl-entry">Ali, S. J., Naganathan, V., & Bork, D. (2024). Establishing Traceability Between Natural Language Requirements and Software Artifacts by Combining RAG and LLMs. In <i>Conceptual Modeling</i> (pp. 295–314). https://doi.org/10.1007/978-3-031-75872-0_16</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/205507
-
dc.description.abstract
Software Engineering aims to effectively translate stakeholders’ requirements into executable code to fulfill their needs. Traceability from natural language use case requirements to classes in a UML class diagram, subsequently translated into code implementation, is essential in systems development and maintenance. Tasks such as assessing the impact of changes and enhancing software reusability require a clear link between these requirements and their software implementation. However, establishing such links manually across extensive codebases is prohibitively challenging. Requirements, typically articulated in natural language, embody semantics that clarify the purpose of the codebase. Conventional traceability methods, relying on textual similarities between requirements and code, often suffer from low precision due to the semantic gap between high-level natural language requirements and the syntactic nature of code. The advent of Large Language Models (LLMs) provides new methods to address this challenge through their advanced capability to interpret both natural language and code syntax. Furthermore, representing code as a knowledge graph facilitates the use of graph structural information to enhance traceability links. This paper introduces an LLM-supported retrieval augmented generation approach for enhancing requirements traceability to the class diagram of the code, incorporating keyword, vector, and graph indexing techniques, and their integrated application. We present a comparative analysis against conventional methods and among different indexing strategies and parameterizations on the performance. Our results demonstrate how this methodology significantly improves the efficiency and accuracy of establishing traceability links in software development processes.
-
dc.language.iso
en
-
dc.relation.ispartofseries
Lecture Notes in Computer Science
-
dc.subject
Large Language Models
-
dc.subject
LLM
-
dc.subject
Requirements Engineering
-
dc.subject
Requirements Traceability
-
dc.subject
Retrieval Augmented Generation
-
dc.title
Establishing Traceability Between Natural Language Requirements and Software Artifacts by Combining RAG and LLMs
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.contributor.affiliation
Microsoft (India), India
-
dc.relation.isbn
978-3-031-75872-0
-
dc.description.startpage
295
-
dc.description.endpage
314
-
dc.type.category
Full-Paper Contribution
-
tuw.booktitle
Conceptual Modeling
-
tuw.container.volume
15238
-
tuw.peerreviewed
true
-
tuw.researchTopic.id
I4
-
tuw.researchTopic.name
Information Systems Engineering
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E194-03 - Forschungsbereich Business Informatics
-
tuw.publisher.doi
10.1007/978-3-031-75872-0_16
-
dc.description.numberOfPages
20
-
tuw.author.orcid
0000-0002-0710-8052
-
tuw.author.orcid
0000-0001-8259-2297
-
tuw.event.name
43rd International Conference on Conceptual Modeling (ER 2024)
en
tuw.event.startdate
28-10-2024
-
tuw.event.enddate
31-10-2024
-
tuw.event.online
On Site
-
tuw.event.type
Event for scientific audience
-
tuw.event.place
Pittsburgh
-
tuw.event.country
US
-
tuw.event.presenter
Ali, Syed Juned
-
wb.sciencebranch
Informatik
-
wb.sciencebranch
Wirtschaftswissenschaften
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.oefos
5020
-
wb.sciencebranch.value
90
-
wb.sciencebranch.value
10
-
item.languageiso639-1
en
-
item.openairetype
conference paper
-
item.grantfulltext
none
-
item.fulltext
no Fulltext
-
item.cerifentitytype
Publications
-
item.openairecristype
http://purl.org/coar/resource_type/c_5794
-
crisitem.author.dept
Microsoft (India)
-
crisitem.author.dept
E194-03 - Forschungsbereich Business Informatics
-
crisitem.author.orcid
0000-0002-0710-8052
-
crisitem.author.orcid
0000-0001-8259-2297
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering