<div class="csl-bib-body">
<div class="csl-entry">Kusa, W., Lipani, A., Knoth, P., & Hanbury, A. (2023). VoMBaT: a tool for visualising evaluation measure behaviour in high-recall search tasks. In <i>SIGIR ’23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval</i> (pp. 3105–3109). Association for Computing Machinery. https://doi.org/10.1145/3539618.3591802</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/189546
-
dc.description.abstract
The objective of High-Recall Information Retrieval (HRIR) is to retrieve as many relevant documents as possible for a given search topic. One approach to HRIR is Technology-Assisted Review (TAR), which uses information retrieval and machine learning techniques to aid the review of large document collections. TAR systems are commonly used in legal eDiscovery and systematic literature reviews. Successful TAR systems are able to find the majority of relevant documents using the least number of assessments. Commonly used retrospective evaluation assumes that the system achieves a specific, fixed recall level first, and then measures the precision or work saved (e.g., precision at r% recall). This approach can cause problems related to understanding the behaviour of evaluation measures in a fixed recall setting. It is also problematic when estimating time and money savings during technology-assisted reviews. This paper presents a new visual analytics tool to explore the dynamics of evaluation measures depending on recall level. We implemented 18 evaluation measures based on the confusion matrix terms, both from general IR tasks and specific to TAR. The tool allows for a comparison of the behaviour of these measures in a fixed recall evaluation setting. It can also simulate savings in time and money and a count of manual vs automatic assessments for different datasets depending on the model quality. The tool is open-source, and the demo is available under the following URL: https://vombat.streamlit.app.
en
dc.description.sponsorship
European Commission
-
dc.language.iso
en
-
dc.rights.uri
http://creativecommons.org/licenses/by/4.0/
-
dc.subject
citation screening
en
dc.subject
evaluation
en
dc.subject
evaluation measures
en
dc.subject
systematic reviews
en
dc.subject
technology-assisted reviews
en
dc.subject
TNR
en
dc.subject
visual analytics
en
dc.title
VoMBaT: a tool for visualising evaluation measure behaviour in high-recall search tasks
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.rights.license
Creative Commons Namensnennung 4.0 International
de
dc.rights.license
Creative Commons Attribution 4.0 International
en
dc.contributor.affiliation
University College London, United Kingdom of Great Britain and Northern Ireland (the)
-
dc.contributor.affiliation
The Open University, United Kingdom of Great Britain and Northern Ireland (the)