<div class="csl-bib-body">
<div class="csl-entry">Staudinger, M., El-Ebshihy, A., Ningtyas, A. M., Piroi, F., & Hanbury, A. (2024). AMATU@Simpletext2024: Are LLMs Any Good for Scientific Leaderboard Extraction? : Notebook for the SimpleText Lab at CLEF 2024. In G. Faggioli, N. Ferro, P. Galuščáková, & A. Garcia Seco de Herrera (Eds.), <i>CLEF 2024 Working Notes: Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024)</i> (pp. 3300–3316).</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/210207
-
dc.description.abstract
In this paper, we present our approach to solve the SOTA challenge of the SimpleText shared task at CLEF 2024.
The objective of the challenge is to extract all (Task, Dataset, Metric, Score) tuples from scientific papers which
report model score on benchmark datasets. In this work, we propose a rule-based classification model to identify
papers that reports score information. We then apply different methods to extract TDMS using: (1) a baseline
model from the literature, and (2) two Large Language Models (LLMs), GPT-3.5 and Mistral. Results show that the
baseline model outperforms the LLMs in most cases, especially in zero-shot settings, with improvements seen in
few-shot settings. Manual investigation shows that extracting TDMS from paper text is challenging, particularly
for "Dataset" and "Score" extraction.
en
dc.language.iso
en
-
dc.relation.ispartofseries
CEUR Workshop Proceedings
-
dc.subject
Scientific Text Extraction
en
dc.subject
State-of-the-art
en
dc.subject
Entity Extraction
en
dc.subject
Relation Extraction
en
dc.title
AMATU@Simpletext2024: Are LLMs Any Good for Scientific Leaderboard Extraction? : Notebook for the SimpleText Lab at CLEF 2024
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.description.startpage
3300
-
dc.description.endpage
3316
-
dc.type.category
Full-Paper Contribution
-
dc.relation.eissn
1613-0073
-
tuw.booktitle
CLEF 2024 Working Notes: Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024)
-
tuw.peerreviewed
true
-
tuw.researchTopic.id
I4
-
tuw.researchTopic.name
Information Systems Engineering
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E194-04 - Forschungsbereich Data Science
-
tuw.publication.orgunit
E058-06 - Fachbereich Zentrum für Forschungsdatenmanagement
-
dc.description.numberOfPages
17
-
tuw.author.orcid
0000-0002-5164-2690
-
tuw.author.orcid
0000-0001-7584-6439
-
tuw.author.orcid
0000-0002-7149-5843
-
tuw.editor.orcid
0000-0002-5070-2049
-
tuw.editor.orcid
0000-0001-6328-7131
-
tuw.editor.orcid
0000-0002-6509-5325
-
tuw.event.name
CLEF 2024 SimpleText
en
tuw.event.startdate
09-09-2024
-
tuw.event.enddate
12-09-2024
-
tuw.event.online
On Site
-
tuw.event.type
Event for scientific audience
-
tuw.event.place
Grenoble
-
tuw.event.country
FR
-
tuw.event.presenter
Staudinger, Moritz
-
wb.sciencebranch
Informatik
-
wb.sciencebranch
Wirtschaftswissenschaften
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.oefos
5020
-
wb.sciencebranch.value
90
-
wb.sciencebranch.value
10
-
item.openairecristype
http://purl.org/coar/resource_type/c_5794
-
item.cerifentitytype
Publications
-
item.languageiso639-1
en
-
item.fulltext
no Fulltext
-
item.openairetype
conference paper
-
item.grantfulltext
none
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.dept
E194-01 - Forschungsbereich Software Engineering
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.dept
E058-06 - Fachbereich Zentrum für Forschungsdatenmanagement
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.orcid
0000-0002-5164-2690
-
crisitem.author.orcid
0000-0001-7584-6439
-
crisitem.author.orcid
0000-0002-7149-5843
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering
-
crisitem.author.parentorg
E058 - Forschungs-, Technologie- und Innovationssupport
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering