<div class="csl-bib-body">
<div class="csl-entry">Banyasz, D., Hofstätter, S., & Hanbury, A. (2023). Search in Archival Facsimile Documents for Digital History. In <i>2023 IEEE 19th International Conference on e-Science (e-Science)</i>. IEEE 19th International Conference on eScience 2023, Limassol, Cyprus. IEEE. https://doi.org/10.1109/e-Science58273.2023.10254826</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/192602
-
dc.description.abstract
Recent advances in text digitization and processing have opened up many possibilities for historical archives to be processed and digitized in an efficient and automated manner. Processing steps such as language detection, optical character recognition (OCR), named entity recognition (NER), recognition error detection, and automated or manual correction can result in digitized archives that provide both high-quality facsimile representations of original document scans and extracted text metadata close to the original text in a machine-friendly format. Exploration of digitally enhanced archives is an important step forward in the future workflow of archivists and historians alike. After analysing the requirements of these users, we propose a concept for dynamically generating retrieval-relevant facsimile image snippets. This work demonstrates a Human-in-the-Loop retrieval and research workflow based on these methods by providing a search user interface prototype geared towards intuitively exploring topics across a multilingual historical facsimile archive corpus.
en
dc.language.iso
en
-
dc.subject
archival document
en
dc.subject
digital history
en
dc.subject
information retrieval
en
dc.subject
user interface
en
dc.title
Search in Archival Facsimile Documents for Digital History
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.relation.publication
2023 IEEE 19th International Conference on e-Science (e-Science)
-
dc.relation.isbn
979-8-3503-2223-1
-
dc.relation.doi
10.1109/e-Science58273.2023
-
dc.relation.issn
2325-372X
-
dc.type.category
Full-Paper Contribution
-
dc.relation.eissn
2325-3703
-
tuw.booktitle
2023 IEEE 19th International Conference on e-Science (e-Science)
-
tuw.peerreviewed
true
-
tuw.relation.publisher
IEEE
-
tuw.researchTopic.id
I4
-
tuw.researchTopic.name
Information Systems Engineering
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E194-04 - Forschungsbereich Data Science
-
tuw.publisher.doi
10.1109/e-Science58273.2023.10254826
-
dc.description.numberOfPages
10
-
tuw.author.orcid
0000-0002-7149-5843
-
tuw.event.name
IEEE 19th International Conference on eScience 2023
en
tuw.event.startdate
09-10-2023
-
tuw.event.enddate
13-10-2023
-
tuw.event.online
On Site
-
tuw.event.type
Event for scientific audience
-
tuw.event.place
Limassol
-
tuw.event.country
CY
-
tuw.event.institution
IEEE
-
tuw.event.presenter
Banyasz, David
-
tuw.event.track
Multi Track
-
wb.sciencebranch
Informatik
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.value
100
-
item.languageiso639-1
en
-
item.openairetype
conference paper
-
item.grantfulltext
none
-
item.fulltext
no Fulltext
-
item.cerifentitytype
Publications
-
item.openairecristype
http://purl.org/coar/resource_type/c_5794
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.orcid
0000-0002-7149-5843
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering