<div class="csl-bib-body">
<div class="csl-entry">Hollaus, F. (2021). <i>Restoration of multispectral images of ancient documents</i> [Dissertation, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2022.99025</div>
</div>
-
dc.identifier.uri
https://doi.org/10.34726/hss.2022.99025
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/19463
-
dc.description.abstract
This thesis is concerned with the restoration of images of historical documents. The ancient writings imaged contain partially faded-out characters or are degraded by background variations. MultiSpectral Imaging (MSI) has proven to be a valuable tool for the non-invasive investigation of such ancient manuscripts, since it can be used to acquire information that is invisible to the human eye. The document images which are examined in this work, have been acquired with a portable MSI system. The imaging in narrowband spectral ranges led to a considerable legibility increase. The images taken form the basis for two kinds of restoration techniques that are introduced in this work: First, an enhancement method is proposed that projects the multispectral samples on a lower dimensional space by applying an Linear Discriminant Analysis (LDA) based transformation. Thus, not only the dimensionality of the multispectral images is lowered, but also the legibility of the degraded writings is increased. A qualitative analysis conducted by philologists shows that the method partially outperforms unsupervised dimension reduction methods, which are used in previous works. The second aim of this work is the separation of the ancient writings from the remaining background. Such binarization methods are used as a preprocessing step for other document image analysis methods, including OCR (OCR) or writer identification. Multiple binarization methods have been developed for the multispectral document images considered: Two methods make use of a target detection algorithm, which is used to determine if ink is present within the multispectral samples. A further binarization method is introduced, which makes use of Gaussian Mixture Model (GMM) based clustering. The methods introduced make use of spatial and spectral information. Furthermore, a Fully Convolutional Network (FCN) is used for the binarization task. The methods are evaluated on two databases: First, the methods are applied on the MultiSpectral Text Extraction (MS-TEx) dataset, where the methods achieve promising results. The best performances are gained by the target detection-based methods. These methods participated in the MS-TEx 2015 contest, where they were ranked first and second. Second, the methods are evaluated on the MultiSpectral Document Binarization (MSBin) dataset. This dataset is larger and allows for a successful training of the FCN, which outperforms the remaining binarization methods. Nevertheless, the results gained by all methods proposed are superior to the results which are gained by a traditional binarization approach that is designed for grayscale images.
en
dc.language
English
-
dc.language.iso
en
-
dc.rights.uri
http://rightsstatements.org/vocab/InC/1.0/
-
dc.subject
Binärisierung
de
dc.subject
Dokumentenanalyse
de
dc.subject
Multispektral Aufnahme
de
dc.subject
Bildverbesserung
de
dc.subject
Binarization
en
dc.subject
Document Image Analysis
en
dc.subject
Multi-Spectral Imaging
en
dc.subject
Image Enhancement
en
dc.title
Restoration of multispectral images of ancient documents
en
dc.type
Thesis
en
dc.type
Hochschulschrift
de
dc.rights.license
In Copyright
en
dc.rights.license
Urheberrechtsschutz
de
dc.identifier.doi
10.34726/hss.2022.99025
-
dc.contributor.affiliation
TU Wien, Österreich
-
dc.rights.holder
Fabian Hollaus
-
dc.publisher.place
Wien
-
tuw.version
vor
-
tuw.thesisinformation
Technische Universität Wien
-
tuw.publication.orgunit
E193 - Institut für Visual Computing and Human-Centered Technology
-
dc.type.qualificationlevel
Doctoral
-
dc.identifier.libraryid
AC16435999
-
dc.description.numberOfPages
155
-
dc.thesistype
Dissertation
de
dc.thesistype
Dissertation
en
tuw.author.orcid
0000-0002-5708-3666
-
dc.rights.identifier
In Copyright
en
dc.rights.identifier
Urheberrechtsschutz
de
tuw.advisor.staffStatus
staff
-
tuw.advisor.orcid
0000-0003-4195-1593
-
item.languageiso639-1
en
-
item.openairetype
doctoral thesis
-
item.grantfulltext
open
-
item.fulltext
with Fulltext
-
item.cerifentitytype
Publications
-
item.mimetype
application/pdf
-
item.openairecristype
http://purl.org/coar/resource_type/c_db06
-
item.openaccessfulltext
Open Access
-
crisitem.author.dept
E193-01 - Forschungsbereich Computer Vision
-
crisitem.author.parentorg
E193 - Institut für Visual Computing and Human-Centered Technology