<div class="csl-bib-body">
<div class="csl-entry">Djukanovic, M. (2021). <i>Exact and heuristic approaches for solving string problems from bioinformatics</i> [Dissertation, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2021.93203</div>
</div>
-
dc.identifier.uri
https://doi.org/10.34726/hss.2021.93203
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/18223
-
dc.description.abstract
This thesis provides several new algorithms for solving prominent string problems from the literature, most of these being variants of the well-known longest common subsequence (LCS) problem. Given a set of input strings, a longest common subsequence is a string of maximum length that can be obtained from each input string by removing letters, i.e., which is a common subsequence of all input strings. The problem is known to be NP–hard and challenging to solve in practice for the general case of an arbitrary set of input strings. Besides the basic LCS problem variant, we consider here the following important variants: the longest common palindromic subsequence problem, the arc-preserving LCS problem, the longest common square subsequence problem, the repetition-free LCS problem, and the constrained LCS problem. These problems provide a range of important measures which serve for detecting similarities between molecules of various structures. Concerning heuristic approaches, we propose a general beam search framework in which many previously described methods from literature can be expressed. In particular, new state-of-the-art results were obtained on various benchmark sets utilizing a novel heuristic guidance that approximates the expected solution length of three different string problems. For solving the longest common square subsequence problem, a hybrid of a Reduced Variable Neighborhood Search method and a Beam search technique has been proposed. Concerning exact techniques, two kinds of methods are proposed: (i) pure exact methods based on A∗ search and (ii) anytime algorithms that build upon the A∗ search framework. Experimental results indicate that this A∗ search is also able to outperform all previously published more specific exact algorithms for the longest common palindromic subsequence and the constrained longest common subsequence problems with two input strings. Concerning anytime algorithms, we first make use of the derived A∗ search framework such that classical A∗ iterations are interleaved with beam search runs. Later, another anytime algorithm variant is proposed in which the beam search part is replaced by a major iteration of the Anytime Column Search. New state-of-the-art results are produced and better final optimality gaps were obtained by the latter hybrid, in comparison to a several state-of-the-art anytime algorithms from the literature. As an alternative exact approach, we further consider the transformation of LCS problem instances into Maximum Clique (MC) problem instance on the basis of so-called conflict graphs. In this way, state-of-the-art MC solvers can be utilized for solving the LCS problem instances. Further, an effective conflict graph reduction based on suboptimality checks is proposed. In conjunction with the general-purpose mixed integer linear programming solver Cplex, new state-of-the-art results are obtained on a wide range of benchmark instances.
en
dc.language
English
-
dc.language.iso
en
-
dc.rights.uri
http://rightsstatements.org/vocab/InC/1.0/
-
dc.subject
Combinatorial optimization
en
dc.subject
stringology
en
dc.subject
bioinformatics
en
dc.subject
anytime algorithms
en
dc.subject
best-first search
en
dc.subject
beam search
en
dc.subject
maximum clique problem
en
dc.subject
variable neighbourhood search
en
dc.subject
hybrid metaheuristics
en
dc.title
Exact and heuristic approaches for solving string problems from bioinformatics
en
dc.type
Thesis
en
dc.type
Hochschulschrift
de
dc.rights.license
In Copyright
en
dc.rights.license
Urheberrechtsschutz
de
dc.identifier.doi
10.34726/hss.2021.93203
-
dc.contributor.affiliation
TU Wien, Österreich
-
dc.rights.holder
Marko Djukanovic
-
dc.publisher.place
Wien
-
tuw.version
vor
-
tuw.thesisinformation
Technische Universität Wien
-
tuw.publication.orgunit
E192 - Institut für Logic and Computation
-
dc.type.qualificationlevel
Doctoral
-
dc.identifier.libraryid
AC16272098
-
dc.description.numberOfPages
225
-
dc.thesistype
Dissertation
de
dc.thesistype
Dissertation
en
dc.rights.identifier
In Copyright
en
dc.rights.identifier
Urheberrechtsschutz
de
tuw.advisor.staffStatus
staff
-
tuw.advisor.orcid
0000-0002-3293-177X
-
item.languageiso639-1
en
-
item.openairetype
doctoral thesis
-
item.grantfulltext
open
-
item.fulltext
with Fulltext
-
item.cerifentitytype
Publications
-
item.mimetype
application/pdf
-
item.openairecristype
http://purl.org/coar/resource_type/c_db06
-
item.openaccessfulltext
Open Access
-
crisitem.author.dept
E192-01 - Forschungsbereich Algorithms and Complexity