<div class="csl-bib-body">
<div class="csl-entry">Djukanovic, M., Raidl, G. R., & Blum, C. (2020). Finding Longest Common Subsequences: New anytime A* search results. <i>Applied Soft Computing</i>, <i>95</i>, Article 106499. https://doi.org/10.1016/j.asoc.2020.106499</div>
</div>
-
dc.identifier.issn
1568-4946
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/141589
-
dc.description.abstract
The Longest Common Subsequence (LCS) problem aims at finding a longest string that is a subsequence of each string from a given set of input strings. This problem has applications, in particular, in the context of bioinformatics, where strings represent DNA or protein sequences. Existing approaches include numerous heuristics, but only a few exact approaches, limited to rather small problem instances. Adopting various aspects from leading heuristics for the LCS, we first propose an exact A* search approach, which performs well in comparison to earlier exact approaches in the context of small instances. On the basis of A* search we then develop two hybrid A*–based algorithms in which classical A* iterations are alternated with beam search and anytime column search, respectively. A key feature to guide the heuristic search in these approaches is the usage of an approximate expected length calculation for the LCS of uniform random strings. Even for large problem instances these anytime A* variants yield reasonable solutions early during the search and improve on them over time. Moreover, they terminate with proven optimality if enough time and memory is given. Furthermore, they yield
upper bounds and, thus, quality guarantees when terminated early. We comprehensively evaluate the proposed methods using most of the available benchmark sets from the literature and compare to the current state-of-the-art methods. In particular, our algorithms are able to obtain new best results for 82 out of 117 instance groups. Moreover, in most cases they also provide significantly smaller optimality gaps than other anytime algorithms.
en
dc.language.iso
en
-
dc.publisher
ELSEVIER
-
dc.relation.ispartof
Applied Soft Computing
-
dc.subject
Software
en
dc.subject
A* search
en
dc.subject
longest common subsequence problem
en
dc.subject
Hybrid metaheuristic
en
dc.subject
Anytime column search
en
dc.subject
Beam search
en
dc.title
Finding Longest Common Subsequences: New anytime A* search results
en
dc.type
Artikel
de
dc.type
Article
en
dc.type.category
Original Research Article
-
tuw.container.volume
95
-
tuw.journal.peerreviewed
true
-
tuw.peerreviewed
true
-
wb.publication.intCoWork
International Co-publication
-
tuw.researchTopic.id
I1
-
tuw.researchTopic.name
Logic and Computation
-
tuw.researchTopic.value
100
-
dcterms.isPartOf.title
Applied Soft Computing
-
tuw.publication.orgunit
E192-01 - Forschungsbereich Algorithms and Complexity
-
tuw.publisher.doi
10.1016/j.asoc.2020.106499
-
dc.date.onlinefirst
2020-06-26
-
dc.identifier.articleid
106499
-
dc.identifier.eissn
1872-9681
-
dc.description.numberOfPages
21
-
tuw.author.orcid
0000-0003-1358-3789
-
tuw.author.orcid
0000-0002-1736-3559
-
wb.sci
true
-
wb.sciencebranch
Informatik
-
wb.sciencebranch.oefos
1020
-
wb.facultyfocus
Logic and Computation (LC)
de
wb.facultyfocus
Logic and Computation (LC)
en
wb.facultyfocus.faculty
E180
-
item.languageiso639-1
en
-
item.openairetype
research article
-
item.grantfulltext
restricted
-
item.fulltext
no Fulltext
-
item.cerifentitytype
Publications
-
item.openairecristype
http://purl.org/coar/resource_type/c_2df8fbb1
-
crisitem.author.dept
E192-01 - Forschungsbereich Algorithms and Complexity
-
crisitem.author.dept
E192-01 - Forschungsbereich Algorithms and Complexity