Emeakaroha, V. C., Maurer, M., Stern, P., Łabaj, P. P., Brandic, I., & Kreil, D. P. (2013). Managing and Optimizing Bioinformatics Workflows for Data Analysis in Clouds. Journal of Grid Computing, 11(3), 407–428. https://doi.org/10.1007/s10723-013-9260-9
Software; Hardware and Architecture; Information Systems; Cloud computing; Computer Networks and Communications; Workflow execution; Resource level monitoring; Application level monitoring; Workflow management; Knowledge database
-
Abstract:
The rapid advancements in recent years of high-throughput technologies in the life sciences are facilitating the generation and storage of huge amount of data in different databases. Despite significant developments in computing capacity and performance, an analysis of these large-scale data in a search for biomedical relevant patterns remains a challenging task. Scientific workflow applications are deemed to support data-mining in more complex scenarios that include many data sources and computational tools, as commonly found in bioinformatics. A scientific workflow application is a holistic unit that defines, executes, and manages scientific applications using different software tools. Existing workflow applications are process- or data- rather than resource-oriented. Thus, they lack efficient computational resource management capabilities, such as those provided by Cloud computing environments. Insufficient computational resources disrupt the execution of workflow applications, wasting time and money. To address this issue, advanced resource monitoring and management strategies are required to determine the resource consumption behaviours of workflow applications to enable a dynamical allocation and deallocation of resources. In this paper, we present a novel Cloud management infrastructure consisting of resource level-, application level monitoring techniques, and a knowledge management strategy to manage computational resources for supporting workflow application executions in order to guarantee their performance goals and their successful completion. We present the design description of these techniques, demonstrate how they can be applied to scientific workflow applications, and present detailed evaluation results as a proof of concept.
en
Project title:
Foundations of Self-governing ICT Infrastructures (WWTF Wiener Wissenschafts-, Forschu und Technologiefonds)
-
Research Areas:
Distributed and Parallel Systems: 95% Computer Science Foundations: 5%