Title: Distributed big data frameworks
Other Titles: Verteilte Big Data Frameworks: Eine Übersicht
Language: English
Authors: Becker, Moritz 
Qualification level: Diploma
Advisor: Pichler, Reinhard 
Issue Date: 2017
Number of Pages: 143
Qualification level: Diploma
The ever increasing amount of data that the modern internet society produces poses challenges to corporations and information systems that need to store and process this data. In addition, novel trends like the internet of things even adumbrate a prospectively steeper increase of the data volume than in the past, thereby supporting the relevance of big data. In order to overcome the gap between storage capacity and data access speed while maintaining the economic feasibility of data processing, the industry has created frameworks that allow the horizontal scaling of data processing on large clusters of commodity hardware. The plethora of technologies that have since been developed makes the entrance to the field of big data processing increasingly hard. Therefore, this thesis identifies the major types of big data processing along with the programming models that have been designed to cover them. In addition, an introductory overview of the most important open source frameworks and technologies along with practical examples of how they can be used is given for each processing type. The thesis concludes by pointing out important extension projects to the presented base systems and by suggesting the conduction of a performance-centric comparison of Apache Spark and Apache Hadoop that can help to establish a more profound understanding of the nature of these systems and to identify potential novel research topics.
Keywords: verteilte Verarbeitungsmethoden; Big Data
distributed processing methods; Big Data
URI: https://resolver.obvsg.at/urn:nbn:at:at-ubtuw:1-103714
Library ID: AC14486179
Organisation: E184 - Institut für Informationssysteme 
Publication Type: Thesis
Appears in Collections:Thesis

Files in this item:

File Description SizeFormat
Becker Moritz - 2017 - Distributed big data frameworks.pdf1.53 MBAdobe PDFThumbnail
Show full item record

Page view(s)

checked on May 6, 2021


checked on May 6, 2021

Google ScholarTM


Items in reposiTUm are protected by copyright, with all rights reserved, unless otherwise indicated.