Title: Rule-based recommender for feature engineering in big data
Language: English
Authors: Lepadat, Mihai-Alexandru 
Qualification level: Diploma
Keywords: Feature Engineering; Recommender; Machine Learning; Apache Spark; Ontology
Advisor: Tjoa, A Min  
Assisting Advisor: Knees, Peter  
Issue Date: 2019
Number of Pages: 72
Qualification level: Diploma
Abstract: 
Feature engineering is of high importance for the success of many machine learning algorithms and requires domain-specific knowledge. Generally, this knowledge is only familiar to domain experts or incorporated into programs. We developed a knowledge-driven approach to support users during feature engineering and implemented a software application to evaluate this approach. The knowledge is represented in Web Ontology Language (OWL) and its main purpose is to offer the user a flexible way to tackle domain-specific datasets by building a reusable and comprehensible knowledge base. A semantic reasoner makes use of this knowledge to infer properties and provide users with recommendations. All data-related operations are performed in a scalable cluster computing engine backed up by Apache Spark. The evaluation is done on 6 freely available datasets from the domain of demographics. We were able to identify only a small fraction of recommendations that proved to be wrong.
URI: https://resolver.obvsg.at/urn:nbn:at:at-ubtuw:1-125384
http://hdl.handle.net/20.500.12708/13802
Library ID: AC15381183
Organisation: E194 - Institut für Information Systems Engineering 
Publication Type: Thesis
Hochschulschrift
Appears in Collections:Thesis

Files in this item:

Show full item record

Page view(s)

25
checked on Feb 21, 2021

Download(s)

29
checked on Feb 21, 2021

Google ScholarTM

Check


Items in reposiTUm are protected by copyright, with all rights reserved, unless otherwise indicated.