Robust statistical methods for outlier detection with application to household expenditure data

Gussenbauer, Johannes

doi:10.34726/hss.2015.25895

DC Field

Value

Language

dc.contributor.advisor

Templ, Matthias

dc.contributor.author

Gussenbauer, Johannes

dc.date.accessioned

2020-06-30T22:12:37Z

dc.date.issued

2015

dc.date.submitted

2015-10

dc.identifier.citation

<div class="csl-bib-body"> <div class="csl-entry">Gussenbauer, J. (2015). <i>Robust statistical methods for outlier detection with application to household expenditure data</i> [Diploma Thesis, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2015.25895</div> </div>

dc.identifier.uri

https://doi.org/10.34726/hss.2015.25895

dc.identifier.uri

http://hdl.handle.net/20.500.12708/14511

dc.description

Abweichender Titel laut Übersetzung der Verfasserin/des Verfassers

dc.description.abstract

Outlier detection can be seen as a pre-processing step for locating data points in a data sample, which do not conform with the rest of the data. Various techniques and methods for outlier detection can be found in the literature dealing with different data types. In this master thesis the data sets used for outlier detection methods are household expenditure data from five countries. Based on classical estimates of the Gini coefficient these data sets are suspected to contain outlier. In order to detect data points that deviate from the rest of the data, one- and multi-dimensional outlier detection methods are applied on the household expenditure data. The outlier detection methods are based on robust estimates and incorporate, in some cases, the use of sample weights. Important issues concerning the data and outlier detection methods are the number of missing values in each data set as well as the position of true outliers, which is completely unknown. The main focus of this thesis lies in the understanding of the outlier detection methods and their in uence of the estimated Gini coefficient. Apart from applying the outlier detection methods on the various data sets and presenting the results, a recommendation on which of the outlier detection methods should be preferred when it comes to outlier detection on household expenditure data is presented in this work. In order to give a recommendation for outlier detection methods it is important to get a clearer vision of the performance of each outlier detection method on household expenditure data. To help understand the performance of the different outlier detection methods a simulation study, based on the original data from the survey, was conducted. The simulation study and all other calculations where executed using the R-programming language.

dc.language

English

dc.language.iso

dc.rights.uri

http://rightsstatements.org/vocab/InC/1.0/

dc.subject

Ausreissererkennung

dc.subject

Robustheit

dc.subject

Konsumdaten

dc.subject

Outlier Detection

dc.subject

Robustness

dc.subject

Expenditures Data

dc.title

Robust statistical methods for outlier detection with application to household expenditure data

dc.title.alternative

Robuste Ausreissererkennung in Konsumdaten

dc.type

Thesis

dc.type

Hochschulschrift

dc.rights.license

In Copyright

dc.rights.license

Urheberrechtsschutz

dc.identifier.doi

10.34726/hss.2015.25895

dc.contributor.affiliation

TU Wien, Österreich

dc.rights.holder

Johannes Gussenbauer

tuw.version

vor

tuw.thesisinformation

Technische Universität Wien

dc.contributor.assistant

Filzmoser, Peter

tuw.publication.orgunit

E105 - Institut für Stochastik und Wirtschaftsmathematik

dc.type.qualificationlevel

Diploma

dc.identifier.libraryid

AC12670635

dc.description.numberOfPages

dc.identifier.urn

urn:nbn:at:at-ubtuw:1-89066

dc.thesistype

Diplomarbeit

dc.thesistype

Diploma Thesis

dc.rights.identifier

In Copyright

dc.rights.identifier

Urheberrechtsschutz

tuw.advisor.staffStatus

staff

tuw.assistant.staffStatus

staff

tuw.assistant.orcid

0000-0002-8014-4682

item.languageiso639-1

item.openairetype

master thesis

item.grantfulltext

open

item.fulltext

with Fulltext

item.cerifentitytype

Publications

item.mimetype

application/pdf

item.openairecristype

http://purl.org/coar/resource_type/c_bdcc

item.openaccessfulltext

Open Access

crisitem.author.dept

TU Wien

Appears in Collections:

Thesis

Fulltext (Version of Record (published version))

Adobe PDF

(1.11 MB)

In Copyright

Show simple item record

Page view(s)

291

checked on Nov 21, 2023

Download(s)

192

checked on Nov 21, 2023

Google Scholar^TM

Check

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM