reposiTUm: Outliers and compositional data

Notice
This item was automatically migrated from a legacy system. Its data has not been checked and might not meet the quality criteria of the present system.

Record link:

http://hdl.handle.net/20.500.12708/122865

Title:

Outliers and compositional data

Citation:

Filzmoser, P. (2019). Outliers and compositional data. IAMG2019, Pennsylvania, United States of America (the). http://hdl.handle.net/20.500.12708/122865

Publication Type:

Presentation - Keynote Presentation

Authors:

Filzmoser, Peter

Organisational Unit:

E105-06 - Forschungsbereich Computational Statistics

Date (published):

2019

Event name:

IAMG2019

Event date:

10-Aug-2019 - 16-Aug-2019

Event place:

Pennsylvania, United States of America (the)

Abstract:

Statistical data analysis should always be done with care if outliers are present in the data, since they have the potential to spoil the analysis. However, it is usually not clear whether multivariate data contain outliers and, in particular, whether such outliers would affect the statistical method to be used. Diagnostic plots of the results from the analysis will only reveal outliers if the method itself is robust against the outliers. Moreover, the impact of outliers depends on the statistical model being used. Identifying outliers in compositional data is even trickier because their values are unusual not in an absolute but in a relative sense. With the log-ratio approach for compositional data analysis, outliers could even be artificially created by including variables with extremely low and unreliable values, which is a frequent practical issue. We will discuss these problems and provide more detailed insight, propose some possible approaches to cope with these issues, and illustrate them with real data, mainly from the field of geochemistry.
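
The point about very low values can be illustrated with a small sketch. It is not taken from the presentation: the centered log-ratio (clr) transform is assumed here as one common log-ratio representation, and the function name `clr` and the two compositions `a` and `b` are hypothetical. The two samples differ by less than 0.01 percentage points, all of it tied to a trace part near a plausible detection limit, yet they end up far apart in log-ratio coordinates.

```python
import numpy as np

def clr(x):
    """Centered log-ratio transform; every part must be strictly positive."""
    logx = np.log(np.asarray(x, dtype=float))
    return logx - logx.mean(axis=-1, keepdims=True)

# Hypothetical 3-part compositions in percent (illustrative values only).
# They differ essentially in the trace part: 0.01 versus 0.001.
a = np.array([60.0, 39.99, 0.01])
b = np.array([60.0, 39.999, 0.001])

# Largest absolute difference between corresponding parts.
abs_diff = np.abs(a - b).max()

# Euclidean distance between clr coordinates (the Aitchison distance).
clr_diff = np.linalg.norm(clr(a) - clr(b))

print(f"max absolute difference: {abs_diff:.4f}")
print(f"distance in clr space:   {clr_diff:.4f}")
```

If the trace value is unreliable, for example because it lies close to the detection limit, the large log-ratio distance reflects measurement noise rather than a genuinely unusual sample. This is one way in which such a variable could produce artificial outliers.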

Science Branch:

Mathematik

Appears in Collections:

Presentation
