Sparse and robust modeling for high-dimensional data

Hoffmann, Irene

doi:10.34726/hss.2017.28000

DC Field

Value

Language

dc.contributor.advisor

Filzmoser, Peter

dc.contributor.author

Hoffmann, Irene

dc.date.accessioned

2020-06-29T16:37:44Z

dc.date.issued

2017

dc.date.submitted

2017-11

dc.identifier.citation

<div class="csl-bib-body"> <div class="csl-entry">Hoffmann, I. (2017). <i>Sparse and robust modeling for high-dimensional data</i> [Dissertation, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2017.28000</div> </div>

dc.identifier.uri

https://doi.org/10.34726/hss.2017.28000

dc.identifier.uri

http://hdl.handle.net/20.500.12708/7446

dc.description.abstract

The development of statistical methods for high-dimensional data has become an important focus in recent research. Classical regression and classification approaches require full rank data matrices, with more observations than variables. In many areas of application (e.g. bioinformatics and chemometrics) this assumption is not met. Sparse methods describe a class of approaches where a penalty is imposed on the coefficient estimate to favour exact zero values and so intrinsically perform variable selection. Another challenge in many applications are outliers in the data, which are observations that do not follow the structure of the majority of the data and so violate the distribution assumptions which are necessary for classical model estimation. Robust methods give stable estimates when outliers are present and model the relationship of the majority of the data. The focus of this thesis is on the development of regression and classification methods, which are appropriate for high-dimensional data and data with outliers. Sparse partial robust M regression is a robust and sparse regression method. A robust subspace is identified, including only a subset of the original variables, where a robust regression model is estimated. This approach is then extended to binary classification problems. With the help of the optimal scoring approach, regression methods can be applied to classification problems. Robust sparse optimal scoring is a classification method based on least trimmed squares regression. Finally, sparse and robust linear regression and logistic regression methods are introduced based on least trimmed squares with an elastic net penalty, which induces sparsity and at the same time favours similar coefficient estimates for highly correlated variables.

dc.language

English

dc.language.iso

dc.rights.uri

http://rightsstatements.org/vocab/InC/1.0/

dc.subject

Sparsity

dc.subject

Robustness

dc.title

Sparse and robust modeling for high-dimensional data

dc.type

Thesis

dc.type

Hochschulschrift

dc.rights.license

In Copyright

dc.rights.license

Urheberrechtsschutz

dc.identifier.doi

10.34726/hss.2017.28000

dc.contributor.affiliation

TU Wien, Österreich

dc.rights.holder

Irene Hoffmann

dc.publisher.place

Wien

tuw.version

vor

tuw.thesisinformation

Technische Universität Wien

tuw.publication.orgunit

E105 - Institut für Stochastik und Wirtschaftsmathematik

dc.type.qualificationlevel

Doctoral

dc.identifier.libraryid

AC14489651

dc.description.numberOfPages

114

dc.identifier.urn

urn:nbn:at:at-ubtuw:1-104007

dc.thesistype

Dissertation

dc.thesistype

Dissertation

dc.rights.identifier

In Copyright

dc.rights.identifier

Urheberrechtsschutz

tuw.advisor.staffStatus

staff

tuw.advisor.orcid

0000-0002-8014-4682

item.languageiso639-1

item.openairetype

doctoral thesis

item.grantfulltext

open

item.fulltext

with Fulltext

item.cerifentitytype

Publications

item.mimetype

application/pdf

item.openairecristype

http://purl.org/coar/resource_type/c_db06

item.openaccessfulltext

Open Access

crisitem.author.dept

E105 - Institut für Stochastik und Wirtschaftsmathematik

crisitem.author.parentorg

E100 - Fakultät für Mathematik und Geoinformation

Appears in Collections:

Thesis

Fulltext (Version of Record (published version))

Adobe PDF

(996.67 kB)

In Copyright

Show simple item record

Page view(s)

259

checked on Nov 19, 2023

Download(s)

checked on Nov 19, 2023

Google Scholar^TM

Check

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM