<div class="csl-bib-body">
<div class="csl-entry">Filzmoser, P., & Nordhausen, K. (2021). Robust linear regression for high-dimensional data: an overview. <i>Wiley Interdisciplinary Reviews: Computational Statistics</i>. https://doi.org/10.1002/wics.1524</div>
</div>
-
dc.identifier.issn
1939-0068
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/137149
-
dc.description.abstract
Digitization, the process of converting information into numbers, leads to larger and more complex data sets, larger also with respect to the number of measured variables. This makes it harder or even impossible for the practitioner to identify outliers, that is, observations that are inconsistent with an underlying model. Classical least-squares based procedures can be affected by such outliers. In the regression context, this means that the parameter estimates are biased, with consequences for the validity of statistical inference, for regression diagnostics, and for prediction accuracy. Robust regression methods aim at assigning appropriate weights to observations that deviate from the model. While robust regression techniques are widely known in the low-dimensional case, researchers and practitioners might still be unfamiliar with developments in this direction for high-dimensional data. Recently, different strategies have been proposed for robust regression in the high-dimensional case, typically based on dimension reduction, on shrinkage (including sparsity), and on combinations of such techniques. A very recent concept is to downweight single cells of the data matrix rather than complete observations, with the goal of making better use of the model-consistent information and thus achieving higher efficiency of the parameter estimates.
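The downweighting idea the abstract describes can be illustrated with a minimal sketch, not taken from the paper itself: iteratively reweighted least squares with Huber weights, where observations with large standardized residuals receive weight below one. The scale estimate via the MAD, the cutoff `delta=1.345`, and the function name `huber_irls` are illustrative choices, not notation from the article.

```python
import numpy as np

def huber_irls(x, y, delta=1.345, n_iter=50):
    """Huber-type robust simple regression via iteratively reweighted LS.

    Observations whose standardized residual exceeds `delta` get the
    reduced weight delta/|u|, so gross outliers barely influence the fit.
    """
    X1 = np.column_stack([np.ones(len(x)), x])
    beta = np.linalg.lstsq(X1, y, rcond=None)[0]  # ordinary LS start
    for _ in range(n_iter):
        r = y - X1 @ beta
        # Robust residual scale via the median absolute deviation (MAD)
        s = np.median(np.abs(r - np.median(r))) / 0.6745 + 1e-12
        u = np.abs(r / s)
        w = np.where(u <= delta, 1.0, delta / u)  # Huber weights in [0, 1]
        sw = np.sqrt(w)
        beta = np.linalg.lstsq(X1 * sw[:, None], y * sw, rcond=None)[0]
    return beta

# Simulated line y = 1 + 2x with five gross outliers added
rng = np.random.default_rng(0)
x = np.linspace(0.0, 10.0, 50)
y = 2.0 * x + 1.0 + rng.normal(0.0, 0.1, 50)
y[:5] += 30.0  # contaminate five observations

ols = np.linalg.lstsq(np.column_stack([np.ones(50), x]), y, rcond=None)[0]
rob = huber_irls(x, y)
```

Here the least-squares slope is pulled far from the true value 2 by the five contaminated points, while the reweighted estimate stays close to it, which is exactly the bias-versus-robustness contrast the abstract draws.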