Parzer, R., Filzmoser, P., & Vana Gür, L. (2024). Random projections for classification with high-dimensional data. In Proceedings of the 38th International Workshop on Statistical Modelling (pp. 236–239).
Proceedings of the 38th International Workshop on Statistical Modelling
-
Date (published):
2024
-
Event name:
38th International Workshop on Statistical Modelling
en
Event date:
14-Jul-2024 - 19-Jul-2024
-
Event place:
Durham, United Kingdom of Great Britain and Northern Ireland (the)
-
Number of Pages:
4
-
Keywords:
high-dimensional statistics; Classification; Dimension Reduction; Random Projection
en
Abstract:
We examine the binary classification problem in a challenging high-dimensional setting with correlated predictors, where the coefficients in a logistic regression model can vary from sparse to dense. In this work, we propose the use of a data-driven random projection matrix to reduce the original feature space. The random projection combines variables considering their respective effect on the class response and such that the regression coefficient can still be recovered. In a simulation exercise, we show that the proposed random projection produces significantly better prediction results than conventional random projections, even outperforming benchmarks such as glmnet’s logistic regression with elastic net penalty.
en
Project title:
Hochdimensionales statistisches Lernen: Neue Methoden zur Förderung der Wirtschafts- und Nachhaltigkeitspolitik: ZK 35-G (FWF - Österr. Wissenschaftsfonds)
-
Research Areas:
Mathematical and Algorithmic Foundations: 40% Modeling and Simulation: 20% Fundamental Mathematics Research: 40%