Previous Article in Journal
mbX: An R Package for Streamlined Microbiome Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Evaluating Estimator Performance Under Multicollinearity: A Trade-Off Between MSE and Accuracy in Logistic, Lasso, Elastic Net, and Ridge Regression with Varying Penalty Parameters

Department of Mathematics and Statistics, Florida International University, Miami, FL 33199, USA
*
Author to whom correspondence should be addressed.
Stats 2025, 8(2), 45; https://doi.org/10.3390/stats8020045
Submission received: 5 May 2025 / Revised: 27 May 2025 / Accepted: 29 May 2025 / Published: 31 May 2025

Abstract

Multicollinearity in logistic regression models can result in inflated variances and yield unreliable estimates of parameters. Ridge regression, a regularized estimation technique, is frequently employed to address this issue. This study conducts a comparative evaluation of the performance of 23 established ridge regression estimators alongside Logistic Regression, Elastic-Net, Lasso, and Generalized Ridge Regression (GRR), considering various levels of multicollinearity within the context of logistic regression settings. Simulated datasets with high correlations (0.80, 0.90, 0.95, and 0.99) and real-world data (municipal and cancer remission) were analyzed. Both results show that ridge estimators, such as kAL1, kAL2, kKL1, and kKL2, exhibit strong performance in terms of Mean Squared Error (MSE) and accuracy, particularly in smaller samples, while GRR demonstrates superior performance in large samples. Real-world data further confirm that GRR achieves the lowest MSE in highly collinear municipal data, while ridge estimators and GRR help prevent overfitting in small-sample cancer remission data. The results underscore the efficacy of ridge estimators and GRR in handling multicollinearity, offering reliable alternatives to traditional regression techniques, especially for datasets with high correlations and varying sample sizes.
Keywords: multicollinearity; Monte Carlo Simulation; accuracy; MSE; ridge regression multicollinearity; Monte Carlo Simulation; accuracy; MSE; ridge regression

Share and Cite

MDPI and ACS Style

Nayem, H.M.; Aziz, S.; Kibria, B.M.G. Evaluating Estimator Performance Under Multicollinearity: A Trade-Off Between MSE and Accuracy in Logistic, Lasso, Elastic Net, and Ridge Regression with Varying Penalty Parameters. Stats 2025, 8, 45. https://doi.org/10.3390/stats8020045

AMA Style

Nayem HM, Aziz S, Kibria BMG. Evaluating Estimator Performance Under Multicollinearity: A Trade-Off Between MSE and Accuracy in Logistic, Lasso, Elastic Net, and Ridge Regression with Varying Penalty Parameters. Stats. 2025; 8(2):45. https://doi.org/10.3390/stats8020045

Chicago/Turabian Style

Nayem, H. M., Sinha Aziz, and B. M. Golam Kibria. 2025. "Evaluating Estimator Performance Under Multicollinearity: A Trade-Off Between MSE and Accuracy in Logistic, Lasso, Elastic Net, and Ridge Regression with Varying Penalty Parameters" Stats 8, no. 2: 45. https://doi.org/10.3390/stats8020045

APA Style

Nayem, H. M., Aziz, S., & Kibria, B. M. G. (2025). Evaluating Estimator Performance Under Multicollinearity: A Trade-Off Between MSE and Accuracy in Logistic, Lasso, Elastic Net, and Ridge Regression with Varying Penalty Parameters. Stats, 8(2), 45. https://doi.org/10.3390/stats8020045

Article Metrics

Back to TopTop