LASSO and Elastic Net Tend to Over-Select Features-Reference-Cited by-同舟云学术

LASSO and Elastic Net Tend to Over-Select Features

Published:2023-08-30 Issue:17 Volume:11 Page:3738
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Liu Lu¹,Gao Junheng¹,Beasley Georgia²³,Jung Sin-Ho¹^ORCID

Affiliation:

1. Department of Biostatistics and Bioinformatics, Duke University, Durham, NC 27708, USA

2. Department of Surgery, Duke University Medical Center, Durham, NC 27710, USA

3. Duke Cancer Institute, Durham, NC 27710, USA

Abstract

Machine learning methods have been a standard approach to select features that are associated with an outcome and to build a prediction model when the number of candidate features is large. LASSO is one of the most popular approaches to this end. The LASSO approach selects features with large regression estimates, rather than based on statistical significance, that are associated with the outcome by imposing an L1-norm penalty to overcome the high dimensionality of the candidate features. As a result, LASSO may select insignificant features while possibly missing significant ones. Furthermore, from our experience, LASSO has been found to select too many features. By selecting features that are not associated with the outcome, we may have to spend more cost to collect and manage them in the future use of a fitted prediction model. Using the combination of L1- and L2-norm penalties, elastic net (EN) tends to select even more features than LASSO. The overly selected features that are not associated with the outcome act like white noise, so that the fitted prediction model may lose prediction accuracy. In this paper, we propose to use standard regression methods, without any penalizing approach, combined with a stepwise variable selection procedure to overcome these issues. Unlike LASSO and EN, this method selects features based on statistical significance. Through extensive simulations, we show that this maximum likelihood estimation-based method selects a very small number of features while maintaining a high prediction power, whereas LASSO and EN make a large number of false selections to result in loss of prediction accuracy. Contrary to LASSO and EN, the regression methods combined with a stepwise variable selection method is a standard statistical method, so that any biostatistician can use it to analyze high-dimensional data, even without advanced bioinformatics knowledge.

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/17/3738/pdf

Reference26 articles.

1. Incremental Benefits of Machine Learning—When Do We Need a Better Mousetrap;Engelhard;JAMA Cardiol.,2021

2. Regression Shrinkage and Selection via the Lasso;Tibshirani;J. R. Stat. Soc. Ser. B Methodol.,1996

3. Lee, J., Sohn, I., Do, I.G., Kim, K.M., Park, S.H., Park, J.O., Park, Y.S., Lim, H.Y., Sohn, T.S., and Bae, J.M. (2014). Nanostring-based multigene assay to predict recurrence for gastric cancer patients after surgery. PLoS ONE, 9.

4. Standardization and the group LASSO penalty;Simon;Stat. Sin.,2012

5. Regularization and Variable Selection via the Elastic Net;Zou;J. R. Stat. Soc. Ser. B Stat. Methodol.,2005

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-objective optimization and thermodynamic assessment of a solar unit with a novel tube shape equipped with a helical tape;Applied Thermal Engineering;2024-10

2. Repeated Sieving for Prediction Model Building with High-Dimensional Data;Journal of Personalized Medicine;2024-07-19

3. Genetic Algorithm Selection of Interacting Features (GASIF) for Selecting Biological Gene-Gene Interactions;Proceedings of the Genetic and Evolutionary Computation Conference;2024-07-14