Generalized Penalized Constrained Regression: Sharp Guarantees in High Dimensions with Noisy Features

Author:

Alrashdi Ayed M.1ORCID,Alazmi Meshari2,Alrasheedi Masad A.3

Affiliation:

1. Department of Electrical Engineering, College of Engineering, University of Ha’il, Ha’il 81441, Saudi Arabia

2. Department of Information and Computer Science, College of Computer Science and Engineering, University of Ha’il, Ha’il 81411, Saudi Arabia

3. Department of Management Information Systems, College of Business Administration, Taibah University, Madinah 42353, Saudi Arabia

Abstract

The generalized penalized constrained regression (G-PCR) is a penalized model for high-dimensional linear inverse problems with structured features. This paper presents a sharp error performance analysis of the G-PCR in the over-parameterized high-dimensional setting. The analysis is carried out under the assumption of a noisy or erroneous Gaussian features matrix. To assess the performance of the G-PCR problem, the study employs multiple metrics such as prediction risk, cosine similarity, and the probabilities of misdetection and false alarm. These metrics offer valuable insights into the accuracy and reliability of the G-PCR model under different circumstances. Furthermore, the derived results are specialized and applied to well-known instances of G-PCR, including l1-norm penalized regression for sparse signal recovery and l2-norm (ridge) penalization. These specific instances are widely utilized in regression analysis for purposes such as feature selection and model regularization. To validate the obtained results, the paper provides numerical simulations conducted on both real-world and synthetic datasets. Using extensive simulations, we show the universality and robustness of the results of this work to the assumed Gaussian distribution of the features matrix. We empirically investigate the so-called double descent phenomenon and show how optimal selection of the hyper-parameters of the G-PCR can help mitigate this phenomenon. The derived expressions and insights from this study can be utilized to optimally select the hyper-parameters of the G-PCR. By leveraging these findings, one can make well-informed decisions regarding the configuration and fine-tuning of the G-PCR model, taking into consideration the specific problem at hand as well as the presence of noisy features in the high-dimensional setting.

Funder

Deputyship for Research & Innovation, Ministry of Education, Saudi Arabia

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Reference78 articles.

1. Proximal algorithms;Parikh;Found. Trends Optim.,2014

2. Tarantola, A. (2005). Inverse Problem Theory and Methods for Model Parameter Estimation, SIAM.

3. Kailath, T., Sayed, A.H., and Hassibi, B. (2000). Linear Estimation, Prentice Hall.

4. Groetsch, C.W., and Groetsch, C. (1993). Inverse Problems in the Mathematical Sciences, Springer.

5. Pattern recognition;Bishop;Mach. Learn.,2006

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3