A Factor Analysis Perspective on Linear Regression in the ‘More Predictors than Samples’ Case-Reference-Cited by-同舟云学术

A Factor Analysis Perspective on Linear Regression in the ‘More Predictors than Samples’ Case

Published:2021-08-03 Issue:8 Volume:23 Page:1012
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Ciobanu Sebastian^ORCID,Ciortuz Liviu

Abstract

Linear regression (LR) is a core model in supervised machine learning performing a regression task. One can fit this model using either an analytic/closed-form formula or an iterative algorithm. Fitting it via the analytic formula becomes a problem when the number of predictors is greater than the number of samples because the closed-form solution contains a matrix inverse that is not defined when having more predictors than samples. The standard approach to solve this issue is using the Moore–Penrose inverse or the L2 regularization. We propose another solution starting from a machine learning model that, this time, is used in unsupervised learning performing a dimensionality reduction task or just a density estimation one—factor analysis (FA)—with one-dimensional latent space. The density estimation task represents our focus since, in this case, it can fit a Gaussian distribution even if the dimensionality of the data is greater than the number of samples; hence, we obtain this advantage when creating the supervised counterpart of factor analysis, which is linked to linear regression. We also create its semisupervised counterpart and then extend it to be usable with missing data. We prove an equivalence to linear regression and create experiments for each extension of the factor analysis model. The resulting algorithms are either a closed-form solution or an expectation–maximization (EM) algorithm. The latter is linked to information theory by optimizing a function containing a Kullback–Leibler (KL) divergence or the entropy of a random variable.

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/23/8/1012/pdf

Reference24 articles.

1. Generative and Discriminative Classifiers: Naive Bayes and Logistic Regression. (Additional Chapter to Machine Learning; McGraw-Hill: New York, NY, USA, 1997.) Published Online;Mitchell,2017

2. Machine Learning: A Probabilistic Perspective;Murphy,2012

3. Machine Learning Course, Lecture Notes, Mixtures of Gaussians and the EM Algorithmhttp://cs229.stanford.edu/notes2020spring/cs229-notes7b.pdf

4. Machine Learning Course, Homework 4, pr 1.1; CMU: Pittsburgh, PA, USA, 2010; p. 528 in Ciortuz, L.; Munteanu, A.; Bădărău, Ehttps://bit.ly/320ZuIk

5. Machine Learning Course, Lecture Notes, Part Xhttp://cs229.stanford.edu/notes2020spring/cs229-notes9.pdf

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Use of Regression Models to Measure the Relationship between Electronic Media Use and Sleep Duration;SSRN Electronic Journal;2024