A Guide for Sparse PCA: Model Comparison and Applications-Reference-Cited by-同舟云学术

A Guide for Sparse PCA: Model Comparison and Applications

Published:2021-06-29 Issue:4 Volume:86 Page:893-919
ISSN:0033-3123
Container-title:Psychometrika
language:en
Short-container-title:Psychometrika

Author:

Guerra-Urzola Rosember^ORCID,Van Deun Katrijn,Vera Juan C.^ORCID,Sijtsma Klaas

Abstract

AbstractPCA is a popular tool for exploring and summarizing multivariate data, especially those consisting of many variables. PCA, however, is often not simple to interpret, as the components are a linear combination of the variables. To address this issue, numerous methods have been proposed to sparsify the nonzero coefficients in the components, including rotation-thresholding methods and, more recently, PCA methods subject to sparsity inducing penalties or constraints. Here, we offer guidelines on how to choose among the different sparse PCA methods. Current literature misses clear guidance on the properties and performance of the different sparse PCA methods, often relying on the misconception that the equivalence of the formulations for ordinary PCA also holds for sparse PCA. To guide potential users of sparse PCA methods, we first discuss several popular sparse PCA methods in terms of where the sparseness is imposed on the loadings or on the weights, assumed model, and optimization criterion used to impose sparseness. Second, using an extensive simulation study, we assess each of these methods by means of performance measures such as squared relative error, misidentification rate, and percentage of explained variance for several data generating models and conditions for the population model. Finally, two examples using empirical data are considered.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,General Psychology

Link

https://link.springer.com/content/pdf/10.1007/s11336-021-09773-2.pdf

Reference54 articles.

1. Adachi, K., & Trendafilov, N. T. (2016). Sparse principal component analysis subject to prespecified cardinality of loadings. Computational Statistics, 314(4), 1403–1427. https://doi.org/10.1007/s00180-015-0608-4.

2. Baik, J., & Silverstein, J. W. (2006). Eigenvalues of large sample covariance matrices of spiked population models. Journal of Multivariate Analysis, 97(6), 1382–1408. https://doi.org/10.1016/j.jmva.2005.08.003.

3. Beck, A., & Teboulle, M. (2009). A fast iterative Shrinkage–Thresholding algorithm for linear inverse problems. SIAM Journal of Imaging Sciences, 2(1), 183–202. https://doi.org/10.1137/080716542.

4. Bertsimas, D., King, A., & Mazumder, R. (2016). Best subset selection via a modern optimization lens (Vol. 44) (No. 2). https://doi.org/10.1214/15-AOS1388

5. Cadima, J., & Jolliffe, I. T. (1995). Loadings and correlations in the interpretation of principal components. Journal of Applied Statistics, 22(2), 203–214. https://doi.org/10.1080/757584614.

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Measuring Domain Shift in Vibration Signals to Improve Cross-Domain Diagnosis of Piston Aero Engine Faults;Processes;2024-09-05

2. Topological data analysis expands the genotype to phenotype map for 3D maize root system architecture;Frontiers in Plant Science;2024-01-15

3. A critical assessment of sparse PCA (research): why (one should acknowledge that) weights are not loadings;Behavior Research Methods;2023-08-01

4. Examining Students’ Learning Styles Impacted on Learning Outcome in the MOOC Course: A Case Study;Proceedings of the 2023 9th International Conference on Frontiers of Educational Technologies;2023-06-09

5. Volatile Markers for Cancer in Exhaled Breath—Could They Be the Signature of the Gut Microbiota?;Molecules;2023-04-15