Criteria for choosing the number of dimensions in a principal component analysis: An empirical assessment-Reference-Cited by-同舟云学术

Criteria for choosing the number of dimensions in a principal component analysis: An empirical assessment

Published:2020-09-28 Issue: Volume: Page:
ISSN:
Container-title:Anais do XXXV Simpósio Brasileiro de Banco de Dados (SBBD 2020)
language:
Short-container-title:

Author:

Silva Renata,Oliveira Daniel,Santos Davi Pereira,Santos Lucio F.D.,Wilson Rodrigo Erthal,Bedo Marcos

Abstract

Principal component analysis (PCA) is an efficient model for the optimization problem of finding d' axes of a subspace Rd' ⊆ Rd so that the mean squared distances from a given set R of points to the axes are minimal. Despite being steadily employed since 1901 in different scenarios, e.g., mechanics, PCA has become an important link in machine learning chained tasks, such as feature learning and AutoML designs. A frequent yet open issue that arises from supervised-based problems is how many PCA axes are required for the performance of machine learning constructs to be tuned. Accordingly, we investigate the behavior of six independent and uncoupled criteria for estimating the number of PCA axes, namely Scree-Plot %, Scree Plot Gap, Kaiser-Guttman, Broken-Stick, p-Score, and 2D. In total, we evaluate the performance of those approaches in 20 high dimensional datasets by using (i) four different classifiers, and (ii) a hypothesis test upon the reported F-Measures. Results indicate Broken-Stick and Scree-Plot % criteria consistently outperformed the competitors regarding supervised-based tasks, whereas estimators Kaiser-Guttman and Scree-Plot Gap delivered poor performances in the same scenarios.

Publisher

Sociedade Brasileira de Computação - SBC

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exposure behaviour to Escherichia coli among households in Imvepi refugee settlement, Terego district Uganda;BMC Public Health;2024-07-30

2. Yellow Cardinal (Gubernatrix cristata) males respond more strongly to local than to foreign dialects;Ibis;2023-04-20

3. Wia-Spine: A CBIR environment with embedded radiomic features to assess fragility fractures;2022 IEEE 35th International Symposium on Computer-Based Medical Systems (CBMS);2022-07