Context-Based Evaluation of Dimensionality Reduction Algorithms—Experiments and Statistical Significance Analysis

Published: 2021-04-30 Issue: 2 Volume: 15 Page: 1-40
ISSN: 1556-4681
Container-title: ACM Transactions on Knowledge Discovery from Data
Language: en
Short-container-title: ACM Trans. Knowl. Discov. Data

Author:

Ghosh Aindrila¹, Nashaat Mona¹, Miller James¹, Quader Shaikh²

Affiliation:

1. Electrical and Computer Engineering, University of Alberta, Canada

2. IBM Canada Software Lab, Unionville, Ontario, IBM Canada

Abstract

Dimensionality reduction is a commonly used technique in data analytics. Reducing the dimensionality of datasets helps not only with managing their analytical complexity but also with removing redundancy. Over the years, several such algorithms have been proposed, with aims ranging from generating simple linear projections to complex non-linear transformations of the input data. Subsequently, researchers have defined several quality metrics to evaluate the performance of different algorithms. Hence, given a plethora of dimensionality reduction algorithms and metrics for their quality analysis, there is a long-standing need for guidelines on how to select the most appropriate algorithm in a given scenario. To bridge this gap, in this article we have compiled 12 state-of-the-art quality metrics and categorized them into 5 identified analytical contexts. Furthermore, we assessed the 15 most popular dimensionality reduction algorithms on the chosen quality metrics using a large-scale and systematic experimental study. Then, using a set of robust non-parametric statistical tests, we assessed the generalizability of our evaluation on 40 real-world datasets. Finally, based on our results, we present practitioners’ guidelines for the selection of an appropriate dimensionality reduction algorithm in the present analytical contexts.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3428077

References: 79 articles.

1. Dimensionality reduction: A comparative review;van der Maaten L.;J. Mach. Learn. Res.,2008

2. Fast feature selection using fractal dimension;Jr C. T.;J. Inf. Data Manag.,2010

3. Dimensionality reduction for visualizing single-cell data using UMAP

Cited by 4 articles.

1. Machine Learning-Based Cellular Traffic Prediction Using Data Reduction Techniques;IEEE Access;2024

2. Food safety supply chain from perspective of big data algorithm and energy efficiency;International Journal of Global Energy Issues;2024

3. Application of Machine Learning Techniques to Help in the Feature Selection Related to Hospital Readmissions of Suicidal Behavior;International Journal of Mental Health and Addiction;2022-07-18

4. Quality-Informed Process Mining: A Case for Standardised Data Quality Annotations;ACM Transactions on Knowledge Discovery from Data;2022-04-05