Mutual Information-Based Variable Selection on Latent Class Cluster Analysis-Reference-Cited by-同舟云学术

Mutual Information-Based Variable Selection on Latent Class Cluster Analysis

Published:2022-04-29 Issue:5 Volume:14 Page:908
ISSN:2073-8994
Container-title:Symmetry
language:en
Short-container-title:Symmetry

Author:

Riyanto Andreas^ORCID,Kuswanto Heri^ORCID,Prastyo Dedy Dwi^ORCID

Abstract

Machine learning techniques are becoming indispensable tools for extracting useful information. Among many machine learning techniques, variable selection is a solution used for converting high-dimensional data into simpler data while still preserving the characteristics of the original data. Variable selection aims to find the best subset of variables that produce the smallest generalization error; it can also reduce computational complexity, storage, and costs. The variable selection method developed in this paper was part of a latent class cluster (LCC) analysis—i.e., it was not a pre-processing step but, instead, formed part of LCC analysis. Many studies have shown that variable selection in LCC analysis suffers from computational problems and has difficulty meeting local dependency assumptions—therefore, in this study, we developed a method for selecting variables using mutual information (MI) in LCC analysis. Mutual information (MI) is a symmetrical measure of information that is carried by two random variables. The proposed method was applied to MI-based variable selection in LCC analysis, and, as a result, four variables were selected for use in LCC-based village clustering.

Publisher

MDPI AG

Subject

Physics and Astronomy (miscellaneous),General Mathematics,Chemistry (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2073-8994/14/5/908/pdf

Reference30 articles.

1. Toward Integrating Feature Selection Algorithms for Classification and Clustering;Liu;IEEE Trans. Knowl. Data Eng.,2005

2. A review of feature selection methods based on mutual information

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Machine learning approach for predicting state transitions via shank acceleration data during freezing of gait in Parkinson’s disease;Biomedical Signal Processing and Control;2024-06

2. An advanced variable selection method based on information gain and Fisher criterion reselection iteration for multivariate calibration;Chemometrics and Intelligent Laboratory Systems;2023-04

3. Clustering Stock Prices of Financial Sector Using K-Means Clustering With Dynamic Time Warping;2022 6th International Conference on Information Technology, Information Systems and Electrical Engineering (ICITISEE);2022-12-13