Feature selection using differential evolution for microarray data classification

Published:2023-10-05 Issue:1 Volume:3 Page:
ISSN:2730-7239
Container-title:Discover Internet of Things
language:en
Short-container-title:Discov Internet Things

Author:

Prajapati Sanjay, Das Himansu, Gourisaria Mahendra Kumar

Abstract

Microarray datasets are very high-dimensional and contain noise and redundancy. Their main difficulty is that they have far more features than samples (more columns than rows), which adversely affects algorithm performance. Extracting precise information from them therefore requires a robust technique. Microarray datasets play a critical role in detecting various diseases, including cancer and tumors, and this is where feature selection comes into play. Feature selection (FS) has recently gained significant importance as a data preparation method, particularly for high-dimensional data. Classification problems are preferably solved with fewer features while maintaining high accuracy, since not all features are needed to reach that goal. The primary objective of feature selection is to identify the optimal subset of features. For this purpose we employ the Differential Evolution (DE) algorithm, a population-based stochastic search approach widely used in scientific and technical domains to solve optimization problems in continuous spaces. We combine DE with three classification algorithms: Random Forest (RF), Decision Tree (DT), and Logistic Regression (LR). Our analysis compares the accuracy achieved by each model on each dataset, as well as the fitness error of each model. The results indicate that the models performed better with feature selection than without it.
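The abstract does not spell out how DE is wired to the classifiers, so the following is only a minimal sketch of one common wrapper arrangement, not the authors' implementation. It assumes each DE individual is a real-valued vector in [0, 1] that is thresholded at 0.5 into a feature mask, uses the DE/rand/1/bin variant, and scores a mask by classification error plus a small penalty on the number of selected features. To stay dependency-free, a nearest-centroid classifier evaluated on the training data stands in for RF, DT, or LR. All names (`mask_from`, `centroid_error`, `fitness`, `de_select`) and parameter values are hypothetical.

```python
import random

def mask_from(vector, threshold=0.5):
    """Map a continuous DE vector to a boolean feature mask (assumed encoding)."""
    return [v > threshold for v in vector]

def centroid_error(X, y, mask):
    """Training error of a nearest-centroid classifier on the selected columns.
    Stand-in for RF/DT/LR; an empty mask gets the worst possible error."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 1.0
    centroids = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        centroids[label] = [sum(r[j] for r in rows) / len(rows) for j in cols]
    wrong = 0
    for x, t in zip(X, y):
        pred = min(
            centroids,
            key=lambda c: sum((x[j] - m) ** 2 for j, m in zip(cols, centroids[c])),
        )
        wrong += pred != t
    return wrong / len(y)

def fitness(X, y, vector, penalty=0.01):
    """Fitness error: classification error plus a small subset-size penalty."""
    mask = mask_from(vector)
    return centroid_error(X, y, mask) + penalty * sum(mask) / len(mask)

def de_select(X, y, pop_size=20, generations=30, F=0.5, CR=0.9, seed=0):
    """DE/rand/1/bin over [0, 1]^d; returns the best mask and its fitness."""
    rng = random.Random(seed)
    dim = len(X[0])
    pop = [[rng.random() for _ in range(dim)] for _ in range(pop_size)]
    scores = [fitness(X, y, v) for v in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Three distinct individuals, none of them the target i.
            a, b, c = rng.sample([k for k in range(pop_size) if k != i], 3)
            j_rand = rng.randrange(dim)  # guarantees at least one mutated gene
            trial = []
            for j in range(dim):
                if rng.random() < CR or j == j_rand:
                    v = pop[a][j] + F * (pop[b][j] - pop[c][j])
                    trial.append(min(1.0, max(0.0, v)))  # clip to [0, 1]
                else:
                    trial.append(pop[i][j])
            s = fitness(X, y, trial)
            if s <= scores[i]:  # greedy one-to-one selection
                pop[i], scores[i] = trial, s
    best = min(range(pop_size), key=scores.__getitem__)
    return mask_from(pop[best]), scores[best]

if __name__ == "__main__":
    # Synthetic data: column 0 carries the class signal, columns 1-5 are noise.
    data_rng = random.Random(1)
    y = [i % 2 for i in range(40)]
    X = [[2.0 * t + data_rng.gauss(0, 0.3)] + [data_rng.gauss(0, 1) for _ in range(5)]
         for t in y]
    mask, score = de_select(X, y)
    print("selected features:", [j for j, keep in enumerate(mask) if keep])
    print("fitness:", round(score, 4))
```

In a realistic setting the stand-in classifier would be replaced by the model under study and the error would be measured on held-out or cross-validated data rather than on the training set.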

Publisher

Springer Science and Business Media LLC

Subject

General Earth and Planetary Sciences,General Energy

Link

https://link.springer.com/content/pdf/10.1007/s43926-023-00042-5.pdf

References: 40 articles.

1. Kim J, Yoon Y, Park HJ, Kim YH. Comparative study of classification algorithms for various DNA microarray data. Genes. 2022;13(3):494.

2. Cho SB, Won HH. Machine learning in DNA microarray analysis for cancer classification. Proc First Asia-Pacific Bioinform Conf Bioinform. 2003;2003(19):189–98.

3. Dasgupta A, Nath A. Classification of machine learning algorithms. Int J Innov Res Adv Eng (IJIRAE). 2016;3(3):6–11.

4. Das H, Naik B, Behera HS. Disease classification using linguistic neuro-fuzzy model. In: Progress in Computing, Analytics and Networking: Proceedings of ICCAN 2019. Springer Singapore; 2020. p. 45–53.

5. Abdullah MN, Yap BW, Sapri NNFF, Wan Yaacob WF. Multi-class classification for breast cancer with high dimensional microarray data using machine learning classifier. In: Data Science and Emerging Technologies: Proceedings of DaSET 2022. Singapore: Springer Nature Singapore; 2023. p. 329–42.

Cited by 2 articles.

1. Machine Learning Meets Meta-Heuristics: Bald Eagle Search Optimization and Red Deer Optimization for Feature Selection in Type II Diabetes Diagnosis;Bioengineering;2024-07-29

2. FSBOA: feature selection using bat optimization algorithm for software fault detection;Discover Internet of Things;2024-07-18