Affiliation:
1. Department of Biomedical Engineering, Science and Research Branch Islamic Azad University, Tehran 13878-65911, lran
2. Department of Biomedical Engineering, Meybod University, Meybod 89169-64717, Iran
3. Medical Image and Signal Processing Research Center, Isfahan University of Medical Sciences, Isfahan 81746-73461, Iran
Abstract
Abstract
Hybrid algorithms are effective methods for solving optimization problems that rarely have been used in the gene selection procedure. This paper introduces a novel modified model for microarray data classification using an optimized gene subset selection method. The proposed approach consists of ensemble feature selection based on wrapper methods using five criteria, which reduces the data dimensions and time complexity. Five feature ranking procedures, including receiver operating characteristic curve, two-sample T-test, Wilcoxon, Bhattacharyya distance, and entropy, are used in the soft weighting method. Besides, we proposed a classification method that used the support vector machine (SVM) and metaheuristic algorithm. The optimization of the SVM hyper-parameters for the radial basis function (RBF) kernel function is performed using a modified Water Cycle Algorithm (mWCA). The results indicate that the ensemble performance of genes-mWCA SVM (EGmWS) is considered an efficient method compared to similar approaches in terms of accuracy and solving the uncertainty problem. Five benchmark microarray datasets, including leukemia, MicroRNA-Breast, diffuse large B-cell lymphoma, prostate, and colon, are employed for experiments. The highest and lowest numbers of genes are related to prostate with 12 533 genes and MicroRNA-Breast with 1926 genes, respectively. Besides, the highest and lowest numbers of samples are MicroRNA-Breast with 132 samples and colon with 62 samples, respectively. The results of classifying all data by applying effective genes of the EF-WS yielded high accuracies in microarray data classification. In addition to the robustness and simplicity of the proposed method, the model’s generalizability is another crucial aspect of the method that can be further developed to increase the accuracy while reducing classification error.
Funder
Isfahan University of Medical Sciences
Publisher
Oxford University Press (OUP)
Subject
Computational Mathematics,Computer Graphics and Computer-Aided Design,Human-Computer Interaction,Engineering (miscellaneous),Modelling and Simulation,Computational Mechanics
Reference58 articles.
1. Group search optimizer: A nature-inspired meta-heuristic optimization algorithm with its results, variants, and applications;Abualigah;Neural Computing and Applications,2021
2. A comprehensive survey of the Grasshopper optimization algorithm: Results, variants, and applications;Abualigah;Neural Computing and Applications,2020
3. Selection scheme sensitivity for a hybrid Salp Swarm Algorithm: Analysis and applications;Abualigah;Engineering with Computers,2020
4. The arithmetic optimization algorithm;Abualigah;Computer Methods in Applied Mechanics and Engineering,2021
5. A consensus multi-view multi-objective gene selection approach for improved sample classification;Acharya;BMC Bioinformatics,2020
Cited by
20 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献