Author:
Shankari B Uma, Kumar C Arun
Abstract
Feature selection is a key challenge that must be addressed before classification can take place. An effective feature selection method increases classification accuracy while reducing computation cost and time. This study considers a variety of filter approaches combined with different search algorithms. Five traditional classifiers were evaluated on the selected gene subsets: Random Forest, Sequential Minimal Optimization (SMO), Naive Bayes, Decision Trees, and K-Nearest Neighbour. The datasets chosen for this analysis are microarray gene expression data for two types of cancer: Acute Lymphocytic Leukaemia (ALL)/Acute Myeloid Leukaemia (AML) and lung cancer. According to the experimental results, a fuzzy rough subset combined with Genetic Search selects the most relevant gene subsets and yields high classifier accuracy. Among the classical classifiers considered, Random Forest achieves 94.33% accuracy on the raw dataset and 100% accuracy after feature selection is applied. The results are validated using conventional measures: Precision, Recall, F-Score, Receiver Operating Characteristic (ROC), and the Matthews correlation coefficient (MCC).
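The validation measures named in the abstract can be illustrated with a minimal sketch for the binary case (for example, ALL versus AML). It uses the standard textbook definitions of precision, recall, F-score, and MCC; the abstract does not give formulas, so this is an assumption about how they are computed, and the function name `binary_metrics` and the confusion-matrix counts are hypothetical, not taken from the study.

```python
import math


def binary_metrics(tp, fp, fn, tn):
    """Return precision, recall, F1 and MCC from binary confusion-matrix counts.

    tp/fp/fn/tn are the true-positive, false-positive, false-negative and
    true-negative counts. Undefined ratios (zero denominator) are reported as 0.0.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    # MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "mcc": mcc}


# Hypothetical counts for illustration only (not results from the study).
perfect = binary_metrics(tp=20, fp=0, fn=0, tn=14)
imperfect = binary_metrics(tp=18, fp=1, fn=2, tn=13)
```

A classifier that labels every sample correctly, as in the `perfect` case, scores 1.0 on all four measures. Unlike accuracy, MCC uses all four cells of the confusion matrix, which makes it informative when the two classes are imbalanced.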
Subject
Computer Science Applications,History,Education
Cited by
1 article.
1. Exploration of Strategies for Dual-Snake Competition for Food Based on Greedy Algorithm;Proceedings of the 2024 3rd International Conference on Cyber Security, Artificial Intelligence and Digital Economy;2024-03