Cancer Classification Utilizing Voting Classifier with Ensemble Feature Selection Method and Transcriptomic Data-Reference-Cited by-同舟云学术

Cancer Classification Utilizing Voting Classifier with Ensemble Feature Selection Method and Transcriptomic Data

Published:2023-09-14 Issue:9 Volume:14 Page:1802
ISSN:2073-4425
Container-title:Genes
language:en
Short-container-title:Genes

Author:

Khatun Rabea¹,Akter Maksuda²,Islam Md. Manowarul²^ORCID,Uddin Md. Ashraf³^ORCID,Talukder Md. Alamin²^ORCID,Kamruzzaman Joarder⁴^ORCID,Azad AKM⁵^ORCID,Paul Bikash Kumar⁶⁷,Almoyad Muhammad Ali Abdulllah⁸,Aryal Sunil³^ORCID,Moni Mohammad Ali⁹^ORCID

Affiliation:

1. Department of Computer Science and Engineering, Green University of Bangladesh, Dhaka 1207, Bangladesh

2. Department of Computer Science and Engineering, Jagannath University, Dhaka 1100, Bangladesh

3. School of Information Technology, Deakin University, Waurn Ponds Campus, Geelong, VIC 3125, Australia

4. Centre for Smart Analytics, Federation University Australia, Ballarat, VIC 3842, Australia

5. Department of Mathematics and Statistics, College of Science, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11564, Saudi Arabia

6. Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Tangail 1902, Bangladesh

7. Department of Software Engineering, Daffodil International University (DIU), Dhaka 1342, Bangladesh

8. Department of Basic Medical Sciences, College of Applied Medical Sciences in Khamis Mushyt King Khalid University, Abha 61412, Saudi Arabia

9. Artificial Intelligence & Data Science, School of Health and Rehabilitation Sciences, Faculty of Health and Behavioural Sciences, The University of Queensland, St Lucia, QLD 4072, Australia

Abstract

Biomarker-based cancer identification and classification tools are widely used in bioinformatics and machine learning fields. However, the high dimensionality of microarray gene expression data poses a challenge for identifying important genes in cancer diagnosis. Many feature selection algorithms optimize cancer diagnosis by selecting optimal features. This article proposes an ensemble rank-based feature selection method (EFSM) and an ensemble weighted average voting classifier (VT) to overcome this challenge. The EFSM uses a ranking method that aggregates features from individual selection methods to efficiently discover the most relevant and useful features. The VT combines support vector machine, k-nearest neighbor, and decision tree algorithms to create an ensemble model. The proposed method was tested on three benchmark datasets and compared to existing built-in ensemble models. The results show that our model achieved higher accuracy, with 100% for leukaemia, 94.74% for colon cancer, and 94.34% for the 11-tumor dataset. This study concludes by identifying a subset of the most important cancer-causing genes and demonstrating their significance compared to the original data. The proposed approach surpasses existing strategies in accuracy and stability, significantly impacting the development of ML-based gene analysis. It detects vital genes with higher precision and stability than other existing methods.

Funder

Deanship of Scientific Research Large Groups at King Khalid University, Kingdom of Saudi Arabia

Publisher

MDPI AG

Subject

Genetics (clinical),Genetics

Link

https://www.mdpi.com/2073-4425/14/9/1802/pdf

Reference64 articles.

1. Talukder, M.A., Islam, M.M., Uddin, M.A., Akhter, A., Pramanik, M.A.J., Aryal, S., Almoyad, M.A.A., Hasan, K.F., and Moni, M.A. (2023). An efficient deep learning model to categorize brain tumor using reconstruction and fine-tuning. Expert Syst. Appl., 120534.

2. Machine learning-based lung and colon cancer detection using deep feature extraction and ensemble learning;Talukder;Expert Syst. Appl.,2022

3. A Hybrid Dependable Deep Feature Extraction and Ensemble-based Machine Learning Approach for Breast Cancer Detection;Sharmin;IEEE Access,2023

4. World Health Organization Media Centre (2020). Cancer Fact Sheet, World Health Organization.

5. An expert system to classify microarray gene expression data using gene selection by decision tree;Horng;Expert Syst. Appl.,2009

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Utilizing Deep Feature Fusion for Automatic Leukemia Classification: An Internet of Medical Things-Enabled Deep Learning Framework;Sensors;2024-07-08

2. BrainNet: Precision Brain Tumor Classification with Optimized EfficientNet Architecture;International Journal of Intelligent Systems;2024-01