Performance Comparison of different Disease Detection using Stacked Ensemble Learning Model

Published: 2024-03, Volume: 6, Issue: 1, Pages: 26-39
ISSN: 2582-2640
Container-title: Journal of Soft Computing Paradigm
Language: en
Short-container-title: JSCP

Authors:

Paul Arunya, Kar Tejaswini, Pahadsingh Sasmita, Satpathy Priya Chandan, Behera Biswaranjan

Abstract

Assessing malignancy risk and diagnosing genetic disorders have long been difficult because conventional procedures lack precision and predictability, which complicates the identification of diseases and their root causes. Machine learning classifiers have emerged as more suitable and effective tools. Various machine learning classifiers have been used to examine different genetic disorders, and their results have been compared to determine which performs best. This study examines five classifiers: SVM, KNN, decision tree, random forest, and logistic regression. Each classifier uses a set of training variables to learn how input values correspond to their respective classes. After implementing each classifier, we applied Stacking, an ensemble machine learning technique that aggregates the predictions of the individual classifiers on the same dataset. These methods were applied to four datasets: breast cancer, diabetes, Parkinson’s, and genomic. Of the five base classifiers, SVM was the most effective, with the highest accuracy in most cases: 97.43%, 97.46%, 97.45%, and 97.44% on the genome cancer, diabetes, Parkinson’s, and breast cancer datasets, respectively. The KNN and Random Forest models were also effective, with accuracies of around 95% and 91%, respectively, across the disease datasets. The Logistic Regression and Decision Tree models also performed well. The Stacking ensemble, however, outperformed all of the base models, with accuracies above 97.5% on all four datasets.
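
The stacking setup described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: it assumes scikit-learn, uses the Wisconsin breast cancer data bundled with scikit-learn (which may differ from the dataset used in the paper), and assumes a logistic regression meta-learner, default hyperparameters, and an 80/20 split, none of which are stated in the abstract.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Assumption: scikit-learn's bundled breast cancer data stands in for the paper's dataset.
X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# The five base classifiers named in the abstract. Scale-sensitive models
# (SVM, KNN, logistic regression) are wrapped with a standard scaler.
base_learners = [
    ("svm", make_pipeline(StandardScaler(), SVC(random_state=0))),
    ("knn", make_pipeline(StandardScaler(), KNeighborsClassifier())),
    ("tree", DecisionTreeClassifier(random_state=0)),
    ("forest", RandomForestClassifier(random_state=0)),
    ("logreg", make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))),
]

# Assumption: the meta-learner is logistic regression, trained on
# 5-fold cross-validated predictions of the base classifiers.
stack = StackingClassifier(
    estimators=base_learners,
    final_estimator=LogisticRegression(max_iter=1000),
    cv=5,
)
stack.fit(X_train, y_train)

accuracy = stack.score(X_test, y_test)
print(f"Stacked ensemble test accuracy: {accuracy:.4f}")
```

Training the meta-learner on cross-validated predictions, rather than on predictions for data the base classifiers have already seen, limits leakage from the base models into the final estimator. The accuracy printed here depends on the split and settings and is not expected to reproduce the figures reported in the abstract.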

Publisher

Inventive Research Organization

References: 16 articles.

[1] A. Mahapatra, S. Pahadsingh and T. Kar, “Transfer learning based COVID-19 detection Using Radiological Images,” 2021 IEEE 2nd International Conference on Applied Electromagnetics, Signal Processing, & Communication (AESPC), Bhubaneswar, India, 2021, pp. 1-4.

[2] S. Acharya, T. Kar, U. C. Samal, and P. K. Patra, “Performance Comparison between SVM and LS-SVM for Rice Leaf Disease detection,” EAI Endorsed Scal Inf Syst, vol. 10, no. 6, Sep. 2023.

[3] S. Mohan, C. Thirumalai and G. Srivastava, “Effective Heart Disease Prediction Using Hybrid Machine Learning Techniques,” in IEEE Access, vol. 7, pp. 81542-81554, 2019.

[4] Mei, Jie, Christian Desrosiers, and Johannes Frasnelli. "Machine learning for the diagnosis of Parkinson's disease: a review of literature." Frontiers in aging neuroscience 13 (2021): 633752.

[5] Dahiwade, Dhiraj, Gajanan Patle, and Ektaa Meshram. "Designing disease prediction model using machine learning approach." In 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), pp. 1211-1215. IEEE, 2019.