Affiliation:
1. Department of Computer Science and Engineering, National Institute of Technology, Warangal, Telangana, India
Abstract
Disease diagnosis is very important in the medical field. It is essential to diagnose chronic diseases such as diabetes, heart disease, cancer, and kidney diseases in the early stage. In recent times, ensembled-based approaches giving effective predictive performance than individual classifiers and gained attention in assisting doctors with early diagnosis. But one of the challenges in these approaches is dealing with class-imbalanced data and improper configuration of ensemble classifiers with optimized parameters. In this paper, a novel 3-level stacking approach with ADASYN oversampling technique with PSO Optimized SVM meta-model (Stacked-ADASYN-PSO) is proposed. Our proposed Stacked-ADASYN-PSO model uses base models such as Logistic regression(LR), K-Nearest neighbor (KNN), Support Vector Machine (SVM), Decision Tree (DT), and Multi-Layer Perceptron (MLP) in layer-0. In layer-1 three meta classifiers namely LR, KNN, and Bagging DT are used. In layer-2 PSO optimized SVM used as the final meta-model to combine the previous layer predictions. To evaluate the robustness of the proposed model It is tested on five benchmark disease datasets from the UCI machine learning repository. These results are compared with state-of-the-art ensemble models and non-ensemble models. Results demonstrated that the proposed model performance is superior in terms of AUC, accuracy, specificity, and precision. We have performed statistical analysis using paired T-tests with a 95% confidence level and our proposed stacking model is significantly differs when compared to base classifiers.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference57 articles.
1. Diversity based imbalance learning approach for software fault prediction using machine learning models;Manchala;Applied Soft Computing,2022
2. Smote: synthetic minority over-sampling technique;Chawla;Journal of Artificial Intelligence Research,2002
3. Han Hui , Wang Wen-Yuan and Mao Bing-Huan , Borderline-smote: a newover-sampling method in imbalanced data sets learning, In International conference on intelligent computing, pages 878–887. Springer, 2005.
4. He Haibo , Bai Yang , Garcia Edwardo A , and Li Shutao , Adasyn:Adaptive synthetic sampling approach for imbalanced learning, In 2008 IEEE international joint conference on neural networks(IEEE world congress on computational intelligence) , pages 1322–1328. IEEE 2008.
5. The effects of over and under sampling onfault-prone module detection;Kamei;First international symposiumon empirical software engineering and measurement (ESEM 2007),2007
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献