Abstract
Medical science-related studies have reinforced that the prevalence of coronary heart disease which is associated with the heart and blood vessels has been the most significant cause of health loss and death globally. Recently, data mining and machine learning have been used to detect diseases based on the unique characteristics of a person. However, these techniques have often posed challenges due to the complexity in understanding the objective of the datasets, the existence of too many factors to analyze as well as lack of performance accuracy. This research work is of two-fold effort: firstly, feature extraction and selection. This entails extraction of the principal components, and consequently, the Correlation-based Feature Selection (CFS) method was applied to select the finest principal components of the combined (Cleveland and Statlog) heart dataset. Secondly, by applying datasets to three single and three ensemble classifiers, the best hyperparameters that reflect the pre-eminent predictive outcomes were investigated. The experimental result reveals that hyperparameter optimization has improved the accuracy of all the models. In the comparative studies, the proposed work outperformed related works with an accuracy of 97.91%, and an AUC of 0.996 by employing six optimal principal components selected from the CFS method and optimizing parameters of the Rotation Forest ensemble classifier.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference64 articles.
1. Decision tree-based diagnosis of coronary artery disease: CART model;Ghiasi;Comput. Methods Programs Biomed.,2020
2. HDPM: An Effective Heart Disease Prediction Model for a Clinical Decision Support System;Fitriyani;IEEE Access,2020
3. Prediction of Heart Disease Using Feature Selection and Random Forest Ensemble Method;Yadav;Int. J. Pharm. Res.,2020
4. Shahid, A.H., Singh, M.P., Roy, B., and Aadarsh, A. (2020, January 9–12). Coronary Artery Disease Diagnosis Using Feature Selection Based Hybrid Extreme Learning Machine. Proceedings of the 2020 3rd International Conference on Information and Computer Technologies (ICICT), San Jose, CA, USA.
5. WHO (2021, October 14). 2020. [Online], Available online: https://www.who.int/health-topics/cardiovascular-diseases/#tab=tab_1.
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献