Elimination and Backward Selection of Features (P-Value Technique) In Prediction of Heart Disease by Using Machine Learning Algorithms-Reference-Cited by-同舟云学术

Elimination and Backward Selection of Features (P-Value Technique) In Prediction of Heart Disease by Using Machine Learning Algorithms

Published:2021-04-05 Issue:6 Volume:12 Page:2650-2665
ISSN:1309-4653
Container-title:Turkish Journal of Computer and Mathematics Education (TURCOMAT)
language:
Short-container-title:TURCOMAT

Author:

Saurabh Pal Ritu Aggrawal,

Abstract

Background: Early speculation of cardiovascular disease can help determine the lifestyle change options of high-risk patients, thereby reducing difficulties. We propose a coronary heart disease data set analysis technique to predict people’s risk of danger based on people’s clinically determined history. The methods introduced may be integrated into multiple uses, such for developing decision support system, developing a risk management network, and help for experts and clinical staff. Methods: We employed the Framingham Heart study dataset, which is publicly available Kaggle, to train several machine learning classifiers such as logistic regression (LR), K-nearest neighbor (KNN), Naïve Bayes (NB), decision tree (DT), random forest (RF) and gradient boosting classifier (GBC) for disease prediction. The p-value method has been used for feature elimination, and the selected features have been incorporated for further prediction. Various thresholds are used with different classifiers to make predictions. In order to estimating the precision of the classifiers, ROC curve, confusion matrix and AUC value are considered for model verification. The performance of the six classifiers is used for comparison to predict chronic heart disease (CHD). Results: After applying the p-value backward elimination statistical method on the 10-year CHD data set, 6 significant features were selected from 14 features with p <0.5. In the performance of machine learning classifiers, GBC has the highest accuracy score, which is 87.61%. Conclusions: Statistical methods, such as the combination of p-value backward elimination method and machine learning classifiers, thereby improving the accuracy of the classifier and shortening the running time of the machine.

Publisher

Auricle Technologies, Pvt., Ltd.

Subject

Computational Theory and Mathematics,Computational Mathematics,General Mathematics,Education

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Spatial variability of clay minerals in a semi-arid region of Turkiye;Geoderma Regional;2024-09

2. Optimizing the light gradient-boosting machine algorithm for an efficient early detection of coronary heart disease;Informatics and Health;2024-09

3. Individualized Machine-learning-based Clinical Assessment Recommendation System;2024-07-24

4. Online Detection and Infographic Explanation of Spam Reviews with Data Drift Adaptation;Informatica;2024

5. Enhancing Human–Computer Interaction in Online Education: A Machine Learning Approach to Predicting Student Emotion and Satisfaction;International Journal of Human–Computer Interaction;2023-12-19