Affiliation:
1. Research Scholar, Assam Don Bosco University, Guwahati, India
2. South Eastern University of Sri Lanka, Oluvil, Sri Lanka
3. Department of Computer Applications, Assam Don Bosco University, Guwahati, India
4. Department of Computer Science, South Eastern University of Sri Lanka, Oluvil, Sri Lanka
Abstract
BACKGROUND: Heart disease is a severe health issue that results in high fatality rates worldwide. Identifying cardiovascular diseases such as coronary artery disease (CAD) and heart attacks through repetitive clinical data analysis is a significant task. Detecting heart disease in its early stages can save lives. The most lethal cardiovascular condition is CAD, which develops over time due to plaque buildup in coronary arteries, causing incomplete blood flow obstruction. Machine Learning (ML) is progressively used in the medical sector to detect CAD disease. OBJECTIVE: The primary aim of this work is to deliver a state-of-the-art approach to enhancing CAD prediction accuracy by using a DL algorithm in a classification context. METHODS: A unique ML technique is proposed in this study to predict CAD disease accurately using a deep learning algorithm in a classification context. An ensemble voting classifier classification model is developed based on various methods such as Naïve Bayes (NB), Logistic Regression (LR), Decision Tree (DT), XGBoost, Random Forest (RF), Convolutional Neural Network (CNN), Support Vector Machine (SVM), K Nearest Neighbor (KNN), Bidirectional LSTM and Long Short-Term Memory (LSTM). The performance of the ensemble models and a novel model are compared in this study. The Alizadeh Sani dataset, which consists of a random sample of 216 cases with CAD, is used in this study. Synthetic Minority Over Sampling Technique (SMOTE) is used to address the issue of imbalanced datasets, and the Chi-square test is used for feature selection optimization. Performance is assessed using various assessment methodologies, such as confusion matrix, accuracy, recall, precision, f1-score, and auc-roc. RESULTS: When a novel algorithm achieves the highest accuracy relative to other algorithms, it demonstrates its effectiveness in several ways, including superior performance, robustness, generalization capability, efficiency, innovative approaches, and benchmarking against baselines. These characteristics collectively contribute to establishing the novel algorithm as a promising solution for addressing the target problem in machine learning and related fields. CONCLUSION: Implementing the novel model in this study significantly improved performance, achieving a prediction accuracy rate of 92% in the detection of CAD. These findings are competitive and on par with the top outcomes among other methods.