Enhancing Cardiovascular Disease Prediction: A Domain Knowledge-Based Feature Selection and Stacked Ensemble Machine Learning Approach

Author:

Rustamov Zahiriddin1,Rustamov Jaloliddin1,Zaki Nazar1,Turaev Sherzod1,Sultana Most Sarmin2,Tan Jeanne Ywei2,Balakrishnan Vimala2

Affiliation:

1. United Arab Emirates University

2. University Malaya

Abstract

AbstractCardiovascular diseases (CVDs) are prevalent disorders affecting the heart or blood arteries. Early disease detection significantly enhances survival prospects, thus emphasizing the necessity for accurate prediction methods. Emerging technologies, such as machine learning (ML), present promising avenues for more precise prediction of CVDs. However, a critical challenge lies in developing models that not only ensure optimal predictive performance but also conform to well-established domain knowledge, thereby enhancing their credibility. Single classifiers often fall short due to issues like overfitting and bias. In response, this study proposes a domain knowledge-based feature selection integrated with a stacking ensemble classifier. The Framingham Heart Study, UCI Heart Disease and UAE retrospective cohort study datasets were utilized for training and evaluation of the ML algorithms. The results indicate that the proposed domain knowledge-based feature selection performs on par with frequently adopted feature selection techniques. Moreover, the proposed stacked ensemble, in conjunction with domain knowledge-based feature selection, achieved the highest metrics with 89.66% accuracy, and 89.16% F1-score on the Framingham dataset. Similarly, the proposed method achieved an F1-score of 85.26% and 96.23% on the UCI Heart Disease and UAE datasets. Furthermore, this study employs explainable AI techniques to illuminate the decision-making process of the predictive models. Thus, the study establishes that domain knowledge-based feature selection promotes the credibility of ML models without compromising predictive performance.

Publisher

Research Square Platform LLC

Reference54 articles.

1. World Health Organization Cardiovascular Diseases (CVDs) Available online: https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds)

2. Doppala BP, Bhattacharyya D, Janarthanan M, Baik NA (2022) Reliable Machine Intelligence Model for Accurate Identification of Cardiovascular Diseases Using Ensemble Techniques. J. Healthc. Eng. 2022, doi:10.1155/2022/2585235

3. Clustering and Association Rule Mining of Cardiovascular Disease Risk Factors;Rustamov Z,2022

4. Abnane, I. A Systematic Mapping Study for Ensemble Classification Methods in Cardiovascular Disease;Hosni M;Artif Intell Rev,2021

5. Prediction of Heart Diseases Using Data Mining Techniques;Masih N;Int J Big Data Anal Healthc,2018

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3