Machine learning-based risk factor analysis and prediction model construction for the occurrence of chronic heart failure: a health ecologic research (Preprint)

Author:

Xu Qian,Cai Xue,Yu Ruicong,Zheng Yueyue,Zheng Yueyue,Sun Jing,Xu Cuirong

Abstract

BACKGROUND

Chronic heart failure is a serious threat to human health, with high morbidity and mortality rates, imposing a heavy burden on the healthcare system and society. With the abundance of medical data and the rapid development of machine learning technologies, new opportunities are provided for in-depth investigation of the mechanisms of chronic heart failure and the construction of predictive models. The introduction of health ecology research methodology enables a comprehensive dissection of chronic heart failure risk factors from a wider range of environmental, social and individual factors. This not only helps to identify high-risk groups at an early stage, but also provides a scientific basis for the development of precise prevention and intervention strategies.

OBJECTIVE

This study aims to use machine learning (ML) to construct a predictive model of the risk of occurrence of chronic heart failure (CHF) and analyze the risk of CHF from a health ecology perspective.

METHODS

This study is a retrospective cohort study based on the Jackson Heart Study. This study included 2,553 patients who did not have heart failure at baseline and used the occurrence of chronic heart failure as an outcome measure during a 10-year follow-up period. This study used machine learning algorithms to first clean the data, and then used chi-square tests and principal component analysis to select and interpret features. Finally, models were constructed based on the selected features. A total of four models were constructed that are decision tree model, random forest model, XGBoost model and stacked model.

RESULTS

Through feature selection, a total of 20 risk factors were ultimately determined, namely age, alcohol drinking, systolic blood pressure, glycosylated hemoglobin, high sensitivity C-reactive protein, heart rate, insurance type, income, education, the proportion of the population living in poverty in the region, neighborhood problems, favorable food stores (3 mile kernel), sportindex, activeindex, medical institution which usually go, ever awakened by trouble breathing, ever had swelling of feet or ankles, marriage, ratio of mv_peake to ma_peaka, history of cardiovascular diseases. The model with the best performance is XGBOOST, which has an accuracy of 0.889, a sensitivity of 0.919, and an F1 value of 0.859.

CONCLUSIONS

This study proposes an ML-based risk prediction model for the development of chronic heart failure, which uses chi-square and PCA for feature selection and interprets it in the context of health ecology. XGBoost is superior to RF and DT and can accurately and rapidly predict disease onset, provide new ideas for clinical diagnosis and disease progression, and provide effective real-time risk assessment and intervention tools for chronic heart failure patients.

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3