Fair prediction of 2-year stroke risk in patients with atrial fibrillation

Author:

Gao Jifan1,Mar Philip2,Tang Zheng-Zheng1,Chen Guanhua1

Affiliation:

1. Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison , Madison, WI 53726, United States

2. Department of Internal Medicine, Saint Louis University, School of Medicine , Saint Louis, MO 63104, United States

Abstract

Abstract Objective This study aims to develop machine learning models that provide both accurate and equitable predictions of 2-year stroke risk for patients with atrial fibrillation across diverse racial groups. Materials and Methods Our study utilized structured electronic health records (EHR) data from the All of Us Research Program. Machine learning models (LightGBM) were utilized to capture the relations between stroke risks and the predictors used by the widely recognized CHADS2 and CHA2DS2-VASc scores. We mitigated the racial disparity by creating a representative tuning set, customizing tuning criteria, and setting binary thresholds separately for subgroups. We constructed a hold-out test set that not only supports temporal validation but also includes a larger proportion of Black/African Americans for fairness validation. Results Compared to the original CHADS2 and CHA2DS2-VASc scores, significant improvements were achieved by modeling their predictors using machine learning models (Area Under the Receiver Operating Characteristic curve from near 0.70 to above 0.80). Furthermore, applying our disparity mitigation strategies can effectively enhance model fairness compared to the conventional cross-validation approach. Discussion Modeling CHADS2 and CHA2DS2-VASc risk factors with LightGBM and our disparity mitigation strategies achieved decent discriminative performance and excellent fairness performance. In addition, this approach can provide a complete interpretation of each predictor. These highlight its potential utility in clinical practice. Conclusions Our research presents a practical example of addressing clinical challenges through the All of Us Research Program data. The disparity mitigation framework we proposed is adaptable across various models and data modalities, demonstrating broad potential in clinical informatics.

Funder

National Science Foundation

the Fall Research Competition

Wisconsin Alumni Research Foundation

University of Wisconsin School of Medicine and Public Health from the Wisconsin

Publisher

Oxford University Press (OUP)

Reference38 articles.

1. Stroke: causes and clinical features;Murphy;Medicine (Abingdon),2020

2. Global epidemiology of atrial fibrillation: an increasing epidemic and public health challenge;Lippi;Int J Stroke,2021

3. Atrial fibrillation and mechanisms of stroke: time for a new model;Kamel;Stroke,2016

4. Stroke prevention in atrial fibrillation;Freedman;Lancet,2016

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3