Assessing socioeconomic bias in machine learning algorithms in health care: a case study of the HOUSES index

Author:

Juhn Young J12,Ryu Euijung3,Wi Chung-Il12,King Katherine S3ORCID,Malik Momin4ORCID,Romero-Brufau Santiago5ORCID,Weng Chunhua6,Sohn Sunghwan7,Sharp Richard R8,Halamka John D49ORCID

Affiliation:

1. Precision Population Science Lab, Mayo Clinic , Rochester, Minnesota, USA

2. Artificial Intelligence Program of Department of Pediatric and Adolescent Medicine, Mayo Clinic , Rochester, Minnesota, USA

3. Department of Quantitative Health Sciences, Mayo Clinic , Rochester, Minnesota, USA

4. Center for Digital Health, Mayo Clinic , Rochester, Minnesota, USA

5. Department of Internal Medicine, Mayo Clinic , Rochester, Minnesota, USA

6. Department of Biomedical Informatics, Columbia University , New York, New York, USA

7. Department of Artificial Intelligence and Informatics, Mayo Clinic , Rochester, Minnesota, USA

8. Biomedical Ethics Program, Mayo Clinic , Rochester, Minnesota, USA

9. Mayo Clinic Platform , Rochester, Minnesota, USA

Abstract

Abstract Objective Artificial intelligence (AI) models may propagate harmful biases in performance and hence negatively affect the underserved. We aimed to assess the degree to which data quality of electronic health records (EHRs) affected by inequities related to low socioeconomic status (SES), results in differential performance of AI models across SES. Materials and Methods This study utilized existing machine learning models for predicting asthma exacerbation in children with asthma. We compared balanced error rate (BER) against different SES levels measured by HOUsing-based SocioEconomic Status measure (HOUSES) index. As a possible mechanism for differential performance, we also compared incompleteness of EHR information relevant to asthma care by SES. Results Asthmatic children with lower SES had larger BER than those with higher SES (eg, ratio = 1.35 for HOUSES Q1 vs Q2–Q4) and had a higher proportion of missing information relevant to asthma care (eg, 41% vs 24% for missing asthma severity and 12% vs 9.8% for undiagnosed asthma despite meeting asthma criteria). Discussion Our study suggests that lower SES is associated with worse predictive model performance. It also highlights the potential role of incomplete EHR data in this differential performance and suggests a way to mitigate this bias. Conclusion The HOUSES index allows AI researchers to assess bias in predictive model performance by SES. Although our case study was based on a small sample size and a single-site study, the study results highlight a potential strategy for identifying bias by using an innovative SES measure.

Funder

National Institute of Health (NIH)-funded

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3