Sources of bias in artificial intelligence that perpetuate healthcare disparities—A global review

Authors:

Celi Leo Anthony, Cellini Jacqueline, Charpignon Marie-Laure, Dee Edward Christopher, Dernoncourt Franck, Eber Rene, Mitchell William Greig, Moukheiber Lama, Schirmer Julian, Situ Julia, Paguio Joseph, Park Joel, Wawira Judy Gichoya, Yao Seth

Abstract

Background

While artificial intelligence (AI) offers possibilities of advanced clinical prediction and decision-making in healthcare, models trained on relatively homogeneous datasets and populations poorly representative of underlying diversity limit generalisability and risk biased AI-based decisions. Here, we describe the landscape of AI in clinical medicine to delineate population and data-source disparities.

Methods

We performed a scoping review of clinical papers published in PubMed in 2019 using AI techniques. We assessed differences in dataset country source, clinical specialty, and author nationality, sex, and expertise. A manually tagged subsample of PubMed articles was used to train a model, leveraging transfer-learning techniques (building upon an existing BioBERT model) to predict eligibility for inclusion (original, human, clinical AI literature). For all eligible articles, database country source and clinical specialty were manually labelled. A BioBERT-based model predicted first/last author expertise. Author nationality was determined from the corresponding affiliated institution using Entrez Direct, and first/last author sex was evaluated using the Gendarize.io API.

Results

Our search yielded 30,576 articles, of which 7,314 (23.9%) were eligible for further analysis. Most databases came from the US (40.8%) and China (13.7%). Radiology was the most represented clinical specialty (40.4%), followed by pathology (9.1%). Authors were primarily from either China (24.0%) or the US (18.4%). First and last authors were predominantly data experts (i.e., statisticians) (59.6% and 53.9%, respectively) rather than clinicians, and the majority of first/last authors were male (74.1%).

Interpretation

US and Chinese datasets and authors were disproportionately overrepresented in clinical AI, and almost all of the top 10 databases and author nationalities were from high-income countries (HICs). AI techniques were most commonly employed for image-rich specialties, and authors were predominantly male, with non-clinical backgrounds. Development of technological infrastructure in data-poor regions, and diligence in external validation and model re-calibration prior to clinical implementation in the short term, are crucial to ensure clinical AI is meaningful for broader populations and to avoid perpetuating global health inequity.
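The eligibility classifier described in the Methods is a standard transfer-learning setup: fine-tune a pretrained BioBERT encoder with a binary classification head on the manually tagged subsample. Below is a minimal sketch of such a setup using the Hugging Face transformers library; the checkpoint name, hyperparameters, and toy training examples are illustrative assumptions, not the authors' code.

```python
# Hedged sketch of BioBERT transfer learning for eligibility classification
# (eligible = original, human, clinical AI literature). Checkpoint, labels,
# and hyperparameters are assumptions for illustration only.
import torch
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          Trainer, TrainingArguments)

MODEL_NAME = "dmis-lab/biobert-base-cased-v1.1"  # assumed public BioBERT checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME, num_labels=2)

# Manually tagged subsample: abstract text plus a 0/1 eligibility label (toy data).
texts = [
    "Deep learning model for pneumonia detection on chest radiographs ...",
    "A review of machine learning applications in precision agriculture ...",
]
labels = [1, 0]

# Tokenize abstracts into fixed-length input tensors.
enc = tokenizer(texts, truncation=True, padding=True, max_length=512, return_tensors="pt")

class AbstractDataset(torch.utils.data.Dataset):
    """Wraps tokenized abstracts and labels for the Trainer."""
    def __init__(self, encodings, labels):
        self.encodings, self.labels = encodings, labels
    def __len__(self):
        return len(self.labels)
    def __getitem__(self, idx):
        item = {k: v[idx] for k, v in self.encodings.items()}
        item["labels"] = torch.tensor(self.labels[idx])
        return item

train_ds = AbstractDataset(enc, labels)

# Fine-tune the classification head (and encoder) on the tagged subsample.
args = TrainingArguments(output_dir="biobert-eligibility",
                         num_train_epochs=3,
                         per_device_train_batch_size=8,
                         learning_rate=2e-5)
Trainer(model=model, args=args, train_dataset=train_ds).train()
```

Once trained on the tagged subsample, the same model can score the remaining PubMed records and flag those predicted eligible for the downstream manual labelling of database country source and clinical specialty.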

Publisher

Public Library of Science (PLoS)
