Investigating for bias in healthcare algorithms: a sex-stratified analysis of supervised machine learning models in liver disease prediction-Reference-Cited by-同舟云学术

Investigating for bias in healthcare algorithms: a sex-stratified analysis of supervised machine learning models in liver disease prediction

Published:2022-04 Issue:1 Volume:29 Page:e100457
ISSN:2632-1009
Container-title:BMJ Health Care Inform
language:en
Short-container-title:BMJ Health Care Inform

Author:

Straw Isabel^ORCID,Wu Honghan

Abstract

ObjectivesThe Indian Liver Patient Dataset (ILPD) is used extensively to create algorithms that predict liver disease. Given the existing research describing demographic inequities in liver disease diagnosis and management, these algorithms require scrutiny for potential biases. We address this overlooked issue by investigating ILPD models for sex bias.MethodsFollowing our literature review of ILPD papers, the models reported in existing studies are recreated and then interrogated for bias. We define four experiments, training on sex-unbalanced/balanced data, with and without feature selection. We build random forests (RFs), support vector machines (SVMs), Gaussian Naïve Bayes and logistic regression (LR) classifiers, running experiments 100 times, reporting average results with SD.ResultsWe reproduce published models achieving accuracies of >70% (LR 71.31% (2.37 SD) – SVM 79.40% (2.50 SD)) and demonstrate a previously unobserved performance disparity. Across all classifiers females suffer from a higher false negative rate (FNR). Presently, RF and LR classifiers are reported as the most effective models, yet in our experiments they demonstrate the greatest FNR disparity (RF; −21.02%; LR; −24.07%).DiscussionWe demonstrate a sex disparity that exists in published ILPD classifiers. In practice, the higher FNR for females would manifest as increased rates of missed diagnosis for female patients and a consequent lack of appropriate care. Our study demonstrates that evaluating biases in the initial stages of machine learning can provide insights into inequalities in current clinical practice, reveal pathophysiological differences between the male and females, and can mitigate the digitisation of inequalities into algorithmic systems.ConclusionOur findings are important to medical data scientists, clinicians and policy-makers involved in the implementation medical artificial intelligence systems. An awareness of the potential biases of these systems is essential in preventing the digital exacerbation of healthcare inequalities.

Funder

UK Research and Innovation

Publisher

BMJ

Subject

Health Information Management,Health Informatics,Computer Science Applications

Reference30 articles.

1. The burden of liver disease in Europe: A review of available epidemiological data

3. A review on the sex differences in organ and system pathology with alcohol drinking;Vatsalya;Curr Drug Abuse Rev,2016

4. Sex-based disparities in liver transplant rates in the United States;Mathur;Am J Transplant,2011

5. UK Parliament, Women’s health outcomes: Is there a gender gap?, House of Lords Library, Editor. 2021, House of Lords. Available: https://lordslibrary.parliament.uk/womens-health-outcomes-is-there-a-gender-gap/

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Connected to the cloud at time of death: a case report;Journal of Medical Case Reports;2024-08-03

2. An improved mountain gazelle optimizer based on chaotic map and spiral disturbance for medical feature selection;PLOS ONE;2024-07-16

3. Insights From a Clinically Orientated Workshop on Health Care Cybersecurity and Medical Technology: Observational Study and Thematic Analysis;Journal of Medical Internet Research;2024-07-11

4. Paternalistic AI: the case of aged care;Humanities and Social Sciences Communications;2024-06-25

5. A Supervised Machine Learning Approach with Feature Selection for Sex-Specific Biomarker Prediction;2024-06-07