Subpopulation-specific machine learning prognosis for underrepresented patients with double prioritized bias correction

Published: 2022-09-01 Issue: 1 Volume: 2
ISSN: 2730-664X
Container-title: Communications Medicine
Language: en
Short-container-title: Commun Med

Author:

Afrose Sharmin, Song Wenjia, Nemeroff Charles B., Lu Chang, Yao Danfeng

Abstract

Background: Many clinical datasets are intrinsically imbalanced, dominated by overwhelming majority groups. Off-the-shelf machine learning models that optimize the prognosis of majority patient types (e.g., healthy class) may cause substantial errors on the minority prediction class (e.g., disease class) and demographic subgroups (e.g., Black or young patients). In the typical one-machine-learning-model-fits-all paradigm, racial and age disparities are likely to exist, but unreported. In addition, some widely used whole-population metrics give misleading results.

Methods: We design a double prioritized (DP) bias correction technique to mitigate representational biases in machine learning-based prognosis. Our method trains customized machine learning models for specific ethnicity or age groups, a substantial departure from the one-model-predicts-all convention. We compare with other sampling and reweighting techniques in mortality and cancer survivability prediction tasks.

Results: We first provide empirical evidence showing various prediction deficiencies in a typical machine learning setting without bias correction. For example, missed death cases are 3.14 times higher than missed survival cases for mortality prediction. Then, we show DP consistently boosts the minority class recall for underrepresented groups, by up to 38.0%. DP also reduces relative disparities across race and age groups, e.g., up to 88.0% better than the 8 existing sampling solutions in terms of the relative disparity of minority class recall. Cross-race and cross-age-group evaluation also suggests the need for subpopulation-specific machine learning models.

Conclusions: Biases exist in the widely accepted one-machine-learning-model-fits-all-population approach. We invent a bias correction method that produces specialized machine learning prognostication models for underrepresented racial and age groups. This technique may reduce potentially life-threatening prediction mistakes for minority populations.
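The abstract's evaluation quantities (whole-population accuracy, minority class recall per subgroup, and relative disparity across subgroups) can be illustrated with a minimal Python sketch. This is not the authors' DP method or their evaluation code. The function names, the toy labels, and the group names "A" and "B" are hypothetical, and the definition of relative disparity used here (largest per-group value divided by the smallest) is an assumption, since the abstract does not define it.

```python
from collections import defaultdict


def accuracy(y_true, y_pred):
    """Whole-population accuracy: fraction of correct predictions."""
    return sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)


def minority_recall(y_true, y_pred, positive=1):
    """Recall on the minority (positive) class: TP / (TP + FN)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    return tp / (tp + fn) if (tp + fn) else float("nan")


def recall_by_group(y_true, y_pred, groups, positive=1):
    """Minority class recall computed separately for each demographic group."""
    buckets = defaultdict(lambda: ([], []))
    for t, p, g in zip(y_true, y_pred, groups):
        buckets[g][0].append(t)
        buckets[g][1].append(p)
    return {g: minority_recall(ts, ps, positive) for g, (ts, ps) in buckets.items()}


def relative_disparity(values):
    """ASSUMED definition: largest per-group value divided by the smallest."""
    vals = list(values)
    lo, hi = min(vals), max(vals)
    return hi / lo if lo > 0 else float("inf")


# Hypothetical toy data: group "A" is larger, group "B" is underrepresented.
# Label 1 is the minority prediction class (e.g., disease), 0 the majority.
groups = ["A"] * 10 + ["B"] * 6
y_true = [0] * 6 + [1] * 4 + [0] * 4 + [1] * 2
y_pred = [0] * 6 + [1, 1, 1, 0] + [0] * 4 + [1, 0]

overall_acc = accuracy(y_true, y_pred)
per_group = recall_by_group(y_true, y_pred, groups)
disparity = relative_disparity(per_group.values())

print(f"accuracy={overall_acc:.3f}")
print(f"minority recall by group={per_group}")
print(f"relative disparity={disparity:.2f}")
```

In this toy example the whole-population accuracy looks high while the underrepresented group has a noticeably lower minority class recall, which is the kind of gap the abstract says whole-population metrics can hide.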

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s43856-022-00165-w.pdf

References: 48 articles.

1. Parisot, S. et al. Disease prediction using graph convolutional networks: application to autism spectrum disorder and Alzheimer’s disease. Med. Image Anal. 48, 117–130 (2018).

2. Malav, A., Kadam, K. & Kamat, P. Prediction of heart disease using k-means and artificial neural network as hybrid approach to improve accuracy. Int. J. Eng. Technol. 9, 3081–3085 (2017).

3. Bora, A. et al. Predicting the risk of developing diabetic retinopathy using deep learning. Lancet Digit. Health https://doi.org/10.1016/S2589-7500(20)30250-8 (2020).

4. Ten Haaf, K. et al. Risk prediction models for selection of lung cancer screening candidates: a retrospective validation study. PLoS Med. 14, e1002277 (2017).

5. Hegselmann, S., Gruelich, L., Varghese, J. & Dugas, M. Reproducible survival prediction with SEER cancer data. In Proc. 3rd Machine Learning for Healthcare Conference 49–66 (PMLR, 2018).

Cited by 15 articles.

1. Seeing the random forest through the decision trees. Supporting learning health systems from histopathology with machine learning models: Challenges and opportunities;Journal of Pathology Informatics;2024-12

2. Sex-Based Performance Disparities in Machine Learning Algorithms for Cardiac Disease Prediction: Exploratory Study;Journal of Medical Internet Research;2024-08-26

3. Enhancing neuro-oncology care through equity-driven applications of artificial intelligence;Neuro-Oncology;2024-08-19

4. A survey of recent methods for addressing AI fairness and bias in biomedicine;Journal of Biomedical Informatics;2024-06

5. Personalising intravenous to oral antibiotic switch decision making through fair interpretable machine learning;Nature Communications;2024-01-13