Diagnosing and remediating harmful data shifts for the responsible deployment of clinical AI models-Reference-Cited by-同舟云学术

Diagnosing and remediating harmful data shifts for the responsible deployment of clinical AI models

Published:2023-03-29 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Subasri Vallijah^ORCID,Krishnan Amrit,Dhalla Azra,Pandya Deval,Malkin David^ORCID,Razak Fahad,Verma Amol A.,Goldenberg Anna^ORCID,Dolatabadi Elham^ORCID

Abstract

AbstractHarmful data shifts occur when the distribution of data used to train a clinical AI system differs significantly from the distribution of data encountered during deployment, leading to erroneous predictions and potential harm to patients. We evaluated the impact of data shifts on an early warning system for in-hospital mortality that uses electronic health record data from patients admitted to a general internal medicine service, across 7 large hospitals in Toronto, Canada. We found model performance to differ across subgroups of clinical diagnoses, sex and age. To explore the robustness of the model, we evaluated potentially harmful data shifts across demographics, hospital types, seasons, time of hospital admission, and whether the patient was admitted from an acute care institution or nursing home, without relying on model performance. Interestingly, many of these harmful data shifts were unidirectional. We found models trained on community hospitals experience harmful data shifts when evaluated on academic hospitals, whereas models trained on academic hospitals transfer well to the community hospitals. To improve model performance across hospital sites we employed transfer learning, a strategy that stores knowledge gained from learning one domain and applies it to a different but related domain. We found hospital type-specific models that leverage transfer learning, perform better than models that use all available hospitals. Furthermore, we monitored data shifts over time and identified model deterioration during the COVID-19 pandemic. Typically, machine learning models remain locked after deployment, however, this can lead to model deterioration due to harmful data shifts that occur over time. We used continual learning, the process of learning from a continual stream of data in a sequential manner, to mitigate data shifts over time and improve model performance. Overall, our study is a crucial step towards the deployment of clinical AI models, by providing strategies and workflows to ensure the safety and efficacy of these models in real-world settings.

Publisher

Cold Spring Harbor Laboratory

Reference69 articles.

1. An interpretable mortality prediction model for COVID-19 patients;Nature Machine Intelligence,2020

2. External validation demonstrates limited clinical utility of the interpretable mortality prediction model for patients with COVID-19;Nature Machine Intelligence,2020

3. Mortality prediction of patients in intensive care units using machine learning algorithms based on electronic health records;Sci. Rep,2022

4. Characteristics and outcomes of hospital admissions for COVID-19 and influenza in the Toronto area

5. Recurrent neural network models (CovRNN) for predicting outcomes of patients with COVID-19 on admission to hospital: model development and validation using electronic health record data;Lancet Digit Health,2022

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. No-code machine learning in radiology: implementation and validation of a platform that allows clinicians to train their own models;2024-04-26

2. Empirical data drift detection experiments on real-world medical imaging data;Nature Communications;2024-02-29

3. Artificial Intelligence in the 21st Century;Research on Intelligent Manufacturing and Assembly;2023-03-25