External validity of machine learning-based prognostic scores for cystic fibrosis: A retrospective study using the UK and Canadian registries-Reference-Cited by-同舟云学术

External validity of machine learning-based prognostic scores for cystic fibrosis: A retrospective study using the UK and Canadian registries

Published:2023-01-12 Issue:1 Volume:2 Page:e0000179
ISSN:2767-3170
Container-title:PLOS Digital Health
language:en
Short-container-title:PLOS Digit Health

Author:

Qin Yuchao^ORCID,Alaa Ahmed,Floto Andres,Schaar Mihaela van der

Abstract

Precise and timely referral for lung transplantation is critical for the survival of cystic fibrosis patients with terminal illness. While machine learning (ML) models have been shown to achieve significant improvement in prognostic accuracy over current referral guidelines, the external validity of these models and their resulting referral policies has not been fully investigated. Here, we studied the external validity of machine learning-based prognostic models using annual follow-up data from the UK and Canadian Cystic Fibrosis Registries. Using a state-of-the-art automated ML framework, we derived a model for predicting poor clinical outcomes in patients enrolled in the UK registry, and conducted external validation of the derived model using the Canadian Cystic Fibrosis Registry. In particular, we studied the effect of (1) natural variations in patient characteristics across populations and (2) differences in clinical practice on the external validity of ML-based prognostic scores. Overall, decrease in prognostic accuracy on the external validation set (AUCROC: 0.88, 95% CI 0.88-0.88) was observed compared to the internal validation accuracy (AUCROC: 0.91, 95% CI 0.90-0.92). Based on our ML model, analysis on feature contributions and risk strata revealed that, while external validation of ML models exhibited high precision on average, both factors (1) and (2) can undermine the external validity of ML models in patient subgroups with moderate risk for poor outcomes. A significant boost in prognostic power (F1 score) from 0.33 (95% CI 0.31-0.35) to 0.45 (95% CI 0.45-0.45) was observed in external validation when variations in these subgroups were accounted in our model. Our study highlighted the significance of external validation of ML models for cystic fibrosis prognostication. The uncovered insights on key risk factors and patient subgroups can be used to guide the cross-population adaptation of ML-based models and inspire new research on applying transfer learning methods for fine-tuning ML models to cope with regional variations in clinical care.

Funder

Cystic Fibrosis Trust

Cystic Fibrosis Foundation

Publisher

Public Library of Science (PLoS)

Reference37 articles.

1. Cystic Fibrosis Foundation consensus guidelines for the care of individuals with advanced cystic fibrosis lung disease;SG Kapnadak;Journal of Cystic Fibrosis,2020

2. Identifying and preventing cardiovascular disease in patients with cystic fibrosis;T Saunders;Nature Cardiovascular Research,2022

3. Lung transplantation for cystic fibrosis;JC Yeung;The Journal of Heart and Lung Transplantation,2020

4. Prediction of mortality in patients with cystic fibrosis;E Kerem;New England Journal of Medicine,1992

5. Discovery and clinical decision support for personalized healthcare;J Yoon;IEEE Journal of Biomedical and Health Informatics,2016

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Factors influencing clinician and patient interaction with machine learning-based risk prediction models: a systematic review;The Lancet Digital Health;2024-02

2. Assessing the transportability of clinical prediction models for cognitive impairment using causal models;BMC Medical Research Methodology;2023-08-19

3. Assessing the transportability of clinical prediction models for cognitive impairment using causal models;2022-03-02