Abstract
AbstractAs patient health information is highly regulated due to privacy concerns, the majority of machine learning (ML)-based healthcare studies are unable to test on external patient cohorts, resulting in a gap between locally reported model performance and cross-site generalizability. Different approaches have been introduced for developing models across multiple clinical sites, however no studies have compared methods for translating ready-made models for adoption in new settings. We introduce three methods to do this – (1) applying a ready-made model “as-is”; (2) readjusting the decision threshold on the output of a ready-made model using site-specific data; and (3) finetuning a ready-made model using site-specific data via transfer learning. Using a case study of COVID-19 diagnosis across four NHS Hospital Trusts, we show that all methods achieve clinically-effective performances (NPV >0.959), with transfer learning achieving the best results (mean AUROCs between 0.870-0.925). Our models demonstrate that site-specific customization improves predictive performance when compared to other ready-made approaches.
Publisher
Cold Spring Harbor Laboratory
Reference18 articles.
1. Fostering reproducibility and generalizability in machine learning for clinical prediction modeling in spine surgery;The Spine Journal,2021
2. Bai, X. , Wang, H. , Ma, L. , Xu, Y. , Gan, J. , Fan, Z. , … & Xia, T. (2021). Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence. Nature Machine Intelligence, 1–9.
3. Barak-Corren, Y. , Fine, A. M. , & Reis, B. Y. (2017). Early prediction model of patient hospitalization from the pediatric emergency department. Pediatrics, 139(5).
4. Prediction across healthcare settings: a case study in predicting emergency department disposition;npj Digital Medicine,2021
5. Machine learning comes of age: local impact versus national generalizability;Anesthesiology,2020