Considerations for Quality Control Monitoring of Machine Learning Models in Clinical Practice (Preprint)-Reference-Cited by-同舟云学术

Considerations for Quality Control Monitoring of Machine Learning Models in Clinical Practice (Preprint)

Published:2023-07-07 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Faust Louis^ORCID,Wilson Patrick,Asai Shusaku,Fu Sunyang^ORCID,Liu Hongfang^ORCID,Ruan Xiaoyang,Storlie Curt

Abstract

BACKGROUND

Integrating machine learning models into clinical practice presents a challenge of maintaining their efficacy over time. While the existing literature offers valuable strategies for detecting declining model performance, there is a need to document the broader challenges and solutions associated with real-world development and integration of model monitoring solutions.

OBJECTIVE

This work details the development and utilization of a platform for monitoring the performance of a production-level machine learning model operating in Mayo Clinic. The aim of this work is to provide a series of considerations and guidelines necessary for integrating such a platform into a team’s technical infrastructure and workflow. We document our experiences with this integration process and discuss the broader challenges encountered with real-world implementation and maintenance. Source code for the platform is also included.

METHODS

Our monitoring platform was built as an R shiny application; developed and implemented over the course of 6 months. The platform has been utilized and maintained for 2 years and is still in use as of July 2023.

RESULTS

The considerations necessary for the implementation of the monitoring platform center around four pillars: Feasibility – what resources can be utilized for platform development?; Design – through what statistics or models will the model be monitored, and how will these results be efficiently displayed to the end-user?; Implementation – how will this platform be built and where will it exist within the IT ecosystem?; and Policy – based on monitoring feedback, when and what actions will be taken to fix problems and how will these problems be translated to clinical staff?

CONCLUSIONS

While much of the literature surrounding machine learning performance monitoring emphasizes methodological approaches for capturing changes in performance, there remain a battery of other challenges and considerations that must be addressed for successful real-world implementation.

CLINICALTRIAL

INTERNATIONAL REGISTERED REPORT

RR2-10.1186/s13063-021-05546-5

Publisher

JMIR Publications Inc.

Reference24 articles.

1. Evaluation and Real-World Performance Monitoring of Artificial Intelligence Models in Clinical Practice: Try It, Buy It, Check It

2. External Validation of a Widely Implemented Proprietary Sepsis Prediction Model in Hospitalized Patients

3. The Clinician and Dataset Shift in Artificial Intelligence

4. Dataset Shift in Machine Learning

5. Support Vector Machines