A scoping review of AI/ML algorithm updating practices for model continuity and patient safety using a simplified checklist (Preprint)

Author:

Otokiti AhmedORCID,Shih Huan-juORCID,Ozoude MakuochukwuORCID,Siguachi Ilse,Warsame LeylaORCID,Williams KarmenORCID,Akinloye SeyiORCID

Abstract

BACKGROUND

The ubiquity of clinical artificial intelligence (AI) and machine learning (ML) models necessitates measures to ensure the reliability of model output over time. Previous reviews have highlighted the lack of external validation for most clinical models, but a comprehensive review assessing the current priority given to clinical model updating is lacking.

OBJECTIVE

The objective of this study was to analyze studies of clinical AI models based on PRISMA guidelines. Additionally, a new simple checklist/score system was developed to screen the quality of published AI/ML models. The primary aim was to understand the extent to which clinical model updating is prioritized in current research.

METHODS

We conducted a systematic analysis of studies on clinical AI models, adhering to PRISMA guidelines. A new checklist/score was introduced to assess the quality of the models. Demographic composition based on ethnicity or race was also considered in the analysis. This comprehensive approach aimed to provide a thorough evaluation of the current landscape of clinical AI models.

RESULTS

The results of our analysis revealed that only 9% of the reviewed 390 AI/ML studies stated an intention or method to update their models in the future. 98% of the AI/ML models in our review were in the research phase, and only 2 % in production phase. Furthermore, a mere 12% reported following best practice standards for model development. Notably, 84% of the studies did not provide demographic composition based on ethnicity or race.

CONCLUSIONS

In conclusion, a significant portion of studies currently lack commitment to future model updates. The low adherence to best practice standards for model development also highlights areas for improvement in the field. Furthermore, the absence of demographic information in a substantial number of studies raises concerns about the generalizability and equitable application of these models. The prevalence of research phase models built on proprietary data, limiting independent verification and validation of model output is also a big concern for patient safety. Addressing these issues is crucial for advancing the reliability and inclusivity of clinical AI and ML applications.

CLINICALTRIAL

NA

INTERNATIONAL REGISTERED REPORT

RR2-10.2196/37685

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3