Low adherence to existing model reporting guidelines by commonly used clinical prediction models

Authors:

Jonathan H. Lu, Alison Callahan, Birju S. Patel, Keith E. Morse, Dev Dash, Nigam H. Shah

Abstract

Objective: To assess whether the documentation available for commonly used machine learning models developed by an electronic health record (EHR) vendor provides the information requested by model reporting guidelines.

Materials and Methods: We identified items requested for reporting from model reporting guidelines published in computer science, biomedical informatics, and clinical journals, and merged similar items into representative "atoms". Four independent reviewers and one adjudicator assessed the degree to which model documentation for 12 models developed by Epic Systems reported the details requested in each atom. We present summary statistics of consensus, interrater agreement, and reporting rates of all atoms for the 12 models.

Results: We identified 220 unique atoms across 15 model reporting guidelines. After examining the documentation for the 12 most commonly used Epic models, the independent reviewers had an interrater agreement of 76%. After adjudication, the median completion rate of applicable atoms across the model documentation was 39% (range: 31%-47%). Most of the commonly requested atoms had reporting rates of 90% or above, including atoms concerning outcome definition, preprocessing, AUROC, internal validation, and intended clinical use. For individual reporting guidelines, the median adherence rate for an entire guideline was 54% (range: 15%-71%). Atoms reported half the time or less included those relating to fairness (summary statistics and subgroup analyses, including for age, race/ethnicity, or sex), usefulness (net benefit, prediction time, warnings on out-of-scope use and when to stop use), and transparency (model coefficients). Atoms relating to reliability also had low reporting rates, including those related to missingness (missing data statistics, missingness strategy), validation (calibration plot, external validation), and monitoring (how models are updated/tuned, prediction monitoring).

Conclusion: There are many recommendations about what should be reported about predictive models used to guide care. The model documentation examined in this study reports fewer than half of applicable atoms, and entire reporting guidelines have low adherence rates. Information related to the usefulness, reliability, transparency, and fairness of models was reported in half or less of the reviewed documentation. There is a need for better operationalization of reporting recommendations for predictive models in healthcare.

Key Points

Question: How often does documentation for commonly deployed clinical predictive models report the information requested by model reporting guidelines?

Finding: Combining the recommendations from 15 model reporting guidelines, we identified 220 unique requested items. We reviewed the documentation of 12 commonly deployed Epic models and assessed the completion rate of applicable items. The median completion rate was 39%. While the most commonly requested items were highly reported, information on usefulness, reliability, transparency, and fairness was missing from at least half of the documentation.

Meaning: The documentation available to model users is incomplete for ensuring that deployed models are useful, reliable, transparent, and fair.
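To make the abstract's summary statistics concrete, the sketch below shows one way percent interrater agreement and the median completion rate of applicable atoms could be computed. This is an illustrative Python sketch under stated assumptions, not the authors' analysis code; the rating labels, reviewer data, and model names are hypothetical.

from statistics import median

def percent_agreement(ratings_a, ratings_b):
    # Fraction of atoms on which two reviewers gave the same rating.
    matches = sum(a == b for a, b in zip(ratings_a, ratings_b))
    return matches / len(ratings_a)

def completion_rate(adjudicated):
    # Share of applicable atoms marked "reported" in one model's documentation.
    applicable = [r for r in adjudicated if r != "not_applicable"]
    return sum(r == "reported" for r in applicable) / len(applicable)

# Hypothetical ratings for one model from two independent reviewers.
reviewer_1 = ["reported", "reported", "not_reported", "not_applicable"]
reviewer_2 = ["reported", "not_reported", "not_reported", "not_applicable"]
print(f"percent agreement: {percent_agreement(reviewer_1, reviewer_2):.0%}")

# Hypothetical adjudicated ratings for three models' documentation.
models = {
    "model_1": ["reported", "not_reported", "reported", "not_applicable"],
    "model_2": ["not_reported", "not_reported", "reported", "reported"],
    "model_3": ["reported", "reported", "not_reported", "not_reported"],
}
rates = [completion_rate(r) for r in models.values()]
print(f"median completion rate: {median(rates):.0%}")

Each model's completion rate is taken over applicable atoms only, mirroring the paper's exclusion of "not applicable" items before computing the per-model rate and the cross-model median.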

Publisher

Cold Spring Harbor Laboratory
