Practical Considerations and Applied Examples of Cross-Validation for Model Development and Evaluation in Health Care: Tutorial-Reference-Cited by-同舟云学术

Practical Considerations and Applied Examples of Cross-Validation for Model Development and Evaluation in Health Care: Tutorial

Published:2023-12-18 Issue: Volume:2 Page:e49023
ISSN:2817-1705
Container-title:JMIR AI
language:en
Short-container-title:JMIR AI

Author:

Wilimitis Drew^ORCID,Walsh Colin G^ORCID

Abstract

Cross-validation remains a popular means of developing and validating artificial intelligence for health care. Numerous subtypes of cross-validation exist. Although tutorials on this validation strategy have been published and some with applied examples, we present here a practical tutorial comparing multiple forms of cross-validation using a widely accessible, real-world electronic health care data set: Medical Information Mart for Intensive Care-III (MIMIC-III). This tutorial explored methods such as K-fold cross-validation and nested cross-validation, highlighting their advantages and disadvantages across 2 common predictive modeling use cases: classification (mortality) and regression (length of stay). We aimed to provide readers with reproducible notebooks and best practices for modeling with electronic health care data. We also described sets of useful recommendations as we demonstrated that nested cross-validation reduces optimistic bias but comes with additional computational challenges. This tutorial might improve the community’s understanding of these important methods while catalyzing the modeling community to apply these guides directly in their work using the published code.

Publisher

JMIR Publications Inc.

Reference33 articles.

1. Action-Informed Artificial Intelligence—Matching the Algorithm to the Problem

2. Towards better clinical prediction models: seven steps for development and an ABCD for validation

3. A framework for the oversight and local deployment of safe and high-quality prediction models

4. Reporting and Methods in Clinical Prediction Research: A Systematic Review

5. A systematic review finds prediction models for chronic kidney disease were poorly reported and often developed using inappropriate methods

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Machine learning applied to the prediction of relapse, hospitalization, and suicide in bipolar disorder using neuroimaging and clinical data: A systematic review;Journal of Affective Disorders;2024-09

2. Predicting thoracic aortic dissection in a diverse biobank using a polygenic risk score;2024-09-01

3. An Automated Machine Learning Framework for Antimicrobial Resistance Prediction Through Transcriptomics;2024-06-27

4. Consolidated Reporting Guidelines for Prognostic and Diagnostic Machine Learning Models (CREMLS);Journal of Medical Internet Research;2024-05-02

5. Consolidated Reporting Guidelines for Prognostic and Diagnostic Machine Learning Models (CREMLS) (Preprint);2024-04-04