Explainable Machine Learning for Prediction of Procedural Case Durations Developed Using a Large Multicenter Database: Algorithm Development and Validation (Preprint)

Author:

Kendale SamirORCID,Bishara Andrew,Burns MichaelORCID,Solomon Stuart,Corriere Matthew,Mathis MichaelORCID

Abstract

BACKGROUND

Accurate projections of procedural case durations are complex, but critical to planning of perioperative staffing, operating room resources, and patient communication. Nonlinear prediction models using machine learning methods may provide opportunities for hospitals to improve upon current estimates of procedure duration.

OBJECTIVE

We hypothesized a machine learning algorithm derived from a large multicenter dataset would more accurately predict surgical procedure duration when compared to a baseline linear regression approach. Using an explainable machine learning-based algorithm, results provide additional valuable insight regarding procedure duration and variability.

METHODS

A total of 1,177,893 procedures from 13 academic and private hospitals between 2016 and 2019 were used. Deep learning, gradient boosting, and ensemble machine learning models were generated using perioperative data available at three distinct time points: time of scheduling, time of arrival to the operating/procedure room (primary model), and time of surgical incision/procedure start. The primary outcome was procedure duration, defined by the time between arrival and departure of the patient from the procedure room. Model performance was assessed by mean absolute error, proportion of predictions within 20% of actual duration, and other standard metrics. Performance was compared to a baseline method of historical means within a linear regression model. Model features driving predictions were assessed using Shapley values and permutation feature importance.

RESULTS

Across all procedures, median procedure duration was 94 minutes (interquartile range of 50-167 minutes). In estimating procedure duration, the gradient boosting machine was the best performing model, demonstrating a mean absolute error of 34 minutes with 46% of predictions within 20% of actual duration in the test dataset. This represented a statistically and clinically significant improvement in predictions compared to a baseline linear regression model (43 minutes, p < 0.001; 39% of predictions within 20% of actual duration). The most important features in model training were historical procedure duration by surgeon, the word “free” within the procedure text, and time of day.

CONCLUSIONS

Nonlinear models using machine learning techniques may be used to generate high-performing, automatable, explainable, and scalable prediction models for procedure duration. Medi

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3