Machine Learning for Benchmarking Adolescent Idiopathic Scoliosis Surgery Outcomes

Author:

Gupta Aditi12,Oh Inez Y.1,Kim Seunghwan1,Marks Michelle C.3,Payne Philip R.O.1,Ames Christopher P.4,Pellise Ferran5,Pahys Joshua M.6,Fletcher Nicholas D.6,Newton Peter O.7,Kelly Michael P.8ORCID,

Affiliation:

1. Institute for Informatics, Washington University School of Medicine, St. Louis, MO

2. Division of Biostatistics, Washington University School of Medicine, St. Louis, MO

3. Setting Scoliosis Straight Foundation, San Diego, CA

4. Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA

5. Spine Surgery Unit, Vall d'Hebron University Hospital, Barcelona, Spain

6. Shriners Hospitals for Children, Philadelphia, PA

7. Children’s Healthcare of Atlanta, Atlanta, GA

8. Rady Children’s Hospital, San Diego, CA

Abstract

Study Design. Retrospective cohort. Objective. The aim of this study was to design a risk-stratified benchmarking tool for adolescent idiopathic scoliosis (AIS) surgeries. Summary of Background Data. Machine learning (ML) is an emerging method for prediction modeling in orthopedic surgery. Benchmarking is an established method of process improvement and is an area of opportunity for ML methods. Current surgical benchmark tools often use ranks and no “gold standards” for comparisons exist. Materials and Methods. Data from 6076 AIS surgeries were collected from a multicenter registry and divided into three datasets: encompassing surgeries performed (1) during the entire registry, (2) the past 10 years, and (3) during the last 5 years of the registry. We trained three ML regression models (baseline linear regression, gradient boosting, and eXtreme gradient boosted) on each data subset to predict each of the five outcome variables, length of stay (LOS), estimated blood loss (EBL), operative time, Scoliosis Research Society (SRS)-Pain and SRS-Self-Image. Performance was categorized as “below expected” if performing worse than one standard deviation of the mean, “as expected” if within 1 SD, and “better than expected” if better than 1 SD of the mean. Results. Ensemble ML methods classified performance better than traditional regression techniques for LOS, EBL, and operative time. The best performing models for predicting LOS and EBL were trained on data collected in the last 5 years, while operative time used the entire 10-year dataset. No models were able to predict SRS-Pain or SRS-Self-Image in any useful manner. Point-precise estimates for continuous variables were subject to high average errors. Conclusions. Classification of benchmark outcomes is improved with ensemble ML techniques and may provide much needed case-adjustment for a surgeon performance program. Precise estimates of health-related quality of life scores and continuous variables were not possible, suggesting that performance classification is a better method of performance evaluation.

Publisher

Ovid Technologies (Wolters Kluwer Health)

Subject

Neurology (clinical),Orthopedics and Sports Medicine

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3