Comparison of Cox Regression to Machine Learning in Predicting Cancer-Specific Survival of Fibroblastic Osteosarcoma

Author:

Chao Longteng1,Ye Xinmiao1,Chen Junyuan1,She Guorong1,Zha Zhengang1

Affiliation:

1. The First Affiliated Hospital of Jinan University

Abstract

Abstract Background Bone cancer called osteosarcoma (OS), especially its fibroblastic type, makes things very hard in the world of bone diseases. This happens because of its fierce character and the complexity involved in deciding outcomes. Current prognostic models, like the American Joint Committee on Cancer (AJCC) system and Tumor Node Metastasis (TNM) Staging System, don't always fully include important individual patient factors such as age, sex and race. These things are very important for making a correct prediction. Methods A total of 394 patients with fibroblastic osteosarcoma were included in the study, adhering to specified inclusion and exclusion criteria. The cohort was subsequently segregated into training and validation sets at a 7:3 ratio. X-tile software facilitated the determination of optimal age and tumor size cutoffs. Missing data were managed using multiple imputation and K-Nearest Neighbor (KNN) methods. The primary endpoint was cancer-specific survival (CSS), categorized into binary data (survival status at 3 and 5 years) and time-to-event data. Independent prognostic factors were ascertained using the Boruta algorithm, which informed the construction of predictive models employing Cox regression and diverse machine learning algorithms such as Survival Tree, Extra Survival Trees, Random Survival Forest, Gradient Boosting Survival Analysis, Fast Kernel Survival SVM, and Minlip Survival Analysis. Model performance metrics included the concordance index (C-index), accuracy, recall, F1 score, and time-dependent Area Under the Curve (AUC). A calibration plot was generated to validate the accuracy of the most proficient machine learning model. Decision curve analysis (DCA) was implemented to ascertain the model's clinical utility. Additionally, we used the SHapley Additive exPlanations (SHAP) method to show how important our model found key things that can predict outcomes. Results For age, the determined optimal cutoff points were established at 40 and 57 years. Regarding tumor size, these points were set at 60mm and 103mm. Our study identified nine significant independent prognostic factors impacting the cancer-specific survival in patients with fibroblastic osteosarcoma. These included age group, tumor stage, tumor size group, radiation, surgery type, primary site, sex, chemotherapy, and grade group. Comparative analysis of different algorithms, utilizing metrics such as accuracy, recall, F1 score, C-index, and time-dependent AUC, highlighted the Extra Survival Trees model as the superior predictive tool for machine learning. This model demonstrated high efficiency (3-year CSS accuracy: 0.91, 5-year CSS accuracy: 0.89), notable recall rates (3-year: 0.81, 5-year: 0.74), and robust F1 scores (3-year: 0.83, 5-year: 0.80), along with an average AUC of 0.89 and a C-index of 0.92 for training and 0.80 for validation. The calibration curve for this model indicated high predictive accuracy, and its clinical usefulness was further corroborated by decision curve analysis (DCA). SHAP analysis identified 'age group', 'tumor stage', and 'tumor size group' as the three most influential variables impacting cancer-specific survival predictions in fibroblastic osteosarcoma. Our study suggested otherwise than previous ones. It showed that radiation and chemotherapy may not work for treating this type of bone cancer called fibroblastic osteosarcoma. Conclusion Research indicates that predictive analysis using machine learning outperforms traditional methods in forecasting outcomes for patients with fibroblastic osteosarcoma. This development offers considerable promise for enhancing tailored therapeutic approaches and prognostic outcomes in fibroblastic osteosarcoma.

Publisher

Research Square Platform LLC

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3