Abstract
Abstract
Background
The investigation into individual survival rates within the patient population was typically conducted using the Cox proportional hazards model, with geometric black box models not being employed
Aims
We aims to evaluate the performance of machine learning algorithm in predicting survival rates more than 5 years for individual patients with colorectal cancer.
Methods
A total of 475 patients with CRC and complete data who had underwent surgery for colorectal cancer were analyze to measure individual's survival rate more than 5 years using a machine learning based on penalized Cox regression. We conducted thorough calculations to measure the individual's survival rate more than 5 years for performance evaluation. The receiver operating characteristic (ROC) curves for the LASSO penalized model, the SCAD penalized model, the unpenalized model, and the RSF model were analyzed.
Results
The least absolute shrinkage and selection operator penalized model displayed a mean AUC of 0.67 ± 0.06, the smoothly clipped absolute deviation penalized model exhibited a mean AUC of 0.65 ± 0.07, the unpenalized model showed a mean AUC of 0.64 ± 0.09. Notably, the random survival forests model outperformed the others, demonstrating the most favorable performance evaluation with a mean AUC of 0.71 ± 0.05.
Conclusions
Penalized Cox model is more efficient and leads to a more generalized model selection compared to the unpenalized Cox model as a prognosis prediction model for CRC. The results indicated that the random forest model, a black box model, outperformed the penalized Cox model in terms of performance.
Publisher
Research Square Platform LLC
Reference27 articles.
1. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries;Sung H;CA Cancer J Clin,2021
2. Global, regional, and national cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life-years for 32 cancer groups, 1990 to 2015: a systematic analysis for the global burden of disease study;Fitzmaurice C;JAMA oncology,2017
3. The ever-increasing importance of cancer as a leading cause of premature death worldwide;Bray F;Cancer,2021
4. Epidemiology of colorectal cancer: incidence, mortality, survival, and risk factors;Rawla P;Prz Gastroenterol,2019
5. A review of colorectal cancer in terms of epidemiology, risk factors, development, symptoms and diagnosis;Sawicki T;Cancers (Basel),2021