Author:
Mohammadian Rad Nastaran,Sosef Odin,Seegers Jord,Koolen Laura J. E. R.,Hoofwijk Julie J. W. A.,Woodruff Henry C.,Hoofwijk Ton A. G. M.,Sosef Meindert,Lambin Philippe
Abstract
ObjectiveColorectal cancer (CRC) is one of the most prevalent cancers worldwide. A considerable percentage of patients who undergo surgery with curative intent will experience cancer recurrence. Early identification of individuals with a higher risk of recurrence is crucial for healthcare professionals to intervene promptly and devise appropriate treatment strategies. In this study, we developed prognostic models for CRC recurrence using machine learning models on a limited number of CEA measurements.MethodA dataset of 1927 patients diagnosed with Stage I-III CRC and referred to Zuyderland Hospital for surgery between 2008 and 2016 was utilized. Machine learning models were trained using this comprehensive dataset, which included demographic details, clinicopathological factors, and serial measurements of Carcinoembryonic Antigen (CEA). In this study, the predictive performance of these models was assessed, and the key prognostic factors influencing colorectal cancer (CRC) recurrence were pinpointedResultAmong the evaluated models, the gradient boosting classifier demonstrated superior performance, achieving an Area Under the Curve (AUC) score of 0.81 and a balanced accuracy rate of 0.73. Recurrence prediction was shown to be feasible with an AUC of 0.71 when using only five post-operative CEA measurements. Furthermore, key factors influencing recurrence were identified and elucidated.ConclusionThis study shows the transformative role of machine learning in recurrence prediction for CRC, particularly by investigating the minimum number of CEA measurements required for effective recurrence prediction. This approach not only contributes to the optimization of clinical workflows but also facilitates the development of more effective, individualized treatment plans, thereby laying the groundwork for future advancements in this area. Future directions involve validating these models in larger and more diverse cohorts. Building on these efforts, our ultimate goal is to develop a risk-based follow-up strategy that can improve patient outcomes and enhance healthcare efficiency.