Gradient Boosting Machine based prediction of chemotherapy response and role of p53 mutational and smoking status for progression free survival in metastatic colorectal cancer

Author:

Yıldız Oğuzhan1,Gürbüz Ali Fuat1,Eryılmaz Melek Karakurt1,Araz Murat1,Yıldırım Mahmut Selman2,Bozcuk Hakan Şat3,Artaç Mehmet1

Affiliation:

1. Department of Medical Oncology, Necmettin Erbakan University School of Medicine, Konya, Turkey

2. Department of Medical Genetics, Necmettin Erbakan University School of Medicine, Konya, Turkey

3. Department of Medical Oncology, Medical Park Antalya Hospital, Antalya, Turkey

Abstract

Abstract

Background: Identifying predictors of response or progression after first-line chemotherapy for stage 4 colorectal cancer remains a challenge. This study aims to evaluate the correlation between patient outcomes and the p53 mutational status and smoking status of tumors using various machine learning methods. Material and methods: We consecutively recruited all patients diagnosed with metastatic colorectal cancer at an academic center within a specified time period. Response to first-line chemotherapy and associated factors were assessed using various machine learning models. The most accurate model was further optimized. Additionally, common clinical features, MMR, p53, and RAS status were tested for correlation with the outcome. Feature importance and calibration plots were generated, and univariate and multivariate Cox models were utilized to analyze associates of progression-free survival (PFS). Results: A total of 101 newly diagnosed metastatic colorectal cancer patients initiating first-line chemotherapy were included. The median age was 62, and 69% of the cases were male. We evaluated 15 machine learning models to predict the binary outcome of best response to chemotherapy, among which LightGBM demonstrated the highest baseline accuracy of 0.71. Further tuning of the LightGBM model improved accuracy to 0.79, with a macro average AUC value of 0.82. Age at diagnosis, maximum metastatic dimension of cancer, and metastatic status at diagnosis were identified as the three most important features. Genetic variables did not establish significant feature importance for response analysis. Survival analysis revealed an association between PFS and p53 mutation status (Exp(B) = 0.52, Wald = 6.98, P = 0.008) and smoking pack years (Exp(B) = 0.99, Wald = 4.28, P = 0.039). Discussion: Utilizing LightGBM as a machine learning method, we developed a predictive model with good accuracy for assessing response to first-line treatment. If confirmed and further improved, such a model could aid in identifying responders to first-line chemotherapy in metastatic colorectal cancer patients and suggesting alternative chemotherapy options for non-responders. Furthermore, our findings highlight the prognostic importance of genetic features, particularly p53 mutation status, and smoking pack years for PFS duration in this context.

Publisher

Research Square Platform LLC

Reference20 articles.

1. Diagnosis and treatment of metastatic colorectal cancer: a review;Biller LH;Jama,2021

2. Analysis of plasma cell-free DNA by ultradeep sequencing in patients with stages I to III colorectal cancer;Reinert T;JAMA oncology,2019

3. American cancer society. Cancer facts and Figs. 2013;Atlanta G;Amer. Cancer Soc.,2013

4. Systemic treatment of colorectal cancer;BM W;Gastroenterology,2008

5. Barhak, J., Visualization and pre-processing of intensive care unit data using python data science tools. Proceedings from MODSIM World 2018, 2018.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3