An AI-driven Predictive Model for Pancreatic Cancer Patients Using Extreme Gradient Boosting

Published: 2023-09-11 Issue: 4 Volume: 22 Page: 262-282
ISSN: 2214-1766
Container-title: Journal of Statistical Theory and Applications
language: en
Short-container-title: J Stat Theory Appl

Author:

Chakraborty Aditya, Tsokos Chris P.

Abstract

Pancreatic cancer is one of the deadliest cancers, affecting people all over the world. Most patients are diagnosed at Stage III or Stage IV, and the chances of survival are very low once the disease is detected at these late stages. This study builds an efficient data-driven predictive model from the associated risk factors and identifies the factors that contribute most to the survival times of patients diagnosed with pancreatic cancer, using the XGBoost (eXtreme Gradient Boosting) algorithm. A grid search was used to find the optimum values of the model's hyperparameters by minimizing the root mean square error (RMSE); the final hyperparameters were selected by comparing 243 competing models. To check the validity of the model, we compared its performance with ten deep neural network models, grown sequentially with different activation functions and optimizers. We also constructed an ensemble model using a Gradient Boosting Machine (GBM). The proposed XGBoost model outperformed all competing models we considered with regard to RMSE. After developing the model, we ranked the individual risk factors according to their contribution to the response predictions, which can help pancreatic cancer research organizations direct their resources toward the risk factors that most influence this type of cancer. The three most influential risk factors affecting the survival of pancreatic cancer patients were found to be the age of the patient, current BMI, and cigarette smoking years, with contributions of 35.5%, 24.3%, and 14.93%, respectively. The predictive model is approximately 96.42% accurate in predicting the survival times of patients diagnosed with pancreatic cancer and performs very well on test data.
The methodology used to develop the model can be reused for prediction purposes: given a set of numeric and non-numeric features, it can be used to predict the time to death for a specific type of cancer.
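
The grid-search step described in the abstract can be illustrated with a minimal sketch: enumerate every combination of candidate hyperparameter values, fit one model per combination, and keep the combination with the lowest validation RMSE. Everything below is an assumption made for illustration and not the authors' pipeline: the toy one-feature stump booster stands in for XGBoost, the data are synthetic, and the hyperparameter names (`n_rounds`, `learning_rate`, `reg_lambda`) and candidate values are hypothetical. The abstract does not say how the 243 competing models were laid out; a grid of five hyperparameters with three candidate values each would give 3^5 = 243, but that is only one possibility. The sketch uses a smaller 3 x 3 x 3 grid so that it runs quickly.

```python
import itertools
import math
import random


def rmse(y_true, y_pred):
    """Root mean square error between two equal-length sequences."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def fit_stump(x, residuals, reg_lambda):
    """Return (threshold, left value, right value) of the best one-split stump."""
    best = None
    for threshold in sorted(set(x))[:-1]:  # dropping the max keeps the right leaf non-empty
        left = [r for xi, r in zip(x, residuals) if xi <= threshold]
        right = [r for xi, r in zip(x, residuals) if xi > threshold]
        w_left = sum(left) / (len(left) + reg_lambda)  # L2-shrunk leaf value
        w_right = sum(right) / (len(right) + reg_lambda)
        sse = sum((r - w_left) ** 2 for r in left) + sum((r - w_right) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, threshold, w_left, w_right)
    return best[1:]


def boost(x, y, n_rounds, learning_rate, reg_lambda):
    """Toy gradient boosting for squared error: each stump fits the current residuals."""
    base = sum(y) / len(y)
    pred = [base] * len(y)
    stumps = []
    for _ in range(n_rounds):
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        threshold, w_left, w_right = fit_stump(x, residuals, reg_lambda)
        stumps.append((threshold, w_left, w_right))
        pred = [pi + learning_rate * (w_left if xi <= threshold else w_right)
                for xi, pi in zip(x, pred)]
    return base, learning_rate, stumps


def predict(model, x):
    """Sum the shrunken stump outputs on top of the base prediction."""
    base, learning_rate, stumps = model
    return [base + learning_rate * sum(wl if xi <= t else wr for t, wl, wr in stumps)
            for xi in x]


# Synthetic data (hypothetical): a noisy linear trend, split into train and validation.
random.seed(0)
xs = [random.uniform(0.0, 10.0) for _ in range(60)]
ys = [2.0 * xi + random.gauss(0.0, 1.0) for xi in xs]
x_train, y_train = xs[:40], ys[:40]
x_valid, y_valid = xs[40:], ys[40:]

# Hypothetical grid: 3 values for each of 3 hyperparameters gives 27 candidates.
grid = {
    "n_rounds": [5, 20, 40],
    "learning_rate": [0.05, 0.1, 0.3],
    "reg_lambda": [0.0, 1.0, 5.0],
}
candidates = [dict(zip(grid, values)) for values in itertools.product(*grid.values())]

# Fit one model per candidate and record its validation RMSE.
results = []
for params in candidates:
    model = boost(x_train, y_train, **params)
    results.append((rmse(y_valid, predict(model, x_valid)), params))

best_rmse, best_params = min(results, key=lambda item: item[0])

# Reference point: always predicting the training mean.
baseline_rmse = rmse(y_valid, [sum(y_train) / len(y_train)] * len(y_valid))

print("candidates tried:", len(candidates))
print("best hyperparameters:", best_params)
print("best validation RMSE:", round(best_rmse, 3))
print("mean-only baseline RMSE:", round(baseline_rmse, 3))
```

Selecting on a held-out validation split rather than on the training data keeps the comparison between candidates honest; a final test set, as mentioned in the abstract, would then be scored only once with the chosen hyperparameters.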

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics, Computer Science Applications, Statistics and Probability

Link

https://link.springer.com/content/pdf/10.1007/s44199-023-00063-7.pdf

References: 51 articles.

1. Agostinelli, F., Hoffman, M., Sadowski, P., Baldi, P.: Learning activation functions to improve deep neural networks. (2014) arXiv preprint arXiv:1412.6830

2. Ahmad, L.G., Eshlaghy, A.T., Poorebrahimi, A., Ebrahimi, M., Razavi, A.R.: Using Three Machine Learning Techniques for Predicting Breast Cancer Recurrence. J. Health Med. Inform. 4, 124 (2013). https://doi.org/10.4172/2157-7420.1000124

3. Amjad, M., et al.: Prediction of pile bearing capacity using XGBoost algorithm: modeling and performance evaluation. Appl. Sci. 12(4), 2126 (2022)

4. Bal, M.S., Bodal, V.K., Kaur, J., Kaur, M., Sharma, S.: Patterns of Cancer: A Study of 500 Punjabi Patients. Asian Pac. J. Cancer Prev. 16(12), 5107–10 (2015)

5. Bebis, G., Georgiopoulos, M.: Feed-forward neural networks. IEEE Potentials 13(4), 27–31 (1994)

Cited by 4 articles.

1. Prediction of sepsis mortality in ICU patients using machine learning methods;BMC Medical Informatics and Decision Making;2024-08-16

2. A Stock Optimization Problem in Finance: Understanding Financial and Economic Indicators through Analytical Predictive Modeling;Mathematics;2024-08-02

3. Artificial Intelligence in Oncology: Applications, Challenges and Future Frontiers;International Journal of Pharmaceutical Investigation;2024-07-01

4. Establishment of prediction model for mortality risk of pancreatic cancer: a retrospective study;BMC Medical Informatics and Decision Making;2024-06-27