Affiliation:
1. Gansu Provincial Hospital
Abstract
Abstract
Purpose
This study aims to establish the best prediction model of lymph node metastasis (LNM) in patients with intermediate and high-risk prostate cancer (PCa) through machine learning (ML), and provide the guideline of accurate clinical diagnosis and precise treatment for clinicals.
Methods
A total of 24,470 patients with intermediate and high-risk PCa were included in this study. Multivariate logistic regression model was used to screen the independent risk factors of LNM. At the same time, six algorithms, namely, random forest (RF), naive bayesian classifier (NBC), xgboost (XGB), gradient boosting machine (GBM), logistic regression (LR) and decision tree (DT) are used to establish risk prediction models. Based on the best prediction performance of ML algorithm, a prediction model is established, and the performance of the model is evaluated from three aspects: area under curve (AUC), sensitivity and specificity.
Results
In multivariate logistic regression analysis, T stage, PSA, Gleason score and bone metastasis were independent predictors of LNM in patients with intermediate and high-risk PCa. By comprehensively comparing the prediction model performance of training set and test set, GBM model has the best prediction performance (F1 score = 0.838, AUROC = 0.804). Finally, we developed a preliminary calculator model that can quickly and accurately calculate the regional LNM in patients with intermediate and high-risk PCa.
Conclusion
T stage, PSA, Gleason and bone metastasis were independent risk factors for predicting LNM in patients with intermediate and high-risk PCa. The prediction model established in this study performs well, however, the GBM model is the best one.
Publisher
Research Square Platform LLC