Development and External Validation of Machine Learning-Based Models for Predicting Lung Metastasis in Kidney Cancer: A Large Population-Based Study-Reference-Cited by-同舟云学术

Development and External Validation of Machine Learning-Based Models for Predicting Lung Metastasis in Kidney Cancer: A Large Population-Based Study

Published:2023-06-20 Issue: Volume:2023 Page:1-13
ISSN:1742-1241
Container-title:International Journal of Clinical Practice
language:en
Short-container-title:International Journal of Clinical Practice

Author:

Yi Xinglin¹^ORCID,Zhang Yuhan²,Cai Juan²,Hu Yu²,Wen Kai²,Xie Pan²,Yin Na²,Zhou Xiangdong¹^ORCID,Luo Hu¹^ORCID

Affiliation:

1. Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital of the Army Medical University, Chongqing, China

2. Department of Renal Dialysis Center, The First Affiliated Hospital of the Army Medical University, Chongqing, China

Abstract

The accuracy of indices widely used to evaluate lung metastasis (LM) in patients with kidney cancer (KC) is insufficient. Therefore, we aimed at developing a model to estimate the risk of developing LM in KC based on a large population size and machine learning algorithms. Demographic and clinicopathologic variables of patients with KC diagnosed between 2004 and 2017 were retrospectively analyzed. We performed a univariate logistic regression analysis to identify risk factors for LM in patients with KC. Six machine learning (ML) classifiers were established and tuned using the ten-fold cross-validation method. External validation was performed using clinicopathologic information from 492 patients from the Southwest Hospital, Chongqing, China. Algorithm performance was estimated by analyzing the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, precision, recall, F1 score, clinical decision analysis (DCA), and clinical utility curve (CUC). A total of 52,714 eligible patients diagnosed with KC were enrolled, of whom 2,618 developed LM. Variables of age, sex, race, T stage, N stage, tumor size, histology, and grade were identified as important for the prediction of LM. The extreme gradient boosting (XGB) algorithm performed better than other models in both the internal validation (AUC: 0.913, sensitivity: 0.873, specificity: 0.809, and F1 score: 0.325) and the external validation (AUC: 0.904, sensitivity: 0.750, specificity: 0.878, and F1 score: 0.364). This study established a predictive model for LM in KC patients based on ML algorithms which showed high accuracy and applicative value. A web-based predictor was built using the XGB model to help clinicians make more rational and personalized decisions.

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

General Medicine

Link

http://downloads.hindawi.com/journals/ijclp/2023/8001899.pdf

Reference40 articles.

1. Renal cell carcinoma: diagnosis and management;R. E. Gray;American Family Physician,2019

2. Kidney cancer: The next decade

3. Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012

4. The World Health Organization 2016 classification of testicular germ cell tumours: a review and update from the International Society of Urological Pathology Testis Consultation Panel

5. Timing the Landmark Events in the Evolution of Clear Cell Renal Cell Cancer: TRACERx Renal