Deep Learning Models for Predicting the Survival of Patients with Hepatocellular Carcinoma Based on a Surveillance, Epidemiology, and End Results (SEER) Database Analysis

Published: 2024-02-12

Author:

Wang Shoucheng¹, Shao Mingyi², Fu Yu², Zhao Ruixia², Xing Yunfei², Zhang Liujie¹, Xu Yang¹

Affiliation:

1. Henan University of Traditional Chinese Medicine

2. First Affiliated Hospital of Henan University of Traditional Chinese Medicine

Abstract

Background: This study aimed to develop and validate a deep learning-based survival prediction model for patients with hepatocellular carcinoma (HCC) and to explore its clinical applicability.

Methods: Patients with pathologically diagnosed HCC between January 2011 and December 2015 were selected from the Surveillance, Epidemiology, and End Results (SEER) database of the United States National Cancer Institute. Two deep learning algorithms, DeepSurv and Neural Multi-Task Logistic Regression (NMTLR), and one machine learning algorithm, Random Survival Forest (RSF), were used for model training. A multivariable Cox Proportional Hazards (CoxPH) model was constructed for comparison. The dataset was randomly divided into a training set and a test set in a 7:3 ratio. Hyperparameters were tuned on the training set using 1000 iterations of random search with 5-fold cross-validation. The primary outcomes were 1-year, 3-year, and 5-year overall survival. Model performance was assessed using the concordance index (C-index), Brier score, and Integrated Brier Score (IBS). The accuracy of the 1-year, 3-year, and 5-year survival predictions was evaluated using Receiver Operating Characteristic (ROC) curves, Area Under the Curve (AUC), and calibration plots. Risk stratification capability was assessed with the log-rank test.

Results: The study included 2,197 HCC patients, randomly divided into a training cohort (70%, n = 1,537) and a test cohort (30%, n = 660). Clinical characteristics did not differ significantly between the two cohorts (p > 0.05). The deep learning models outperformed both the RSF and CoxPH models, with C-indices of 0.735 (NMTLR) and 0.731 (DeepSurv) in the test dataset. The NMTLR model provided more accurate and better-calibrated estimates of 1-year, 3-year, and 5-year survival (AUC: 0.803–0.824). We deployed the NMTLR model as a web application for clinical practice.

Conclusion: The predictive model developed using the deep learning algorithm NMTLR demonstrated excellent performance in prognostication for primary hepatocellular carcinoma.
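For readers unfamiliar with the C-index reported in the abstract, the following is a minimal sketch of Harrell's concordance index in plain Python. It is an illustration only, not the authors' implementation: the function name `concordance_index`, the toy data, and the choice to skip pairs with tied survival times are all assumptions made here. A pair of patients is comparable when the one with the shorter follow-up time had an observed event; the pair is concordant when that patient was also assigned the higher predicted risk.

```python
from itertools import combinations


def concordance_index(times, events, risks):
    """Harrell's C-index for right-censored data (illustrative sketch).

    times  : follow-up time for each patient
    events : 1 if death was observed, 0 if the patient was censored
    risks  : predicted risk score (higher means shorter expected survival)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        if times[i] == times[j]:
            continue  # assumption: pairs with tied times are skipped
        # Identify which patient of the pair has the shorter follow-up.
        first, second = (i, j) if times[i] < times[j] else (j, i)
        if not events[first]:
            continue  # earlier patient was censored, so the ordering is unknown
        comparable += 1
        if risks[first] > risks[second]:
            concordant += 1.0  # higher risk died first: concordant
        elif risks[first] == risks[second]:
            concordant += 0.5  # tied risk scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy data: four patients, the second one censored.
toy_times = [5, 10, 15, 20]
toy_events = [1, 0, 1, 1]
toy_risks = [0.9, 0.4, 0.1, 0.6]
print(concordance_index(toy_times, toy_events, toy_risks))
```

On this toy data, four pairs are comparable and three are concordant. A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, which gives context for the test-set C-indices of 0.735 and 0.731 quoted above.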

Publisher

Research Square Platform LLC
