Development of a machine learning-based model to predict prognosis of alpha-fetoprotein-positive hepatocellular carcinoma

Published:2024-05-13 Issue:1 Volume:22
ISSN:1479-5876
Container-title:Journal of Translational Medicine
language:en
Short-container-title:J Transl Med

Author:

Dong Bingtian, Zhang Hua, Duan Yayang, Yao Senbang, Chen Yongjian, Zhang Chaoxue

Abstract

Background Patients with alpha-fetoprotein (AFP)-positive hepatocellular carcinoma (HCC) have aggressive biological behavior and poor prognosis. Therefore, survival time is one of the greatest concerns for patients with AFP-positive HCC. This study aimed to demonstrate the utilization of six machine learning (ML)-based prognostic models to predict overall survival of patients with AFP-positive HCC.
Methods Data on patients with AFP-positive HCC were extracted from the Surveillance, Epidemiology, and End Results database. Six ML algorithms (extreme gradient boosting [XGBoost], logistic regression [LR], support vector machine [SVM], random forest [RF], K-nearest neighbor [KNN], and decision tree [ID3]) were used to develop the prognostic models of patients with AFP-positive HCC at one year, three years, and five years. Area under the receiver operating characteristic curve (AUC), confusion matrix, calibration curves, and decision curve analysis (DCA) were used to evaluate the model.
Results A total of 2,038 patients with AFP-positive HCC were included for analysis. The 1-, 3-, and 5-year overall survival rates were 60.7%, 28.9%, and 14.3%, respectively. Seventeen features regarding demographics and clinicopathology were included in six ML algorithms to generate a prognostic model. The XGBoost model showed the best performance in predicting survival at 1-year (train set: AUC = 0.771; test set: AUC = 0.782), 3-year (train set: AUC = 0.763; test set: AUC = 0.749) and 5-year (train set: AUC = 0.807; test set: AUC = 0.740). Furthermore, for 1-, 3-, and 5-year survival prediction, the accuracy in the training and test sets was 0.709 and 0.726, 0.721 and 0.726, and 0.778 and 0.784 for the XGBoost model, respectively. Calibration curves and DCA exhibited good predictive performance as well.
Conclusions The XGBoost model exhibited good predictive performance, which may provide physicians with an effective tool for early medical intervention and improve the survival of patients.
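
The evaluation metrics named in the abstract (AUC, accuracy, and the net benefit plotted in DCA) can be illustrated with a minimal pure-Python sketch. This is not the authors' code: the function names, the 0.5 threshold, and the six toy labels and scores are assumptions made purely for illustration and are unrelated to the SEER cohort.

```python
# Illustrative sketch only; toy data, hypothetical function names.

def roc_auc(y_true, y_score):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def accuracy(y_true, y_score, threshold=0.5):
    """Share of cases whose thresholded prediction matches the label."""
    preds = [1 if s >= threshold else 0 for s in y_score]
    return sum(p == y for p, y in zip(preds, y_true)) / len(y_true)


def net_benefit(y_true, y_score, threshold):
    """Decision-curve net benefit: TP/n - FP/n * pt / (1 - pt) at threshold pt."""
    n = len(y_true)
    tp = sum(1 for y, s in zip(y_true, y_score) if s >= threshold and y == 1)
    fp = sum(1 for y, s in zip(y_true, y_score) if s >= threshold and y == 0)
    return tp / n - fp / n * threshold / (1 - threshold)


# Toy example (assumed values): 1 = event, 0 = no event.
y_true = [1, 1, 0, 1, 0, 0]
y_score = [0.9, 0.7, 0.6, 0.4, 0.3, 0.1]

auc = roc_auc(y_true, y_score)
acc = accuracy(y_true, y_score)
nb = net_benefit(y_true, y_score, 0.5)
print(f"AUC={auc:.3f} accuracy={acc:.3f} net benefit at 0.5={nb:.3f}")
```

A decision curve is obtained by evaluating `net_benefit` over a range of thresholds and comparing the result with the "treat all" and "treat none" strategies.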

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1186/s12967-024-05203-w.pdf

Reference: 32 articles.

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71(3):209–49.

2. Suk FM, Liu CL, Hsu MH, Chuang YT, Wang JP, Liao YJ. Treatment with a new benzimidazole derivative bearing a pyrrolidine side chain overcomes sorafenib resistance in hepatocellular carcinoma. Sci Rep. 2019;9(1):17259.

3. Villanueva A. Hepatocellular carcinoma. N Engl J Med. 2019;380(15):1450–62.

4. He H, Chen S, Fan Z, Dong Y, Wang Y, Li S, et al. Multi-dimensional single-cell characterization revealed suppressive immune microenvironment in AFP-positive hepatocellular carcinoma. Cell Discov. 2023;9(1):60.

5. Taketa K. Alpha-fetoprotein: reevaluation in hepatology. Hepatology. 1990;12(6):1420–32.