Development of 5-year Survival Machine Learning for Hepatocellular Carcinoma Patients Based on Pre-therapeutic Hematological Biomarkers

Author:

Jiang Tingting1,Liu Xingyu2,He Wencan3,Li Hepei1,Yan Xiang4,Yu Qian5,Mao Shanjun2

Affiliation:

1. Sichuan University

2. Hunan University

3. Central South University

4. IMN-CNM, CSIC (CEI UAM + CSIC)

5. Sun Yat-sen University

Abstract

Abstract

Background Hepatocellular carcinoma (HCC) is the main pathological subtype of primary liver cancer (PLC). Early onset HCC is insidious and characterized by easy metastasis, recurrence, and a high mortality rate. Most patients are in the middle or late stage, with a poor prognosis. Objective This study aimed to develop and validate a prediction model for HCC disease progression by machine learning (ML) algorithms, based on clinical blood biomarkers, circulating tumor cells (CTCs), and circulating endothelial cells (CECs) from pretreatment patients. Furthermore, the risk factors for 5-year survival in HCC patients were identified, and used to guide clinical diagnosis and treatment decisions. Methods A total of 76 newly diagnosed patients with HCC were eventually enrolled between September 2018 and July 2019. The follow-up time was 1–67 months in this group. Patients who survived for 5 years after the first surgery, were divided into a surviving group (n = 34) and a nonsurviving group (n = 42). The pathological data and related survival factors of patients were collected before treatment. The final subset of features was filtered by the support vector machine recursive feature elimination feature (SVM-RFE) algorithm, chi-square test and Student’s test. Prediction models for 5-year survival in patients with HCC were established by logistic regression, support vector machine (SVM), decision tree classification (DTC), random forests (RF), and extreme gradient Boosting (XGBoost), respectively. Additionally, the optimal model was established through validation. The models were evaluated by specificity, F1 score, recall value, accuracy and area under the receiver operating characteristic curve (AUC-ROC). Results Among the included patients, the follow-up time ranged from 1–67 months. The significant variable set, which included 22 variables, was screened. Ranking the importance of variables, the top 22 characteristic variables were as follows: maximum diameter, presence or absence of distant metastasis, CNLC stage, ALB, age, RBC, the large CTC, total bilirubin, PD-L1 (-) CTC, ≥ Pentaploid CTC, AFP, vascular cancer thrombus and satellite nodules, WBC, CTC, BCLC stage, multiple nodules, AST, PD-L (-) CTC-WBC cluster, Triploid CTC, LYM, PD-L1 (-) CEC-WBC cluster and degree of cirrhosis. The AUC-ROC values for predicting the 5-year survival rate of HCC patients by the logistic regression, SVM, DTC, RF, and XGBoost models were 0.7367, 0.9706, 0.6569, 0.7412, and 0.7031, respectively. Among them, the SVM model had the best 5-year survival performance in predicting HCC (Accuracy = 0.9868, F1 score = 0.9882, Recall value = 1.0000). Conclusion The SVM model based on ML methods could predict the 5-year survival of HCC patients and has good recognition ability, with greater accuracy than traditional models. Diagnosis and treatment can be utilized to intervene in the risk factors in this model, and improve patient prognosis.

Publisher

Springer Science and Business Media LLC

Reference38 articles.

1. Ferlay J, Ervik M, Lam F, Colombet M, Mery L, Piñeros M et al. Global Cancer observatory: Cancer today. (2020). Lyon, France: International Agency for Research on Cancer Available at: https://gco.iarc.fr/today (accessed April 1, 2023).

2. Cancer incidence and mortality in China,2022[J];Han BF;Journal of the National Cancer

3. Cancer incidence and mortality in China;Zheng RS;J Natl Cancer Center,2022

4. Cancer incidence and mortality in China,2015[J];Zhang SW;J Nat Cancer Cent,2021

5. Challenges in liver cancer and possible treatment approaches[J];Anwanwan David;Biochim Biophys Acta Rev Cancer,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3