Author:
Li Xiangqin,Ning Ruoqi,Xiao Bing,Meng Silu,Sun Haiying,Fan Xinran,Li Shuang
Abstract
IntroductionCervical cancer is the fourth most common cancer among female worldwide. Early detection and intervention are essential. This study aims to construct an early predictive warning model for cervical cancer and precancerous lesions utilizing clinical data and simple nucleotide polymorphisms (SNPs).MethodsClinical data and germline SNPs were collected from 472 participants. Univariate logistic regression, least absolute shrinkage selection operator (LASSO), and stepwise regression were performed to screen variables. Logistic regression (LR), support vector machine (SVM), random forest (RF), decision tree (DT), extreme gradient boosting(XGBoost) and neural network(NN) were applied to establish models. The receiver operating characteristic (ROC) curve was used to compare the models’ efficiencies. The performance of models was validated using decision curve analysis (DCA).ResultsThe LR model, which included 6 SNPs and 2 clinical variables as independent risk factors for cervical carcinogenesis, was ultimately chosen as the most optimal model. The DCA showed that the LR model had a good clinical application.DiscussionThe predictive model effectively foresees cervical cancer risk using clinical and SNP data, aiding in planning timely interventions. It provides a transparent tool for refining clinical decisions in cervical cancer management.
Funder
National Natural Science Foundation of China