Author:
Jiang Hanliang,Xu Shan,Chen Chunhua
Abstract
Background
Lung adenocarcinoma (LUAD) is a major cause of cancer death. Therefore, identifying potential prognostic risk factors is critical to improve the survival of patients with LUAD.
Methods
Relevant datasets were downloaded from the TCGA and GEO databases to screen for differentially expressed genes (DEGs). Univariate Cox, LASSO regression, and multivariate Cox analyses were performed on the DEGs combined with TCGA clinical data, yielding a risk assessment model based on 10 feature genes.
Results
Patients were divided into high- and low-risk groups at the median risk score, and survival time in the high-risk group was significantly shorter than in the low-risk group. ROC analysis gave AUC values of 0.753, 0.724, and 0.73 for 1-, 3-, and 5-year survival, respectively, indicating that the model predicted prognosis accurately; this was also verified in the external dataset GSE72094. In addition, the risk score correlated significantly with the clinical stage of LUAD, with later stages corresponding to higher risk scores. We then performed survival analysis on each of the 10 feature genes independently in the TCGA-LUAD dataset through the GEPIA database and found that high expression of 6 genes (COL5A2, PLEK2, BAIAP2L2, S100P, ZIC2, SFXN1) was associated with poor prognosis in LUAD patients.
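The median-based grouping described above can be illustrated with a minimal sketch. It assumes the common convention that a patient's risk score is the sum of each feature gene's expression weighted by its multivariate Cox coefficient; the abstract does not state the formula or the fitted coefficients, so the gene names, coefficients, and expression values below are hypothetical placeholders.

```python
from statistics import median

# Hypothetical coefficients for three placeholder genes; the abstract
# does not report the fitted values of the 10-gene model.
COEFFICIENTS = {"GENE_A": 0.30, "GENE_B": -0.10, "GENE_C": 0.25}


def risk_score(expression, coefficients):
    """Return the coefficient-weighted sum of gene expression values."""
    return sum(coefficients[gene] * expression[gene] for gene in coefficients)


def split_by_median(scores):
    """Label a patient 'high' if the score exceeds the cohort median, else 'low'."""
    cutoff = median(scores.values())
    return {pid: ("high" if score > cutoff else "low") for pid, score in scores.items()}


# Made-up expression values for four hypothetical patients.
patients = {
    "P1": {"GENE_A": 2.0, "GENE_B": 1.0, "GENE_C": 0.5},
    "P2": {"GENE_A": 0.5, "GENE_B": 2.0, "GENE_C": 0.2},
    "P3": {"GENE_A": 3.0, "GENE_B": 0.5, "GENE_C": 1.0},
    "P4": {"GENE_A": 1.0, "GENE_B": 1.0, "GENE_C": 1.0},
}

scores = {pid: risk_score(expr, COEFFICIENTS) for pid, expr in patients.items()}
groups = split_by_median(scores)
```

With these placeholder values, `groups` assigns P1 and P3 to the high-risk group and P2 and P4 to the low-risk group; survival curves for the two groups would then be compared in a separate step.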
Conclusion
In summary, this study established a 10-gene risk assessment model and evaluated its value in predicting LUAD prognosis, providing a new method for prognosis prediction in LUAD.
Funder
Education of Zhejiang Province
Natural Science Foundation of Zhejiang Province
Publisher
Springer Science and Business Media LLC
Subject
Cancer Research,Genetics,Oncology