The construction of transcriptional risk scores for breast cancer based on lightGBM and multiple omics data
-
Published:2022
Issue:12
Volume:19
Page:12353-12370
-
ISSN:1551-0018
-
Container-title:Mathematical Biosciences and Engineering
-
language:
-
Short-container-title:MBE
Author:
Pan Jianqiao12, Ma Baoshan2, Hou Xiaoyu2, Li Chongyang2, Xiong Tong2, Gong Yi2, Song Fengju1
Affiliation:
1. Department of Epidemiology and Biostatistics, Key Laboratory of Molecular Cancer Epidemiology, Tianjin, National Clinical Research Center of Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin 300060, China 2. School of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
Abstract
<abstract>
<sec><title>Background</title><p>Polygenic risk score (PRS) can evaluate the individual-level genetic risk of breast cancer. However, standalone single nucleotide polymorphisms (SNP) data used for PRS may not provide satisfactory prediction accuracy. Additionally, current PRS models based on linear regression have insufficient power to leverage non-linear effects from thousands of associated SNPs. Here, we proposed a transcriptional risk score (TRS) based on multiple omics data to estimate the risk of breast cancer.</p>
</sec>
<sec><title>Methods</title><p>The multiple omics data and clinical data of breast invasive carcinoma (BRCA) were collected from the cancer genome atlas (TCGA) and the gene expression omnibus (GEO). First, we developed a novel TRS model for BRCA utilizing single omic data and LightGBM algorithm. Subsequently, we built a combination model of TRS derived from each omic data to further improve the prediction accuracy. Finally, we performed association analysis and prognosis prediction to evaluate the utility of the TRS generated by our method.</p>
</sec>
<sec><title>Results</title><p>The proposed TRS model achieved better predictive performance than the linear models and other ML methods in single omic dataset. An independent validation dataset also verified the effectiveness of our model. Moreover, the combination of the TRS can efficiently strengthen prediction accuracy. The analysis of prevalence and the associations of the TRS with phenotypes including case-control and cancer stage indicated that the risk of breast cancer increases with the increases of TRS. The survival analysis also suggested that TRS for the cancer stage is an effective prognostic metric of breast cancer patients.</p>
</sec>
<sec><title>Conclusions</title><p>Our proposed TRS model expanded the current definition of PRS from standalone SNP data to multiple omics data and outperformed the linear models, which may provide a powerful tool for diagnostic and prognostic prediction of breast cancer.</p>
</sec>
</abstract>
Publisher
American Institute of Mathematical Sciences (AIMS)
Subject
Applied Mathematics,Computational Mathematics,General Agricultural and Biological Sciences,Modeling and Simulation,General Medicine
Reference48 articles.
1. K. L. Britt, J. Cuzick, K. Phillips, Key steps for effective breast cancer prevention, Nat. Rev. Cancer, 20 (2020), 417–436. https://doi.org/10.1038/s41568-020-0266-x 2. C. Wild, E. Weiderpass, B. Stewart, World cancer report: cancer research for cancer prevention, Lyon: Int. Agency Res. Cancer, 1 (2020), 23–33. https://www.paho.org/en/node/69005 3. D. Thompson, D. Easton, The genetic epidemiology of breast cancer genes, J. Mammary Gland Biol. Neoplasia, 9 (2004), 221–236. https://doi.org/10.1023/B:JOMG.0000048770.90334.3b 4. L. Wu, W. Shi, J. Long, X. Guo, K. Michailidou, J. Beesley, et al., A transcriptome-wide association study of 229,000 women identifies new candidate susceptibility genes for breast cancer, Nat. Genet., 50 (2018), 968–978. https://doi.org/10.1038/s41588-018-0132-x 5. P. Maas, M. Barrdahl, A. D. Joshi, P. L. Auer, M. M. Gaudet, R. L. Milne, et al., Breast cancer risk from modifiable and nonmodifiable risk factors among white women in the United States, JAMA Oncol., 2 (2016), 1295–1302. https://doi.org/10.1001/jamaoncol.2016.1025
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|