Author:
Ellen Jacob G.,Jacob Etai,Nikolaou Nikos,Markuzon Natasha
Abstract
AbstractThe ability to accurately predict non-small cell lung cancer (NSCLC) patient survival is crucial for informing physician decision-making, and the increasing availability of multi-omics data offers the promise of enhancing prognosis predictions. We present a multimodal integration approach that leverages microRNA, mRNA, DNA methylation, long non-coding RNA (lncRNA) and clinical data to predict NSCLC survival and identify patient subtypes, utilizing denoising autoencoders for data compression and integration. Survival performance for patients with lung adenocarcinoma (LUAD) and squamous cell carcinoma (LUSC) was compared across modality combinations and data integration methods. Using The Cancer Genome Atlas data, our results demonstrate that survival prediction models combining multiple modalities outperform single modality models. The highest performance was achieved with a combination of only two modalities, lncRNA and clinical, at concordance indices (C-indices) of 0.69 ± 0.03 for LUAD and 0.62 ± 0.03 for LUSC. Models utilizing all five modalities achieved mean C-indices of 0.67 ± 0.04 and 0.63 ± 0.02 for LUAD and LUSC, respectively, while the best individual modality performance reached C-indices of 0.64 ± 0.03 for LUAD and 0.59 ± 0.03 for LUSC. Analysis of biological differences revealed two distinct survival subtypes with over 900 differentially expressed transcripts.
Publisher
Springer Science and Business Media LLC
Reference54 articles.
1. Sung, H. et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71, 209–249 (2021).
2. Lee, B. et al. DeepBTS: Prediction of recurrence-free survival of non-small cell lung cancer using a time-binned deep neural network. Sci. Rep. 10, 1952 (2020).
3. Wang, J. et al. SurvNet: A novel deep neural network for lung cancer survival analysis with missing values. Front. Oncol. https://doi.org/10.3389/fonc.2020.588990 (2021).
4. Sun, Z., Wigle, D. A. & Yang, P. Non-overlapping and non-cell-type-specific gene expression signatures predict lung cancer survival. J. Clin. Oncol. Off. J. Am. Soc. Clin. Oncol. 26, 877–883 (2008).
5. Zou, X., Hu, Z., Huang, C. & Chang, J. A seven-gene signature with close immune correlation was identified for survival prediction of lung adenocarcinoma. Med. Sci. Monit. Int. Med. J. Exp. Clin. Res. 26, e924269-1-e924269-18 (2020).