Author:
Hira Muta Tah,Razzaque M. A.,Angione Claudio,Scrivens James,Sawan Saladin,Sarker Mosharraf
Abstract
AbstractCancer is a complex disease that deregulates cellular functions at various molecular levels (e.g., DNA, RNA, and proteins). Integrated multi-omics analysis of data from these levels is necessary to understand the aberrant cellular functions accountable for cancer and its development. In recent years, Deep Learning (DL) approaches have become a useful tool in integrated multi-omics analysis of cancer data. However, high dimensional multi-omics data are generally imbalanced with too many molecular features and relatively few patient samples. This imbalance makes a DL based integrated multi-omics analysis difficult. DL-based dimensionality reduction technique, including variational autoencoder (VAE), is a potential solution to balance high dimensional multi-omics data. However, there are few VAE-based integrated multi-omics analyses, and they are limited to pancancer. In this work, we did an integrated multi-omics analysis of ovarian cancer using the compressed features learned through VAE and an improved version of VAE, namely Maximum Mean Discrepancy VAE (MMD-VAE). First, we designed and developed a DL architecture for VAE and MMD-VAE. Then we used the architecture for mono-omics, integrated di-omics and tri-omics data analysis of ovarian cancer through cancer samples identification, molecular subtypes clustering and classification, and survival analysis. The results show that MMD-VAE and VAE-based compressed features can respectively classify the transcriptional subtypes of the TCGA datasets with an accuracy in the range of 93.2-95.5% and 87.1-95.7%. Also, survival analysis results show that VAE and MMD-VAE based compressed representation of omics data can be used in cancer prognosis. Based on the results, we can conclude that (i) VAE and MMD-VAE outperform existing dimensionality reduction techniques, (ii) integrated multi-omics analyses perform better or similar compared to their mono-omics counterparts, and (iii) MMD-VAE performs better than VAE in most omics dataset.
Publisher
Springer Science and Business Media LLC
Reference69 articles.
1. UK. Cancer Research, Ovarian cancer statistics. https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/ovarian-cancer#heading-One.
2. Torre, L. A. et al. Ovarian cancer statistics, 2018. CA 68, 284–296 (2018).
3. Doubeni, C. A., Doubeni, A. R. & Myers, A. E. Diagnosis and management of ovarian cancer. Am. Fam. Physician 93, 937–944 (2016).
4. Rosenthal, A. N., Menon, U. & Jacobs, I. J. Screening for ovarian cancer. Clin. Ostet. Gynecol. 49, 433–447 (2006).
5. Lu, M. & Zhan, X. The crucial role of multiomic approach in cancer research and clinically relevant outcomes. EPMA J. 9, 77–102 (2018).
Cited by
55 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献