Author:
Yao Xiaohui,Jiang Xiaohan,Luo Haoran,Liang Hong,Ye Xiufen,Wei Yanhui,Cong Shan
Abstract
AbstractBackgroundIntegrating multi-omics data is emerging as a critical approach in enhancing our understanding of complex diseases. Innovative computational methods capable of managing high-dimensional and heterogeneous datasets are required to unlock the full potential of such rich and diverse data.MethodsWe propose a Multi-Omics integration framework with auxiliary Classifiers-enhanced AuToencoders (MOCAT), for comprehensive utilization of both intra- and inter-omics information. Additionally, attention mechanisms with confidence learning are incorporated for enhanced feature representation and trustworthy prediction.ResultsExtensive experiments were conducted on four benchmark datasets to evaluate the effectiveness of our proposed model, including BRCA, ROSMAP, LGG, and KIPAN. Our model significantly improved most evaluation measurements and consistently surpassed the state-of-the-art methods. Ablation studies showed that the auxiliary classifiers significantly boosted classification accuracy in both the ROSMAP and LGG datasets. Moreover, the attention mechanisms and confidence evaluation block contributed to improvements in the predictive accuracy and generalizability of our model.ConclusionsThe proposed framework exhibits superior performance in disease classification and biomarker discovery, establishing itself as a robust and versatile tool for analyzing multi-layer biological data. This study highlights the significance of elaborated designed deep learning methodologies in dissecting complex disease phenotypes and improving the accuracy of disease predictions.
Publisher
Cold Spring Harbor Laboratory
Reference48 articles.
1. Multi-omics data integration, interpretation, and its application;Bioinformatics and biology insights,2020
2. Insights from multi-omics integration in complex disease primary tissues
3. Integrative analysis of multi-omics data reveals distinct impacts of ddb1-cul4 associated factors in human lung adenocarcinomas;Scientific reports,2017
4. Argelaguet, R. : Statistical methods for the integrative analysis of single-cell multiomics data (2021)
5. Statistical single cell multi-omics integration;Current Opinion in Systems Biology,2018