Author:
Yao Xiaohui,Jiang Xiaohan,Luo Haoran,Liang Hong,Ye Xiufen,Wei Yanhui,Cong Shan
Abstract
Abstract
Background
Integrating multi-omics data is emerging as a critical approach in enhancing our understanding of complex diseases. Innovative computational methods capable of managing high-dimensional and heterogeneous datasets are required to unlock the full potential of such rich and diverse data.
Methods
We propose a Multi-Omics integration framework with auxiliary Classifiers-enhanced AuToencoders (MOCAT) to utilize intra- and inter-omics information comprehensively. Additionally, attention mechanisms with confidence learning are incorporated for enhanced feature representation and trustworthy prediction.
Results
Extensive experiments were conducted on four benchmark datasets to evaluate the effectiveness of our proposed model, including BRCA, ROSMAP, LGG, and KIPAN. Our model significantly improved most evaluation measurements and consistently surpassed the state-of-the-art methods. Ablation studies showed that the auxiliary classifiers significantly boosted classification accuracy in the ROSMAP and LGG datasets. Moreover, the attention mechanisms and confidence evaluation block contributed to improvements in the predictive accuracy and generalizability of our model.
Conclusions
The proposed framework exhibits superior performance in disease classification and biomarker discovery, establishing itself as a robust and versatile tool for analyzing multi-layer biological data. This study highlights the significance of elaborated designed deep learning methodologies in dissecting complex disease phenotypes and improving the accuracy of disease predictions.
Funder
National Natural Science Foundation of China
Shandong Provincial Natural Science Foundation
Natural Science Foundation of Heilongjiang Province
Fundamental Research Funds for the Central Universities
National Key Research and Development Program of China
Publisher
Springer Science and Business Media LLC
Reference48 articles.
1. Subramanian I, Verma S, Kumar S, Jere A, Anamika K. Multi-omics data integration, interpretation, and its application. Bioinforma Biol Insights. 2020;14:1177932219899051.
2. Kreitmaier P, Katsoula G, Zeggini E. Insights from multi-omics integration in complex disease primary tissues. Trends Genet. 2023;39(1):46–58. https://www.sciencedirect.com/science/article/pii/S0168952522002256.
3. Yan H, Bi L, Wang Y, Zhang X, Hou Z, Wang Q, et al. Integrative analysis of multi-omics data reveals distinct impacts of DDB1-CUL4 associated factors in human lung adenocarcinomas. Sci Rep. 2017;7(1):333.
4. Argelaguet R. Statistical methods for the integrative analysis of single-cell multi-omics data. 2021. https://www.repository.cam.ac.uk/handle/1810/315822. Accessed 16 July 2023.
5. Colomé-Tatché M, Theis FJ. Statistical single cell multi-omics integration. Curr Opin Syst Biol. 2018;7:54–59. https://www.sciencedirect.com/science/article/pii/S2452310018300039.Accessed 25 Aug 2023.