Author:
Duan Xin,Ding Xinnan,Zhao Zhuanzhe
Abstract
Abstract
Background
Characterizing cancer molecular subtypes is crucial for improving prognosis and individualized treatment. Integrative analysis of multi-omics data has become an important approach for disease subtyping, yielding better understanding of the complex biology. Current multi-omics integration tools and methods for cancer subtyping often suffer challenges of high computational efficiency as well as the problem of weight assignment on data types.
Results
Here, we present an efficient multi-omics integration via weighted affinity and self-diffusion (MOSD) to dissect cancer heterogeneity. MOSD first construct local scaling affinity on each data type and then integrate all affinities by weighted linear combination, followed by the self-diffusion to further improve the patients’ similarities for the downstream clustering analysis. To demonstrate the effectiveness and usefulness for cancer subtyping, we apply MOSD across ten cancer types with three measurements (Gene expression, DNA methylation, miRNA).
Conclusions
Our approach exhibits more significant differences in patient survival and computationally efficient benchmarking against several state-of-art integration methods and the identified molecular subtypes reveal strongly biological interpretability. The code as well as its implementation are available in GitHub: https://github.com/DXCODEE/MOSD.
Funder
National Natural Science Foundation of China
Publisher
Springer Science and Business Media LLC