MODEC: an unsupervised clustering method integrating omics data for identifying cancer subtypes-Reference-Cited by-同舟云学术

MODEC: an unsupervised clustering method integrating omics data for identifying cancer subtypes

Published:2022-09-12 Issue:6 Volume:23 Page:
ISSN:1467-5463
Container-title:Briefings in Bioinformatics
language:en
Short-container-title:

Author:

Zhang Yanting¹,Kiryu Hisanori¹

Affiliation:

1. Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo , 7-3-1 Hongo, Bunkyo-ku, 113-0033, Tokyo, Japan

Abstract

Abstract The identification of cancer subtypes can help researchers understand hidden genomic mechanisms, enhance diagnostic accuracy and improve clinical treatments. With the development of high-throughput techniques, researchers can access large amounts of data from multiple sources. Because of the high dimensionality and complexity of multiomics and clinical data, research into the integration of multiomics data is needed, and developing effective tools for such purposes remains a challenge for researchers. In this work, we proposed an entirely unsupervised clustering method without harnessing any prior knowledge (MODEC). We used manifold optimization and deep-learning techniques to integrate multiomics data for the identification of cancer subtypes and the analysis of significant clinical variables. Since there is nonlinearity in the gene-level datasets, we used manifold optimization methodology to extract essential information from the original omics data to obtain a low-dimensional latent subspace. Then, MODEC uses a deep learning-based clustering module to iteratively define cluster centroids and assign cluster labels to each sample by minimizing the Kullback–Leibler divergence loss. MODEC was applied to six public cancer datasets from The Cancer Genome Atlas database and outperformed eight competing methods in terms of the accuracy and reliability of the subtyping results. MODEC was extremely competitive in the identification of survival patterns and significant clinical features, which could help doctors monitor disease progression and provide more suitable treatment strategies.

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Link

https://academic.oup.com/bib/article-pdf/23/6/bbac372/47143728/bbac372.pdf

Reference44 articles.

1. Projection-like retractions on matrix manifolds;Absil;SIAM Journal on Optimization,2012

2. Systematic pan-cancer analysis of tumour purity;Aran;Nat Commun,2015

3. Adaptive Control Processes

5. Manifold optimization for k-means clustering

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. TACCO: Task-guided Co-clustering of Clinical Concepts and Patient Visits for Disease Subtyping based on EHR Data;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

2. Artificial intelligence (AI) and machine learning (ML) in precision oncology: a review on enhancing discoverability through multiomics integration;The British Journal of Radiology;2023-10

3. Proteomic Profile Distinguishes New Subpopulations of Breast Cancer Patients with Different Survival Outcomes;Cancers;2023-08-24

4. Deep Learning Techniques with Genomic Data in Cancer Prognosis: A Comprehensive Review of the 2021–2023 Literature;Biology;2023-06-21

5. Subtype-DCC: decoupled contrastive clustering method for cancer subtype identification based on multi-omics data;Briefings in Bioinformatics;2023-01-25