Evaluation and comparison of multi-omics data integration methods for cancer subtyping-Reference-Cited by-同舟云学术

Evaluation and comparison of multi-omics data integration methods for cancer subtyping

Published:2021-08-12 Issue:8 Volume:17 Page:e1009224
ISSN:1553-7358
Container-title:PLOS Computational Biology
language:en
Short-container-title:PLoS Comput Biol

Author:

Duan Ran^ORCID,Gao Lin^ORCID,Gao Yong,Hu Yuxuan,Xu Han^ORCID,Huang Mingfeng^ORCID,Song Kuo,Wang Hongda,Dong Yongqiang,Jiang Chaoqun,Zhang Chenxing,Jia Songwei

Abstract

Computational integrative analysis has become a significant approach in the data-driven exploration of biological problems. Many integration methods for cancer subtyping have been proposed, but evaluating these methods has become a complicated problem due to the lack of gold standards. Moreover, questions of practical importance remain to be addressed regarding the impact of selecting appropriate data types and combinations on the performance of integrative studies. Here, we constructed three classes of benchmarking datasets of nine cancers in TCGA by considering all the eleven combinations of four multi-omics data types. Using these datasets, we conducted a comprehensive evaluation of ten representative integration methods for cancer subtyping in terms of accuracy measured by combining both clustering accuracy and clinical significance, robustness, and computational efficiency. We subsequently investigated the influence of different omics data on cancer subtyping and the effectiveness of their combinations. Refuting the widely held intuition that incorporating more types of omics data always produces better results, our analyses showed that there are situations where integrating more omics data negatively impacts the performance of integration methods. Our analyses also suggested several effective combinations for most cancers under our studies, which may be of particular interest to researchers in omics data analysis.

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Natural Sciences and Engineering Research Council of Canada Discovery Grant

Fundamental Research Funds for the Central Universities

innovation fund of xidian university

Publisher

Public Library of Science (PLoS)

Subject

Computational Theory and Mathematics,Cellular and Molecular Neuroscience,Genetics,Molecular Biology,Ecology,Modelling and Simulation,Ecology, Evolution, Behavior and Systematics

Reference113 articles.

1. Multi-omics Data Integration, Interpretation, and Its Application.;I Subramanian;Bioinform Biol Insights.,2020

2. Pattern fusion analysis by adaptive alignment of multiple heterogeneous omics data;Q Shi;Bioinformatics,2017

3. Methods for the integration of multi-omics data: mathematical aspects;M Bersanelli;BMC Bioinformatics,2016

4. Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery;NK Speicher;Bioinformatics,2015

5. Subtyping: What It is and Its Role in Precision Medicine;S Saria;IEEE Intelligent Systems,2015

Cited by 64 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Generative Models Utilizing Padding Can Efficiently Integrate and Generate Multi-Omics Data;AI;2024-09-05

2. Advance computational tools for multiomics data learning;Biotechnology Advances;2024-09

3. Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Clinicogenomic Data;Journal of the American Statistical Association;2024-08-14

4. COPS: A novel platform for multi-omic disease subtype discovery via robust multi-objective evaluation of clustering algorithms;PLOS Computational Biology;2024-08-05

5. A review of cancer data fusion methods based on deep learning;Information Fusion;2024-08