CATD: A reproducible pipeline for selecting cell-type deconvolution methods across tissues

Published: 2023-01-20

Author:

Vathrakokoili Pournara Anna, Miao Zhichao, Beker Ozgur Yilmaz, Nolte Nadja, Brazma Alvis, Papatheodorou Irene

Abstract

Cell-type deconvolution methods aim to infer cell composition from bulk transcriptomic data. The proliferation of developed methods, coupled with the inconsistent results obtained in many cases, highlights the pressing need for guidance in the selection of appropriate methods. The growing accessibility of systematic single-cell RNA sequencing datasets, often accompanied by bulk expression from related samples, makes it possible to benchmark the existing methods more objectively. Here, we propose a comprehensive assessment of 29 deconvolution methods, leveraging single-cell RNA-sequencing data from different tissues. We evaluate deconvolution across a wide range of simulation scenarios and show that single-cell regression-based deconvolution methods perform well, although their performance depends heavily on the reference. We also study the impact of bulk-reference differences, including those associated with sample, study, and technology. We provide validation using a gold standard dataset from mononuclear cells and suggest a consensus prediction of proportions when ground truth is not available. We validated the consensus method on data from the stomach and studied its spillover effect. Lastly, we suggest that the Critical Assessment of Transcriptomic Deconvolution (CATD) pipeline can be employed for simultaneous deconvolution of hundreds of bulk samples, and we envision it being used to speed up the evaluation of newly developed methods.

Key Points

– Thorough assessment of 29 deconvolution methods, leveraging diverse single-cell RNA-sequencing data from various tissues, alongside extensive simulations and validation against known ground truth data.

– Emphasis on the pivotal role of reference selection, tissue type, and technological nuances in determining the efficacy of deconvolution methods.

– Introduction of the user-friendly and robust Critical Assessment of Transcriptomic Deconvolution (CATD) Snakemake pipeline, enabling efficient and reproducible cell-type deconvolution in real bulk RNA-Seq datasets.
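
The abstract mentions a consensus prediction of proportions for cases where no ground truth is available, but this record does not say how that consensus is computed. The sketch below is therefore only an illustration under stated assumptions: it averages the per-cell-type proportions reported by several methods, renormalizes them to sum to 1, and scores a prediction against known proportions with RMSE. The function names, method names, and numbers are hypothetical and are not taken from the CATD pipeline.

```python
from math import sqrt
from statistics import mean


def consensus_proportions(predictions):
    """Average per-cell-type proportions across methods, then renormalize.

    `predictions` maps a method name to a {cell_type: proportion} dict.
    Assumption: a plain mean is used here; the paper's consensus may differ.
    """
    cell_types = sorted({ct for p in predictions.values() for ct in p})
    # A method that does not report a cell type is treated as predicting 0.
    averaged = {
        ct: mean(p.get(ct, 0.0) for p in predictions.values())
        for ct in cell_types
    }
    total = sum(averaged.values())
    if total == 0:
        raise ValueError("all predicted proportions are zero")
    return {ct: value / total for ct, value in averaged.items()}


def rmse(predicted, truth):
    """Root-mean-square error between two {cell_type: proportion} dicts."""
    cell_types = sorted(set(predicted) | set(truth))
    return sqrt(
        mean((predicted.get(ct, 0.0) - truth.get(ct, 0.0)) ** 2 for ct in cell_types)
    )


# Hypothetical outputs of three deconvolution methods for one bulk sample.
predictions = {
    "method_a": {"T cell": 0.50, "B cell": 0.30, "Monocyte": 0.20},
    "method_b": {"T cell": 0.40, "B cell": 0.40, "Monocyte": 0.20},
    "method_c": {"T cell": 0.60, "B cell": 0.20, "Monocyte": 0.20},
}
consensus = consensus_proportions(predictions)

# Hypothetical ground truth, e.g. from a simulated pseudo-bulk mixture.
truth = {"T cell": 0.55, "B cell": 0.25, "Monocyte": 0.20}
error = rmse(consensus, truth)
```

When ground truth exists, as in simulated pseudo-bulk samples, the same `rmse` helper can be applied to each method separately to rank them; without ground truth, only the consensus itself is available.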

Publisher

Cold Spring Harbor Laboratory

Cited by 2 articles.

1. Benchmarking second-generation methods for cell-type deconvolution of transcriptomic data; 2024-06-11

2. Expression Atlas update: insights from sequencing data at both bulk and single cell level; Nucleic Acids Research; 2023-11-22