Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes
-
Published:2023-08-01
Issue:1
Volume:24
Page:
-
ISSN:1474-760X
-
Container-title:Genome Biology
-
language:en
-
Short-container-title:Genome Biol
Author:
Cobos Francisco Avila, Panah Mohammad Javad Najaf, Epps Jessica, Long Xiaochen, Man Tsz-Kwong, Chiu Hua-Sheng, Chomsky Elad, Kiner Evgeny, Krueger Michael J., di Bernardo Diego, Voloch Luis, Molenaar Jan, van Hooff Sander R., Westermann Frank, Jansky Selina, Redell Michele L., Mestdagh Pieter, Sumazin PavelORCID
Abstract
Abstract
Background
RNA profiling technologies at single-cell resolutions, including single-cell and single-nuclei RNA sequencing (scRNA-seq and snRNA-seq, scnRNA-seq for short), can help characterize the composition of tissues and reveal cells that influence key functions in both healthy and disease tissues. However, the use of these technologies is operationally challenging because of high costs and stringent sample-collection requirements. Computational deconvolution methods that infer the composition of bulk-profiled samples using scnRNA-seq-characterized cell types can broaden scnRNA-seq applications, but their effectiveness remains controversial.
Results
We produced the first systematic evaluation of deconvolution methods on datasets with either known or scnRNA-seq-estimated compositions. Our analyses revealed biases that are common to scnRNA-seq 10X Genomics assays and illustrated the importance of accurate and properly controlled data preprocessing and method selection and optimization. Moreover, our results suggested that concurrent RNA-seq and scnRNA-seq profiles can help improve the accuracy of both scnRNA-seq preprocessing and the deconvolution methods that employ them. Indeed, our proposed method, Single-cell RNA Quantity Informed Deconvolution (SQUID), which combines RNA-seq transformation and dampened weighted least-squares deconvolution approaches, consistently outperformed other methods in predicting the composition of cell mixtures and tissue samples.
Conclusions
We showed that analysis of concurrent RNA-seq and scnRNA-seq profiles with SQUID can produce accurate cell-type abundance estimates and that this accuracy improvement was necessary for identifying outcomes-predictive cancer cell subclones in pediatric acute myeloid leukemia and neuroblastoma datasets. These results suggest that deconvolution accuracy improvements are vital to enabling its applications in the life sciences.
Funder
H2020 European Research Council Division of Cancer Prevention, National Cancer Institute Cancer Prevention and Research Institute of Texas
Publisher
Springer Science and Business Media LLC
Reference54 articles.
1. Chen S, Brunskill EW, Potter SS, Dexheimer PJ, Salomonis N, Aronow BJ, Hong CI, Zhang T, Kopan R. Intrinsic age-dependent changes and cell-cell contacts regulate nephron progenitor lifespan. Dev Cell. 2015;35:49–62. 2. Gregorieff A, Liu Y, Inanlou MR, Khomchuk Y, Wrana JL. Yap-dependent reprogramming of Lgr5+ stem cells drives intestinal regeneration and cancer. Nature. 2015;526:715–8. 3. Patel AP, Tirosh I, Trombetta JJ, Shalek AK, Gillespie SM, Wakimoto H, Cahill DP, Nahed BV, Curry WT, Martuza RL. Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma. Science. 2014;344:1396–401. 4. Pfister D, Núñez NG, Pinyol R, Govaere O, Pinter M, Szydlowska M, Gupta R, Qiu M, Deczkowska A, Weiner A. NASH limits anti-tumour surveillance in immunotherapy-treated HCC. Nature. 2021;592:450–6. 5. Jordan NV, Bardia A, Wittner BS, Benes C, Ligorio M, Zheng Y, Yu M, Sundaresan TK, Licausi JA, Desai R. HER2 expression identifies dynamic functional states within circulating breast cancer cells. Nature. 2016;537:102–6.
Cited by
13 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|