Abstract
AbstractBulk deconvolution with single-cell/nucleus RNA-seq data is critical for understanding heterogeneity in complex biological samples, yet the technological discrepancy across sequencing platforms limits deconvolution accuracy. To address this, we introduce an experimental design to match inter-platform biological signals, hence revealing the technological discrepancy, and then develop a deconvolution framework called DeMixSC using the better-matched, i.e., benchmark, data. Built upon a novel weighted nonnegative least-squares framework, DeMixSC identifies and adjusts genes with high technological discrepancy and aligns the benchmark data with large patient cohorts of matched-tissue-type for large-scale deconvolution. Our results using a benchmark dataset of healthy retinas suggest much-improved deconvolution accuracy. Further analysis of a cohort of 453 patients with age-related macular degeneration supports the broad applicability of DeMixSC. Our findings reveal the impact of technological discrepancy on deconvolution performance and underscore the importance of a well-matched dataset to resolve this challenge. The developed DeMixSC framework is generally applicable for deconvolving large cohorts of disease tissues, and potentially cancer.
Publisher
Cold Spring Harbor Laboratory
Reference47 articles.
1. A roadmap for the Human Developmental Cell Atlas
2. Understanding tumour endothelial cell heterogeneity and function from single-cell omics;Nature Reviews Cancer,2023
3. Applying high-dimensional single-cell technologies to the analysis of cancer immunotherapy;Nature Reviews Clinical Oncology,2020
4. From bulk, single-cell to spatial RNA sequencing;Investigative Ophthalmology & Visual Science,2021
5. RNA sequencing: the teenage years;Nature Reviews Genetics,2019