Precise identification of cell states altered in disease with healthy single-cell references-Reference-Cited by-同舟云学术

Precise identification of cell states altered in disease with healthy single-cell references

Published:2022-11-10 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Dann Emma^ORCID,Teichmann Sarah A.^ORCID,Marioni John C.^ORCID

Abstract

AbstractSingle cell genomics is a powerful tool to distinguish altered cell states in disease tissue samples, through joint analysis with healthy reference datasets. Collections of data from healthy individuals are being integrated in cell atlases that provide a comprehensive view of cellular phenotypes in a tissue. However, it remains unclear whether atlas datasets are suitable references for disease-state identification, or whether matched control samples should be employed, to minimise false discoveries driven by biological and technical confounders. Here we quantitatively compare the use of atlas and control datasets as references for identification of disease-associated cell states, on simulations and real disease scRNA-seq datasets. We find that reliance on a single type of reference dataset introduces false positives. Conversely, using an atlas dataset as reference for latent space learning followed by differential analysis against a matched control dataset leads to precise identification of disease-associated cell states. We show that, when an atlas dataset is available, it is possible to reduce the number of control samples without increasing the rate of false discoveries. Using a cell atlas of blood cells from 12 studies to contextualise data from a case-control COVID-19 cohort, we sensitively detect cell states associated with infection, and distinguish heterogeneous pathological cell states associated with distinct clinical severities. Our analysis provides guiding principles for design of disease cohort studies and efficient use of cell atlases within the Human Cell Atlas.

Publisher

Cold Spring Harbor Laboratory

Reference47 articles.

1. Single-cell RNA-seq reveals ectopic and aberrant lung-resident cell populations in idiopathic pulmonary fibrosis

2. glmGamPoi: fitting Gamma-Poisson generalized linear models on single cell count data’;Bioinformatics,2021

3. Boyeau, P. et al. (2022) ‘Deep generative modeling for quantifying sample-level heterogeneity in single-cell omics’, bioRxiv. https://doi.org/10.1101/2022.10.04.510898.

4. Quantifying the effect of experimental perturbations at single-cell resolution’;Nature biotechnology,2021

5. Chazarra-Gil, R. et al. (2021) ‘Flexible comparison of batch correction methods for single-cell RNA-seq using BatchBench’, Nucleic Acids Research [Preprint]. https://doi.org/10.1093/nar/gkab004.

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multimodal weakly supervised learning to identify disease-specific changes in single-cell atlases;2024-07-29

2. Leveraging neighborhood representations of single-cell data to achieve sensitive DE testing with miloDE;Genome Biology;2024-07-18

3. High order expression dependencies finely resolve cryptic states and subtypes in single cell data;2023-12-18

4. Identification of cell types, states and programs by learning gene set representations;2023-09-12

5. Establishing a human bone marrow single cell reference atlas to study ageing and diseases;Frontiers in Immunology;2023-03-15