Differences in molecular sampling and data processing explain variation among single-cell and single-nucleus RNA-seq experiments

Published: 2024-02 Issue: 2 Volume: 34 Page: 179-188
ISSN: 1088-9051
Container-title: Genome Research
Language: en
Short-container-title: Genome Res.

Author:

Chamberlin John T., Lee Younghee, Marth Gabor T., Quinlan Aaron R.

Abstract

A mechanistic understanding of the biological and technical factors that impact transcript measurements is essential to designing and analyzing single-cell and single-nucleus RNA sequencing experiments. Nuclei contain the same pre-mRNA population as cells, but they contain a small subset of the mRNAs. Nonetheless, early studies argued that single-nucleus analysis yielded results comparable to cellular samples if pre-mRNA measurements were included. However, typical workflows do not distinguish between pre-mRNA and mRNA when estimating gene expression, and variation in their relative abundances across cell types has received limited attention. These gaps are especially important given that incorporating pre-mRNA has become commonplace for both assays, despite known gene length bias in pre-mRNA capture. Here, we reanalyze public data sets from mouse and human to describe the mechanisms and contrasting effects of mRNA and pre-mRNA sampling on gene expression and marker gene selection in single-cell and single-nucleus RNA-seq. We show that pre-mRNA levels vary considerably among cell types, which mediates the degree of gene length bias and limits the generalizability of a recently published normalization method intended to correct for this bias. As an alternative, we repurpose an existing post hoc gene length–based correction method from conventional RNA-seq gene set enrichment analysis. Finally, we show that inclusion of pre-mRNA in bioinformatic processing can impart a larger effect than assay choice itself, which is pivotal to the effective reuse of existing data. These analyses advance our understanding of the sources of variation in single-cell and single-nucleus RNA-seq experiments and provide useful guidance for future studies.

Funder

National Institutes of Health National Library of Medicine

Publisher

Cold Spring Harbor Laboratory

References: 49 articles.

1. Enhancing droplet-based single-nucleus RNA-seq resolution using the semi-supervised machine learning classifier DIEM

2. Single-nucleus and single-cell transcriptomes compared in matched cortical cell types

3. Integrating single-cell transcriptomic data across different conditions, technologies, and species

4. A human cell atlas of fetal gene expression

5. A single-nuclei RNA sequencing study of Mendelian and sporadic AD in the human brain

Cited by 1 article.

1. Forseti: a mechanistic and predictive model of the splicing status of scRNA-seq reads; Bioinformatics; 2024-06-28