Abstract
Our understanding of cell types has advanced considerably with the publication of single-cell atlases. Marker genes play an essential role in experimental validation and in computational analyses such as physiological characterization through pathway enrichment, annotation, and deconvolution. However, a framework for quantifying marker replicability and selecting replicable markers is currently lacking. Here, using high-quality data from the Brain Initiative Cell Census Network (BICCN), we systematically investigate marker replicability for 85 neuronal cell types. We show that, due to dataset-specific noise, we need to combine 5 datasets to obtain robust differentially expressed (DE) genes, particularly for rare populations and lowly expressed genes. We estimate that 10 to 200 meta-analytic markers provide optimal performance in downstream computational tasks. Replicable marker lists condense single-cell atlases into interpretable and generalizable information about cell types, opening avenues for downstream applications, including cell type annotation, selection of gene panels, and bulk data deconvolution.
Publisher
Cold Spring Harbor Laboratory
Cited by
3 articles.