Leveraging gene correlations in single cell transcriptomic data-Reference-Cited by-同舟云学术

Leveraging gene correlations in single cell transcriptomic data

Published:2023-03-15 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Silkwood Kai,Dollinger Emmanuel,Gervin Josh,Atwood Scott,Nie Qing,Lander Arthur D.^ORCID

Abstract

AbstractBACKGROUNDMany approaches have been developed to overcome technical noise in single cell RNA-sequencing (scRNAseq). As researchers dig deeper into data—looking for rare cell types, subtleties of cell states, and details of gene regulatory networks—there is a growing need for algorithms with controllable accuracy and fewerad hocparameters and thresholds. Impeding this goal is the fact that an appropriate null distribution for scRNAseq cannot simply be extracted from data when ground truth about biological variation is unknown (i.e., usually).RESULTSWe approach this problem analytically, assuming that scRNAseq data reflect only cell heterogeneity (what we seek to characterize), transcriptional noise (temporal fluctuations randomly distributed across cells), and sampling error (i.e., Poisson noise). We analyze scRNAseq data without normalization—a step that skews distributions, particularly for sparse data—and calculatep-values associated with key statistics. We develop an improved method for selecting features for cell clustering and identifying gene-gene correlations, both positive and negative. Using simulated data, we show that this method, which we call BigSur (Basic Informatics andGeneStatistics fromUnnormalizedReads), captures even weak yet significant correlation structures in scRNAseq data. Applying BigSur to data from a clonal human melanoma cell line, we identify thousands of correlations that, when clustered without supervision into gene communities, align with known cellular components and biological processes, and highlight potentially novel cell biological relationships.CONCLUSIONSNew insights into functionally relevant gene regulatory networks can be obtained using a statistically grounded approach to the identification of gene-gene correlations.

Publisher

Cold Spring Harbor Laboratory

Reference80 articles.

1. Tritschler S , Buttner M , Fischer DS , Lange M , Bergen V , Lickert H , Theis FJ : Concepts and limitations for learning developmental trajectories from single cell genomics. Development 2019, 146.

2. Tam PPL , Ho JWK : Cellular diversity and lineage trajectory: insights from mouse single cell transcriptomes. Development 2020, 147.

3. Nguyen H , Tran D , Tran B , Pehlivan B , Nguyen T : A comprehensive survey of regulatory network inference methods using single cell RNA sequencing data. Brief Bioinform 2021, 22.

4. : Automatic cell type identification methods for single-cell RNA sequencing;Comput Struct Biotechnol J,2021

5. Junttila S , Smolander J , Elo LL : Benchmarking methods for detecting differential states between conditions from multi-subject single-cell RNA-seq data. Brief Bioinform 2022, 23.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SciGeneX: Enhancing transcriptional analysis through gene module detection in single-cell and spatial transcriptomics data;2024-03-20

2. Uncovering Minimal Pathways in Melanoma Initiation;2023-12-10