SIMPLEs: a single-cell RNA sequencing imputation strategy preserving gene modules and cell clusters variation-Reference-Cited by-同舟云学术

SIMPLEs: a single-cell RNA sequencing imputation strategy preserving gene modules and cell clusters variation

Published:2020-01-14 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Hu Zhirui,Zu Songpeng,Liu Jun S.

Abstract

AbstractA main challenge in analyzing single-cell RNA sequencing (scRNASeq) data is to reduce technical variations yet retain cell heterogeneity. Due to low mRNAs content per cell and molecule losses during the experiment (called “dropout”), the gene expression matrix has substantial zero read counts. Existing imputation methods either treat each cell or each gene identically and independently, which oversimplifies the gene correlation and cell type structure. We propose a statistical model-based approach, called SIMPLEs, which iteratively identifies correlated gene modules and cell clusters and imputes dropouts customized for individual gene module and cell type. Simultaneously, it quantifies the uncertainty of imputation and cell clustering. Optionally, SIMPLEs can integrate bulk RNASeq data for estimating dropout rates. In simulations, SIMPLEs performed significantly better than prevailing scRNASeq imputation methods by various metrics. By applying SIMPLEs to several real data sets, we discovered gene modules that can further classify subtypes of cells. Our imputations successfully recovered the expression trends of marker genes in stem cell differentiation and can discover putative pathways regulating biological processes.

Publisher

Cold Spring Harbor Laboratory

Reference31 articles.

1. Single-cell RNA sequencing to explore immune cell heterogeneity;Nat. Rev. Immunol,2017

2. Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors

3. In situ click chemistry generation of cyclooxygenase-2 inhibitors

4. Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq

5. Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Review of Integrative Imputation for Multi-Omics Datasets;Frontiers in Genetics;2020-10-15

2. A review of computational strategies for denoising and imputation of single-cell transcriptomic data;Briefings in Bioinformatics;2020-10-01