Author:
Zhang Xiang,Chen Zhuo,Bhadani Rahul,Cao Siyang,Lu Meng,Lytal Nicholas,Chen Yin,An Lingling
Abstract
Single-cell RNA sequencing (scRNA-seq) reveals the transcriptome diversity in heterogeneous cell populations as it allows researchers to study gene expression at single-cell resolution. The latest advances in scRNA-seq technology have made it possible to profile tens of thousands of individual cells simultaneously. However, the technology also increases the number of missing values, i. e, dropouts, from technical constraints, such as amplification failure during the reverse transcription step. The resulting sparsity of scRNA-seq count data can be very high, with greater than 90% of data entries being zeros, which becomes an obstacle for clustering cell types. Current imputation methods are not robust in the case of high sparsity. In this study, we develop a Neural Network-based Imputation for scRNA-seq count data, NISC. It uses autoencoder, coupled with a weighted loss function and regularization, to correct the dropouts in scRNA-seq count data. A systematic evaluation shows that NISC is an effective imputation approach for handling sparse scRNA-seq count data, and its performance surpasses existing imputation methods in cell type identification.
Subject
Genetics (clinical),Genetics,Molecular Medicine
Reference57 articles.
1. M3Drop: Dropout-Based Feature Selection for scRNASeq;Andrews;Bioinformatics,2019
2. Single Cells Make Big Data: New Challenges and Opportunities in Transcriptomics;Angerer;Curr. Opin. Syst. Biol.,2017
3. Conceptual and empirical comparison of dimensionality reduction algorithms (pca, kpca, lda, mds, svd, lle, isomap, le, ica, t-sne);Anowar;Comp. Sci. Rev.,2021
4. DeepImpute: an Accurate, Fast, and Scalable Deep Neural Network Method to Impute Single-Cell RNA-Seq Data;Arisdakessian;Genome Biol.,2019
5. Imputation of Single-Cell Gene Expression with an Autoencoder Neural Network;Badsha;Quant Biol.,2020
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献