Affiliation:
1. College of Mathematics and Statistics, Shenzhen University, 518000, Guangdong, China
2. College of Life and Health Sciences, Northeastern University, Shenyang, 110169, China
3. Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, 610056, China
Abstract
Motivation
Single-cell RNA sequencing (scRNA-seq) technology has attracted extensive attention in the biomedical field. It measures gene expression and profiles the transcriptome at the single-cell level, enabling the identification of cell types through unsupervised clustering. Data imputation and dimension reduction are conducted before clustering because scRNA-seq data have a high ‘dropout’ rate, noise and linear inseparability. However, performing dimension reduction, imputation and clustering as independent steps cannot fully characterize the pattern of the scRNA-seq data, resulting in poor clustering performance. Herein, we propose a novel and accurate algorithm, SSNMDI, that utilizes a joint learning approach to simultaneously perform imputation, dimensionality reduction and cell clustering in a non-negative matrix factorization (NMF) framework. In addition, we integrate cell annotations as prior information, transforming the joint learning into a semi-supervised NMF model. Through experiments on 14 datasets, we demonstrate that SSNMDI has a faster convergence speed, better dimensionality reduction performance and more accurate cell clustering performance than previous methods, providing an accurate and robust strategy for analyzing scRNA-seq data. Biological analyses, including pseudotime analysis, gene ontology and survival analysis, are also conducted to validate the biological significance of our method. We believe that we are among the first to introduce imputation, partial label information, dimension reduction and clustering to the single-cell field.
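To make the idea of joint imputation, dimension reduction and clustering within NMF concrete, the following is a minimal sketch of a generic masked NMF with multiplicative updates. It is not the SSNMDI algorithm and omits the semi-supervised label term. The function names (`masked_nmf`, `impute_and_cluster`), the choice to treat every zero entry as a dropout, and the argmax cluster assignment are all assumptions made for illustration.

```python
import numpy as np


def masked_nmf(X, k, n_iter=200, eps=1e-10, seed=0):
    """Generic weighted NMF (X ~ W @ H) fitted only on observed entries.

    Illustrative assumption: every zero in X is treated as a dropout and
    excluded from the loss via a binary mask M. Not the SSNMDI objective.
    """
    rng = np.random.default_rng(seed)
    n_genes, n_cells = X.shape
    M = (X > 0).astype(float)              # 1 = observed, 0 = assumed dropout
    W = rng.random((n_genes, k)) + eps     # gene-by-factor basis matrix
    H = rng.random((k, n_cells)) + eps     # factor-by-cell embedding
    MX = M * X
    for _ in range(n_iter):
        # Multiplicative updates for the masked Frobenius loss
        H *= (W.T @ MX) / (W.T @ (M * (W @ H)) + eps)
        W *= (MX @ H.T) / ((M * (W @ H)) @ H.T + eps)
    return W, H


def impute_and_cluster(X, k):
    """Return the imputed matrix, low-dimensional embedding and cell labels."""
    W, H = masked_nmf(X, k)
    X_imputed = np.where(X > 0, X, W @ H)  # fill only the masked entries
    labels = H.argmax(axis=0)              # assign each cell to its dominant factor
    return X_imputed, H, labels
```

Here `X` is assumed to be a non-negative genes-by-cells expression matrix and `k` the number of expected cell types. One factorization supplies all three outputs: `W @ H` fills the masked entries, `H` serves as the reduced representation, and the dominant factor per column gives a cluster label.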
Availability and implementation
The source code for SSNMDI is available at https://github.com/yushanqiu/SSNMDI.
Funder
National Natural Science Foundation of China
Guangdong Basic and Applied Basic Research Foundation
Natural Science Foundation of SZU
Special Projects of the Central Government in Guidance of Local Science and Technology Development
Publisher
Oxford University Press (OUP)
Subject
Molecular Biology, Information Systems
Cited by
8 articles.