Author:
Arisdakessian Cedric,Poirion Olivier,Yunits Breck,Zhu Xun,Garmire Lana X.
Abstract
BackgroundSingle-cell RNA sequencing (scRNA-seq) offers new opportunities to study gene expression of tens of thousands of single cells simultaneously. However, a significant problem of current scRNA-seq data is the large fractions of missing values or “dropouts” in gene counts. Incorrect handling of dropouts may affect downstream bioinformatics analysis. As the number of scRNA-seq datasets grows drastically, it is crucial to have accurate and efficient imputation methods to handle these dropouts.MethodsWe present DeepImpute, a deep neural network based imputation algorithm. The architecture of DeepImpute efficiently uses dropout layers and loss functions to learn patterns in the data, allowing for accurate imputation.ResultsOverall DeepImpute yields better accuracy than other publicly available scRNA-Seq imputation methods on experimental data, as measured by mean squared error or Pearson’s correlation coefficient. Moreover, its efficient implementation provides significantly higher performance over the other methods as dataset size increases. Additionally, as a machine learning method, DeepImpute allows to use a subset of data to train the model and save even more computing time, without much sacrifice on the prediction accuracy.ConclusionsDeepImpute is an accurate, fast and scalable imputation tool that is suited to handle the ever increasing volume of scRNA-seq data. The package is freely available at https://github.com/lanagarmire/DeepImpute
Publisher
Cold Spring Harbor Laboratory
Reference43 articles.
1. Abadi,M. et al. (2016) TensorFlow: A System for Large-Scale Machine Learning. In, OSDI., pp.265–283.
2. Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer;Metabolomics Data. J. Proteome Res.,2018
3. Andrews,T.S. and Hemberg,M. (2016) Modelling dropouts allows for unbiased identification of marker genes in scRNASeq experiments. bioRxiv, 065094.
4. MISSING DATA IMPUTATION IN THE ELECTRONIC HEALTH RECORD USING DEEPLY LEARNED AUTOENCODERS;Pac. Symp. Biocomput,2017
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献