Abstract
Single-cell genomic technologies provide an unprecedented opportunity to define molecular cell types in a data-driven fashion, but present unique data integration challenges. Many analyses require “mosaic integration”, including both features shared across datasets and features exclusive to a single experiment. Previous computational integration approaches require that the input matrices share the same number of either genes or cells, and thus can use only shared features. To address this limitation, we derive a nonnegative matrix factorization algorithm for integrating single-cell datasets containing both shared and unshared features. The key advance is incorporating an additional metagene matrix that allows unshared features to inform the factorization. We demonstrate that incorporating unshared features significantly improves integration of single-cell RNA-seq, spatial transcriptomic, SNARE-seq, and cross-species datasets. We have incorporated the UINMF algorithm into the open-source LIGER R package (https://github.com/welch-lab/liger).
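The abstract does not state the objective function, so the following is only a minimal sketch of how an extra metagene matrix for unshared features could enter a nonnegative factorization of one dataset. The objective form, the function name `uinmf_objective`, and the symbols `E`, `P`, `W`, `V`, `U`, `H`, and `lam` are all assumptions made for illustration; this is not the LIGER API and not necessarily the exact formulation used by the authors.

```python
import numpy as np

def uinmf_objective(E, P, W, V, U, H, lam):
    """Assumed per-dataset loss for a factorization with unshared features.

    E   : shared features x cells
    P   : unshared features x cells
    W   : shared metagenes (shared features x k), common to all datasets
    V   : dataset-specific metagenes on the shared features
    U   : extra metagene matrix covering the unshared features
    H   : cell factor loadings (k x cells)
    lam : weight of the penalty on the dataset-specific component
    """
    X = np.vstack([E, P])                          # stack shared and unshared features
    shared = np.vstack([W, np.zeros_like(U)])      # W has no rows for unshared features
    specific = np.vstack([V, U])                   # dataset-specific part spans both
    fit = np.linalg.norm(X - (shared + specific) @ H, "fro") ** 2
    penalty = lam * np.linalg.norm(specific @ H, "fro") ** 2
    return fit + penalty

# Toy data built so that the factors reconstruct the input exactly.
rng = np.random.default_rng(0)
g, u, n, k = 6, 3, 5, 2                            # shared, unshared, cells, factors
W, V = rng.random((g, k)), rng.random((g, k))
U, H = rng.random((u, k)), rng.random((k, n))
E = (W + V) @ H
P = U @ H

loss_no_penalty = uinmf_objective(E, P, W, V, U, H, lam=0.0)
loss_with_penalty = uinmf_objective(E, P, W, V, U, H, lam=5.0)
```

In this sketch the unshared block `P` influences the shared cell loadings `H` only through `U`, which is one way to read the abstract's statement that unshared features inform the factorization.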
Funder
U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute
U.S. Department of Health & Human Services | NIH | National Institute of Allergy and Infectious Diseases
U.S. Department of Health & Human Services | NIH | National Institute of Mental Health
Publisher
Springer Science and Business Media LLC
Subject
General Physics and Astronomy; General Biochemistry, Genetics and Molecular Biology; General Chemistry; Multidisciplinary
Cited by
45 articles.