Linking cells across single-cell modalities by synergistic matching of neighborhood structure
Author:
Hristov Borislav H1,
Bilmes Jeffrey A23,
Noble William Stafford13
Affiliation:
1. Department of Genome Sciences
2. Department of Electrical Engineering
3. Paul G. Allen School of Computer Science and Engineering, University of Washington , Seattle, WA 98195, USA
Abstract
Abstract
Motivation
A wide variety of experimental methods are available to characterize different properties of single cells in a complex biosample. However, because these measurement techniques are typically destructive, researchers are often presented with complementary measurements from disjoint subsets of cells, providing a fragmented view of the cell’s biological processes. This creates a need for computational tools capable of integrating disjoint multi-omics data. Because different measurements typically do not share any features, the problem requires the integration to be done in unsupervised fashion. Recently, several methods have been proposed that project the cell measurements into a common latent space and attempt to align the corresponding low-dimensional manifolds.
Results
In this study, we present an approach, Synmatch, which produces a direct matching of the cells between modalities by exploiting information about neighborhood structure in each modality. Synmatch relies on the intuition that cells which are close in one measurement space should be close in the other as well. This allows us to formulate the matching problem as a constrained supermodular optimization problem over neighborhood structures that can be solved efficiently. We show that our approach successfully matches cells in small real multi-omics datasets and performs favorably when compared with recently published state-of-the-art methods. Further, we demonstrate that Synmatch is capable of scaling to large datasets of thousands of cells.
Availability and implementation
The Synmatch code and data used in this manuscript are available at https://github.com/Noble-Lab/synmatch.
Funder
National Institutes of Health
Publisher
Oxford University Press (OUP)
Subject
Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability
Reference31 articles.
1. Computational strategies for single-cell multi-omics integration;Adossa;Comput. Struct. Biotechnol. J,2021
2. MOFA+: a probabilistic framework for comprehensive integration of structured single-cell data;Argelaguet;Genome Biol,2020
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献