Abstract
AbstractTime-series single-cell RNA sequencing (scRNA-seq) data have opened a door to elucidate cell differentiation processes. In this context, the optimal transport (OT) theory has attracted attention to interpolate scRNA-seq data and infer the trajectories of cell differentiation. However, there remain critical issues in interpretability and computational cost. This paper presents scEGOT, a novel comprehensive trajectory inference framework for single-cell data based on entropic Gaussian mixture optimal transport (EGOT). By constructing a theory of EGOT via an explicit construction of the entropic transport plan and its connection to a continuous OT with its error estimates, EGOT is realized as a generative model with high interpretability and low computational cost, dramatically facilitating the inference of cell trajectories and dynamics from time-series data. The scEGOT framework provides comprehensive outputs from multiple perspectives, including cell state graphs, velocity fields of cell differentiation, time interpolations of single-cell data, space-time continuous videos of cell differentiation with gene expressions, gene regulatory networks, and reconstructions of Waddington’s epigenetic landscape. To demonstrate that scEGOT is a powerful and versatile tool for single-cell biology, we applied it to time-series scRNA-seq data of the human primordial germ cell-like cell (human PGCLC) induction system. Using scEGOT, we precisely identified the PGCLC progenitor population and the bifurcation time of the segregation. Our analysis suggests that a known marker geneTFAP2Aalone is not sufficient to identify the PGCLC progenitor cell population, but thatNKX1-2is also required. In addition, we found thatMESP1andGATA6may also be crucial for PGCLC/somatic cell segregation.
Publisher
Cold Spring Harbor Laboratory
Reference59 articles.
1. Generalizing RNA velocity to transient cell states through dynamical modeling
2. C. Bunne , L. Papaxanthos , A. Krause , and M. Cuturi . Proximal optimal transport modeling of population dynamics. In International Conference on Artificial Intelligence and Statistics, pages 6511–6528. PMLR, 2022.
3. An extension of kakutani’s theorem on infinite product measures to the tensor product of semifinite w⇤-algebras;Transactions of the American Mathematical Society,1969
4. Computational methods for trajectory inference from single-cell transcriptomics
5. A. Castillo-Venzor , C. A. Penfold , M. D. Morgan , W. W. C. Tang , T. Kobayashi , F. C. K. Wong , S. Bergmann , E. Slatery , T. E. Boroviak , J. C. Marioni , and M. A. Surani . Origin and segregation of the human germline. preprint, Developmental Biology, July 2022.