scEGOT: Single-cell trajectory inference framework based on entropic Gaussian mixture optimal transport

Author:

Yachimura ToshiakiORCID,Wang Hanbo,Imoto Yusuke,Yoshida Momoko,Tasaki Sohei,Kojima Yoji,Yabuta Yukihiro,Saitou Mitinori,Hiraoka Yasuaki

Abstract

AbstractTime-series single-cell RNA sequencing (scRNA-seq) data have opened a door to elucidate cell differentiation processes. In this context, the optimal transport (OT) theory has attracted attention to interpolate scRNA-seq data and infer the trajectories of cell differentiation. However, there remain critical issues in interpretability and computational cost. This paper presents scEGOT, a novel comprehensive trajectory inference framework for single-cell data based on entropic Gaussian mixture optimal transport (EGOT). By constructing a theory of EGOT via an explicit construction of the entropic transport plan and its connection to a continuous OT with its error estimates, EGOT is realized as a generative model with high interpretability and low computational cost, dramatically facilitating the inference of cell trajectories and dynamics from time-series data. The scEGOT framework provides comprehensive outputs from multiple perspectives, including cell state graphs, velocity fields of cell differentiation, time interpolations of single-cell data, space-time continuous videos of cell differentiation with gene expressions, gene regulatory networks, and reconstructions of Waddington’s epigenetic landscape. To demonstrate that scEGOT is a powerful and versatile tool for single-cell biology, we applied it to time-series scRNA-seq data of the human primordial germ cell-like cell (human PGCLC) induction system. Using scEGOT, we precisely identified the PGCLC progenitor population and the bifurcation time of the segregation. Our analysis suggests that a known marker geneTFAP2Aalone is not sufficient to identify the PGCLC progenitor cell population, but thatNKX1-2is also required. In addition, we found thatMESP1andGATA6may also be crucial for PGCLC/somatic cell segregation.

Publisher

Cold Spring Harbor Laboratory

Reference59 articles.

1. Generalizing RNA velocity to transient cell states through dynamical modeling

2. C. Bunne , L. Papaxanthos , A. Krause , and M. Cuturi . Proximal optimal transport modeling of population dynamics. In International Conference on Artificial Intelligence and Statistics, pages 6511–6528. PMLR, 2022.

3. An extension of kakutani’s theorem on infinite product measures to the tensor product of semifinite w⇤-algebras;Transactions of the American Mathematical Society,1969

4. Computational methods for trajectory inference from single-cell transcriptomics

5. A. Castillo-Venzor , C. A. Penfold , M. D. Morgan , W. W. C. Tang , T. Kobayashi , F. C. K. Wong , S. Bergmann , E. Slatery , T. E. Boroviak , J. C. Marioni , and M. A. Surani . Origin and segregation of the human germline. preprint, Developmental Biology, July 2022.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3