Joint dimension reduction and clustering analysis of single-cell RNA-seq and spatial transcriptomics data

Author:

Liu Wei12ORCID,Liao Xu2,Yang Yi2,Lin Huazhen3,Yeong Joe45,Zhou Xiang6ORCID,Shi Xingjie17,Liu Jin2ORCID

Affiliation:

1. Academy of Statistics and Interdisciplinary Sciences, East China Normal University , Shanghai, 200062, China

2. Centre for Quantitative Medicine, Health Services & Systems Research , Duke-NUS Medical School, 169857, Singapore

3. Center of Statistical Research and School of Statistics, Southwestern University of Finance and Economics , Chengdu, 611130, China

4. Institute of Molecular and Cell Biology(IMCB), Agency of Science , Technology and Research(A*STAR), 138673, Singapore

5. Department of Anatomical Pathology , Singapore General Hospital, 169856, Singapore

6. Department of Biostatistics, University of Michigan , Ann Arbor, 48109, USA

7. Key Laboratory of Advanced Theory and Application in Statistics and Data Science-MOE, School of Statistics, East China Normal University , Shanghai, 200062, China

Abstract

Abstract Dimension reduction and (spatial) clustering is usually performed sequentially; however, the low-dimensional embeddings estimated in the dimension-reduction step may not be relevant to the class labels inferred in the clustering step. We therefore developed a computation method, Dimension-Reduction Spatial-Clustering (DR-SC), that can simultaneously perform dimension reduction and (spatial) clustering within a unified framework. Joint analysis by DR-SC produces accurate (spatial) clustering results and ensures the effective extraction of biologically informative low-dimensional features. DR-SC is applicable to spatial clustering in spatial transcriptomics that characterizes the spatial organization of the tissue by segregating it into multiple tissue structures. Here, DR-SC relies on a latent hidden Markov random field model to encourage the spatial smoothness of the detected spatial cluster boundaries. Underlying DR-SC is an efficient expectation-maximization algorithm based on an iterative conditional mode. As such, DR-SC is scalable to large sample sizes and can optimize the spatial smoothness parameter in a data-driven manner. With comprehensive simulations and real data applications, we show that DR-SC outperforms existing clustering and spatial clustering methods: it extracts more biologically relevant features than conventional dimension reduction methods, improves clustering performance, and offers improved trajectory inference and visualization for downstream trajectory inference analyses.

Funder

Ministry of Education, Singapore

Natural Science Foundation of China

Natural Science Foundation of Shanghai

Publisher

Oxford University Press (OUP)

Subject

Genetics

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3