Author:
Li Huimin,Jiang Xi,Guo Lei,Xie Yang,Xu Lin,Li Qiwei
Abstract
SummaryRecent breakthroughs in spatially resolved transcriptomics (SRT) technologies have enabled comprehensive molecular characterization at the spot or cellular level while preserving spatial information. Cells are the fundamental building blocks of tissues, organized into distinct yet connected components. Although many non-spatial and spatial clustering approaches have been used to partition the entire region into mutually exclusive spatial domains based on the SRT high-dimensional molecular profile, most require an ad-hoc selection of less interpretable dimensional-reduction techniques. To overcome this challenge, we propose a zero-inflated negative binomial mixture model to cluster spots or cells based on their molecular profiles. To increase interpretability, we employ a feature selection mechanism to provide a low-dimensional summary of the SRT molecular profile in terms of discriminating genes that shed light on the clustering result. We further incorporate the SRT geospatial profileviaa Markov random field prior. We demonstrate how this joint modeling strategy improves clustering accuracy, compared with alternative state-of-the-art approaches, through simulation studies and two real data applications.
Publisher
Cold Spring Harbor Laboratory
Reference39 articles.
1. Blondel, V. D. , Guillaume, J.-L. , Lambiotte, R. , and Lefebvre, E. (2008). Fast unfolding of communities in large networks. Journal of statistical mechanics: theory and experiment 2008, P10008.
2. Boots, B. , Sugihara, K. , Chiu, S. N. , and Okabe, A. (2009). Spatial tessellations: concepts and applications of Voronoi diagrams.
3. Model-based clustering for expression data via a Dirichlet process mixture model;Bayesian inference for gene expression and proteomics,2006
4. Spatial and temporal diversity in genomic instability processes defines lung cancer evolution