Abstract
AbstractBiological techniques for spatially resolved transcriptomics (SRT) have advanced rapidly in both throughput and spatial resolution for a single spatial location. This progress necessitates the development of efficient and scalable spatial dimension reduction methods that can handle large-scale SRT data from multiple sections. Here, we developed FAST as a fast and efficient generalized probabilistic factor analysis for spatially aware dimension reduction, which simultaneously accounts for the count nature of SRT data and extracts a low-dimensional representation of SRT data across multiple sections, while preserving biological effects with consideration of spatial smoothness among nearby locations. Compared with existing methods, FAST uniquely models the count data across multiple sections while using a local spatial dependence with scalable computational complexity. Using both simulated and real datasets, we demonstrated the improved correlation between FAST estimated embeddings and annotated cell/domain types. Furthermore, FAST exhibits remarkable speed, with only FAST being applicable to analyze a mouse embryo Stereo-seq dataset with >2.3 million locations in only 2 hours. More importantly, FAST identified the differential activities of immune-related transcription factors between tumor and non-tumor clusters and also predicted a carcinogenesis factorCCNHas the upstream regulator of differentially expressed genes in a breast cancer Xenium dataset.
Publisher
Cold Spring Harbor Laboratory
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献