Identification of Stem Cells from Large Cell Populations with Topological Scoring

Published: 2020-04-09

Author:

Sardiu Mihaela E., Andrew Box C., Haug Jeff, Washburn Michael P.

Abstract

Machine learning and topological analysis methods are increasingly used on large-scale omics datasets. Modern high-dimensional flow cytometry datasets share many features with other omics datasets, such as those from genomics and proteomics. For example, genomics and proteomics datasets can be sparse and high-dimensional, and flow cytometry datasets can share these features. This makes flow cytometry data a potentially suitable candidate for machine learning and topological scoring strategies, for example, to gain novel insights into patterns within the data. We have previously developed the Topological Score (TopS) and implemented it for the analysis of quantitative protein interaction network datasets. Here we show that the TopS approach for large-scale data analysis is applicable to a previously described flow cytometry sorted human hematopoietic stem cell dataset. We demonstrate that TopS is capable of effectively sorting this dataset into cell populations and identifying rare cell populations. We demonstrate the utility of TopS when coupled with multiple approaches including topological data analysis, X-shift clustering, and t-Distributed Stochastic Neighbor Embedding (t-SNE). Our results suggest that TopS could be effectively used to analyze large-scale flow cytometry datasets to find rare cell populations.

Publisher

Cold Spring Harbor Laboratory
