Differential correlation across subpopulations of single cells in subtypes of acute myeloid leukemia

Author:

McGee Reginald L.ORCID,Behbehani Gregory K.,Coombes Kevin R.

Abstract

AbstractMass cytometers can record 40-50 parameters per single cell for millions of cells in a sample, and in particular, for leukemic cells. Many methods have been developed to cluster phenotypically similar cells within cytometry data, but there are fewer methods to visualize activity and interactions of pairs of proteins across these populations. We have developed a workflow for analyzing correlations associated with predfined populations. By clustering blood samples from acute myeloid leukemia (AML) patients and normal controls using an established algorithm, we obtained a minimum spanning tree of clusters of single cells. Using surface marker expression, we identified clusters on the tree that belonged to phenotypes of interest. Next, we computed correlations between pairs of proteins in each cluster. We developed a novel, coherent, probability-based statistic to test differences between vectors of correlation coefficients. By comparing all combinations of the normal controls under the statistic, we created an empirical distribution that provided a conservative measure of differential correlation. Using this empirically-derived distribution to define significance, we compared pooled samples from AML subtypes and normal controls to detect differential correlations. Given the structure present within this cytometry data set, we found it natural to consider correlations in this manner versus aggregating all data and computing a single correlation. Our results have the advantage that we can localize the statistical measure to determine contributions from particular phenotypic populations. Differentially correlated pairs of proteins can be further explored by considering a population’s distribution of correlation coefficients or biaxially plotting protein expressions within individual cells in a given population. Our approach leads to a better understanding of the nonlinear relationships that exist in the cytometry data.Author summaryWe introduce a novel method for analyzing the abundance of single cell data collected by high-throughput technologies. Due to the high dimensionality of such datasets, there is a need for methods to identify significant interactions between genes or proteins. In particular, we are interested in statistical differences between correlations of proteins within populations of cells determined by traditional immunophenotyping techniques. In this paper, we have demonstrated the utility of this new framework in the case of blood samples from individuals with different subtypes of leukemia and compare to healthy controls. The motivation for this application is that differences between can illuminate potential drivers of the disease. We have illustrated an example of how intracellular events can be detected by our statistic. Finally, this method is flexible in that it can also be applied in a variety of contexts where there are vectors of correlation coefficients that need to be compared.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3