The effective number of shared dimensions: A simple method for revealing shared structure between datasets

Author:

Giaffar Hamza,Buxó Camille RullánORCID,Aoi Mikio

Abstract

AbstractA number of recent studies have sought to understand the behavior of artificial and biological neural networks by comparing representations across layers, networks and brain areas. Simultaneously, there has been growing interest in using dimensionality of a dataset as a proxy for computational complexity. At the intersection of these topics, studies exploring the dimensionality of shared computational and representational subspaces have relied on model-based methods, but a standard, model-free measure is lacking. Here we present a candidate measure for shared dimensionality that we call the effective number of shared dimensions (ENSD). The ENSD can be applied to data matrices sharing at least one dimension, reduces to the well-known participation ratio when both data sets are equivalent and has a number of other robust and intuitive mathematical properties. Notably, the ENSD can be written as a similarity metric that is a re-scaled version of centered kernel alignment (CKA) but additionally describes the dimensionality of the aligned subspaces. Unlike methods like canonical correlation analysis (CCA), the ENSD is robust to cases where data is sparse or low rank. We demonstrate its utility and computational efficiency by a direct comparison of CKA and ENSD on across-layer similarities in convolutional neural networks as well as by recovering results from recent studies in neuroscience on communication subspaces between brain regions. Finally, we demonstrate how the ENSD and its constituent statistics allow us to perform a variety of multi-modal analyses of multivariate datasets. Specifically, we use connectomic data to probe the alignment of parallel pathways in the fly olfactory system, revealing novel results in the interaction between innate and learned olfactory representations. Altogether, we show that the ENSD is an interpretable and computationally efficient model-free measure of shared dimensionality and that it can be used to probe shared structure in a wide variety of data types.

Publisher

Cold Spring Harbor Laboratory

Reference52 articles.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3