Manifold-adaptive dimension estimation revisited

Author:

Benkő Zsigmond12,Stippinger Marcell1,Rehus Roberta1,Bencze Attila1,Fabó Dániel3,Hajnal Boglárka23,Eröss Loránd G.45,Telcs András167,Somogyvári Zoltán18

Affiliation:

1. Department of Computational Sciences, Wigner Research Centre for Physics, Budapest, Hungary

2. János Szentágothai Doctoral School of Neurosciences, Semmelweis University, Budapest, Hungary

3. Epilepsy Center, Department of Neurology, National Institute of Clinical Neurosciences, Budapest, Hungary

4. Department of Functional Neurosurgery, National Institute of Clinical Neurosciences, Budapest, Hungary

5. Faculty of Information Technology and Bionics, Péter Pázmány Catholic University, Budapest, Hungary

6. Department of Computer Science and Information Theory, Faculty of Electrical Engineering and Informatics, Budapest University of Technology and Economics, Budapest, Hungary

7. Department of Quantitative Methods, Faculty of Business and Economics,, University of Pannonia, Veszprém, Hungary

8. Neuromicrosystems ltd., Budapest, Hungary

Abstract

Data dimensionality informs us about data complexity and sets limit on the structure of successful signal processing pipelines. In this work we revisit and improve the manifold adaptive Farahmand-Szepesvári-Audibert (FSA) dimension estimator, making it one of the best nearest neighbor-based dimension estimators available. We compute the probability density function of local FSA estimates, if the local manifold density is uniform. Based on the probability density function, we propose to use the median of local estimates as a basic global measure of intrinsic dimensionality, and we demonstrate the advantages of this asymptotically unbiased estimator over the previously proposed statistics: the mode and the mean. Additionally, from the probability density function, we derive the maximum likelihood formula for global intrinsic dimensionality, if i.i.d. holds. We tackle edge and finite-sample effects with an exponential correction formula, calibrated on hypercube datasets. We compare the performance of the corrected median-FSA estimator with kNN estimators: maximum likelihood (Levina-Bickel), the 2NN and two implementations of DANCo (R and MATLAB). We show that corrected median-FSA estimator beats the maximum likelihood estimator and it is on equal footing with DANCo for standard synthetic benchmarks according to mean percentage error and error rate metrics. With the median-FSA algorithm, we reveal diverse changes in the neural dynamics while resting state and during epileptic seizures. We identify brain areas with lower-dimensional dynamics that are possible causal sources and candidates for being seizure onset zones.

Funder

The BME NC TKP2020 grant of NKFIH Hungary

The BME-Artificial Intelligence FIKP grant of EMMI

The National Brain Research Program of Hungary

The National Brain Project II, NRDIO Hungary, PATTERN Group

NKFIH

Publisher

PeerJ

Subject

General Computer Science

Reference54 articles.

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3