An empirical comparison of connectivity-based distances on a graph and their computational scalability

Author:

Miasnikof Pierre1ORCID,Shestopaloff Alexander Y2,Pitsoulis Leonidas3,Ponomarenko Alexander4

Affiliation:

1. The Edward S. Rogers Sr. Department of Electrical & Computer Engineering, 10 King’s College Road, Toronto, ON, M5S 3G4, Canada and Data Sciences Institute, University of Toronto, 700 University Avenue, 17th Floor, Toronto, ON, M5G 1Z5, Canada

2. School of Mathematical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK

3. Department of Electrical Engineering and Computer Science, Aristotle University of Thessaloniki, Thessaloniki 54125, Greece

4. Laboratory of Algorithms and Technologies for Networks Analysis, National Research University Higher School of Economics, 136 Rodionova str., Nizhny Novgorod 603093, Russia and Center for Hydroacoustics and Geophysical Research Division, Institute of Applied Physics of The Russian Academy of Sciences, 46 Ulyanova str., Nizhny Novgorod 603950, Russia

Abstract

Abstract In this study, we compare distance measures with respect to their ability to capture vertex community structure and the scalability of their computation. Our goal is to find a distance measure which can be used in an aggregate pairwise minimization clustering scheme. The minimization should lead to subsets of vertices with high induced subgraph density. Our definition of distance is rooted in the notion that vertices sharing more connections are closer to each other than vertices which share fewer connections. This definition differs from that of the geodesic distance typically used in graphs. It is based on neighbourhood overlap, not shortest path. We compare four distance measures from the literature and evaluate their accuracy in reflecting intra-cluster density, when aggregated (averaged) at the cluster level. Our tests are conducted on synthetic graphs, where clusters and intra-cluster densities are known in advance. We find that amplified commute, Otsuka–Ochiai and Jaccard distances display a consistent inverse relation to intra-cluster density. We also conclude that the computation of amplified commute distance does not scale as well to large graphs as that of the other two distances.

Funder

Fujitsu Limited and Fujitsu Consulting (Canada) Inc

Publisher

Oxford University Press (OUP)

Subject

Applied Mathematics,Computational Mathematics,Control and Optimization,Management Science and Operations Research,Computer Networks and Communications

Reference51 articles.

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Average Jaccard index of random graphs;Journal of Applied Probability;2024-02-26

2. Graph clustering with Boltzmann machines;Discrete Applied Mathematics;2024-01

3. Statistical Network Similarity;Complex Networks and Their Applications XI;2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3