Selecting a significance level in sequential testing procedures for community detection-Reference-Cited by-同舟云学术

Selecting a significance level in sequential testing procedures for community detection

Published:2023-08-01 Issue:1 Volume:8 Page:
ISSN:2364-8228
Container-title:Applied Network Science
language:en
Short-container-title:Appl Netw Sci

Author:

Ghosh Riddhi Pratim,Barnett Ian

Abstract

AbstractWhile there have been numerous sequential algorithms developed to estimate community structure in networks, there is little available guidance and study of what significance level or stopping parameter to use in these sequential testing procedures. Most algorithms rely on prespecifiying the number of communities or use an arbitrary stopping rule. We provide a principled approach to selecting a nominal significance level for sequential community detection procedures by controlling the tolerance ratio, defined as the ratio of underfitting and overfitting probability of estimating the number of clusters in fitting a network. We introduce an algorithm for specifying this significance level from a user-specified tolerance ratio, and demonstrate its utility with a sequential modularity maximization approach in a stochastic block model framework. We evaluate the performance of the proposed algorithm through extensive simulations and demonstrate its utility in controlling the tolerance ratio in single-cell RNA sequencing clustering by cell type and by clustering a congressional voting network.

Funder

National Institute of Mental Health

Publisher

Springer Science and Business Media LLC

Subject

Computational Mathematics,Computer Networks and Communications,Multidisciplinary

Link

https://link.springer.com/content/pdf/10.1007/s41109-023-00567-2.pdf

Reference40 articles.

1. Albert R, Jeong H, Barabási A-L (1999) Diameter of the world-wide web. Nature 401(6749):130–131

2. Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B (Methodol) 57(1):289–300

3. Bickel PJ, Sarkar P (2015) Hypothesis testing for automated community detection in networks. J R Stat Soc Ser B (Stat Methodol) 1(78):253–273

4. Bickel PJ, Sarkar P (2016) Hypothesis testing for automated community detection in networks. J R Stat Soc Ser B (Stat Methodol) 78(1):253–273

5. Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 2008(10):P10008

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A generalized hypothesis test for community structure in networks;Network Science;2024-03-11