Abstract
The complexity of real-world systems means that standard statistical hypothesis testing methods may not be adequate for such applications. Specifically, we show that the null distribution of the likelihood-ratio (LR) test needs to be modified to accommodate the complexity found in multi-edge network data. When working with independent observations, the p-values of LR tests are approximated using a χ² distribution. However, this approximation should not be used for multi-edge network data, which is characterized by multiple correlations and competitions that make the standard approximation unsuitable. We address the problem by providing a better approximation of the LR test null distribution through a beta distribution. Finally, we empirically show that even for a small multi-edge network, the standard χ² approximation provides erroneous results, while the proposed beta approximation yields the correct p-value estimation.
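For orientation, the standard χ² approximation that the abstract refers to can be sketched for the simplest independent-observation case. This is only an illustration of the classical approximation, not the beta approximation proposed in the paper and not a multi-edge network model. The binomial setting and all function names (`binomial_loglik`, `lr_statistic`, `chi2_sf_df1`, `lr_test_pvalue`) are assumptions introduced here. The sketch tests a single success probability against a fixed null value, so the LR statistic is compared with a χ² distribution with one degree of freedom.

```python
import math


def binomial_loglik(k, n, p):
    """Log-likelihood of k successes in n independent trials.

    The binomial coefficient is dropped because it cancels in the ratio.
    """
    ll = 0.0
    if k > 0:
        ll += k * math.log(p)
    if n - k > 0:
        ll += (n - k) * math.log(1.0 - p)
    return ll


def lr_statistic(k, n, p0):
    """LR statistic 2 * (loglik at the MLE - loglik at the null value p0)."""
    p_hat = k / n  # maximum-likelihood estimate under the alternative
    stat = 2.0 * (binomial_loglik(k, n, p_hat) - binomial_loglik(k, n, p0))
    return max(0.0, stat)  # guard against tiny negative rounding errors


def chi2_sf_df1(x):
    """Survival function of a chi-squared variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(x / 2.0))


def lr_test_pvalue(k, n, p0):
    """Approximate p-value, assuming independent observations (illustrative)."""
    return chi2_sf_df1(lr_statistic(k, n, p0))


if __name__ == "__main__":
    # Hypothetical data: 60 successes in 100 trials, null value p0 = 0.5.
    print(lr_test_pvalue(60, 100, 0.5))
```

The approximation above relies on the observations being independent, which is exactly the assumption the abstract says fails for multi-edge network data.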
Subject
Artificial Intelligence, Computer Networks and Communications, Computer Science Applications, Information Systems