Near-Optimal and Practical Algorithms for Graph Scan Statistics with Connectivity Constraints-Reference-Cited by-同舟云学术

Near-Optimal and Practical Algorithms for Graph Scan Statistics with Connectivity Constraints

Published:2019-04-30 Issue:2 Volume:13 Page:1-33
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Cadena Jose¹^ORCID,Chen Feng²,Vullikanti Anil¹

Affiliation:

1. Dept. of Computer Science and Biocomplexity Institute, Virginia Tech

2. Dept. of Computer Science, University of Albany - SUNY

Abstract

One fundamental task in network analysis is detecting “hotspots” or “anomalies” in the network; that is, detecting subgraphs where there is significantly more activity than one would expect given historical data or some baseline process. Scan statistics is one popular approach used for anomalous subgraph detection. This methodology involves maximizing a score function over all connected subgraphs, which is a challenging computational problem. A number of heuristics have been proposed for these problems, but they do not provide any quality guarantees. Here, we propose a framework for designing algorithms for optimizing a large class of scan statistics for networks, subject to connectivity constraints. Our algorithms run in time that scales linearly on the size of the graph and depends on a parameter we call the “effective solution size,” while providing rigorous approximation guarantees. In contrast, most prior methods have super-linear running times in terms of graph size. Extensive empirical evidence demonstrates the effectiveness and efficiency of our proposed algorithms in comparison with state-of-the-art methods. Our approach improves on the performance relative to all prior methods, giving up to over 25% increase in the score. Further, our algorithms scale to networks with up to a million nodes, which is 1--2 orders of magnitude larger than all prior applications.

Funder

National Science Foundation

U.S. Department of Energy

Defense Threat Reduction Agency

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3309712

Reference53 articles.

1. Spatial scan statistics

2. oddball: Spotting Anomalies in Weighted Graphs

3. Graph based anomaly detection and description: a survey

4. Color-coding

5. Accuracy Evaluation of the Unified P-Value from Combining Correlated P-Values

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Method for Anomaly Detection in Power Energy Topology Graph Data Based on Domain Knowledge Graph and Graph Neural Networks;2024 IEEE 10th Conference on Big Data Security on Cloud (BigDataSecurity);2024-05-10

2. Fast calculation of p-values for one-sided Kolmogorov-Smirnov type statistics;Computational Statistics & Data Analysis;2023-09

3. NetMix2: A Principled Network Propagation Algorithm for Identifying Altered Subnetworks;Journal of Computational Biology;2022-12-01

4. Public transportation network scan for rapid surveillance;Biostatistics & Epidemiology;2022-06-02

5. Public transportation network scan for rapid surveillance;Biostatistics & Epidemiology;2022-05-27