Network Sampling

Published:2014-06 Issue:2 Volume:8 Page:1-56
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Ahmed Nesreen K.¹, Neville Jennifer¹, Kompella Ramana¹

Affiliation:

1. Purdue University, West Lafayette, IN USA

Abstract

Network sampling is integral to the analysis of social, information, and biological networks. Since many real-world networks are massive in size, continuously evolving, and/or distributed in nature, the network structure is often sampled in order to facilitate study. For these reasons, a more thorough and complete understanding of network sampling is critical to support the field of network science. In this paper, we outline a framework for the general problem of network sampling by highlighting the different objectives, population and units of interest, and classes of network sampling methods. In addition, we propose a spectrum of computational models for network sampling methods, ranging from the traditionally studied model based on the assumption of a static domain to a more challenging model that is appropriate for streaming domains. We design a family of sampling methods based on the concept of graph induction that generalize across the full spectrum of computational models (from static to streaming) while efficiently preserving many of the topological properties of the input graphs. Furthermore, we demonstrate how traditional static sampling algorithms can be modified for graph streams for each of the three main classes of sampling methods: node, edge, and topology-based sampling. Experimental results indicate that our proposed family of sampling methods more accurately preserve the underlying properties of the graph in both static and streaming domains. Finally, we study the impact of network sampling algorithms on the parameter estimation and performance evaluation of relational classification algorithms.

Funder

Division of Information and Intelligent Systems

Army Research Office

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/2601438

Reference: 109 articles.

1. The political blogosphere and the 2004 U.S. election

2. On dense pattern mining in graph streams

3. Outlier detection in graph streams

Cited by 151 articles.

1. Graph-Guided Bayesian Factor Model for Integrative Analysis of Multi-modal Data with Noisy Network Information;Statistics in Biosciences;2024-08-11

2. Link prediction accuracy on real-world networks under non-uniform missing-edge patterns;PLOS ONE;2024-07-18

3. Sampling unknown large networks restricted by low sampling rates;Scientific Reports;2024-06-10

4. Weighted Jump in Random Walk graph sampling;Neurocomputing;2024-06

5. A spanning tree approach to social network sampling with degree constraints;Social Network Analysis and Mining;2024-05-18