Affiliation:
1. Hong Kong University of Science and Technology
2. Yahoo Labs, Barcelona
Abstract
Data in several applications can be represented as an uncertain graph whose edges are labeled with a probability of existence. Exact query processing on uncertain graphs is prohibitive for most applications, as it involves evaluation over an exponential number of instantiations. Thus, typical approaches employ Monte-Carlo sampling, which (i) draws a number of possible graphs (samples), (ii) evaluates the query on each of them, and (iii) aggregates the individual answers to generate the final result. However, this approach can also be extremely time consuming for large uncertain graphs commonly found in practice. To facilitate efficiency, we study the problem of extracting a single representative instance from an uncertain graph. Conventional processing techniques can then be applied on this representative to closely approximate the result on the original graph.
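The three Monte-Carlo steps described above can be sketched as follows. This is only an illustrative sketch, not the paper's code: it assumes an undirected uncertain graph given as a list of hypothetical `(u, v, probability)` triples with independent edges, and it uses source-to-target reachability as a stand-in query. The function names and the `num_samples` and `seed` parameters are assumptions made for the example.

```python
import random
from collections import deque

def sample_world(edges, rng):
    """Step (i): draw one possible graph by keeping each edge
    independently with its existence probability."""
    return [(u, v) for (u, v, p) in edges if rng.random() < p]

def reachable(world, source, target):
    """Step (ii): evaluate the query on one sampled graph.
    Here the query is undirected reachability, answered with BFS."""
    adj = {}
    for u, v in world:
        adj.setdefault(u, []).append(v)
        adj.setdefault(v, []).append(u)
    seen = {source}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            return True
        for nxt in adj.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return False

def mc_reachability(edges, source, target, num_samples=1000, seed=0):
    """Step (iii): aggregate the per-sample answers into an estimate
    of the probability that target is reachable from source."""
    rng = random.Random(seed)
    hits = sum(
        reachable(sample_world(edges, rng), source, target)
        for _ in range(num_samples)
    )
    return hits / num_samples
```

The cost grows with both the number of samples and the size of each sampled graph, which is the overhead that a single representative instance is meant to avoid.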
In order to maintain data utility, the representative instance should preserve structural characteristics of the uncertain graph. We start with representatives that capture the expected vertex degrees, as this is a fundamental property of the graph topology. We then generalize the notion of vertex degree to the concept of n-clique cardinality, that is, the number of cliques of size n that contain a vertex. For the first problem, we propose two methods: Average Degree Rewiring (ADR), which is based on random edge rewiring, and Approximate B-Matching (ABM), which applies graph matching techniques. For the second problem, we develop a greedy approach and a game-theoretic framework. We experimentally demonstrate, with real uncertain graphs, that indeed the representative instances can be used to answer, efficiently and accurately, queries based on several metrics such as shortest path distance, clustering coefficient, and betweenness centrality.
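To make the notion of expected vertex degree concrete: with independent edges, the expected degree of a vertex is the sum of the probabilities of its incident edges. The sketch below computes it and scores a candidate instance by the total absolute gap between its actual degrees and the expected ones. The scoring function is an assumption chosen for illustration (the abstract does not define the exact objective), and the names `expected_degrees` and `degree_discrepancy` are hypothetical. It implements neither ADR nor ABM.

```python
from collections import defaultdict

def expected_degrees(edges):
    """Expected degree per vertex: sum of incident edge probabilities.
    `edges` is a list of (u, v, probability) triples."""
    exp = defaultdict(float)
    for u, v, p in edges:
        exp[u] += p
        exp[v] += p
    return dict(exp)

def degree_discrepancy(edges, instance):
    """Total absolute difference between the degrees in a candidate
    instance (a list of (u, v) pairs drawn from `edges`) and the
    expected degrees. Assumed measure, for illustration only."""
    exp = expected_degrees(edges)
    deg = defaultdict(int)
    for u, v in instance:
        deg[u] += 1
        deg[v] += 1
    return sum(abs(deg[u] - exp[u]) for u in exp)
```

For instance, with edges `("a", "b", 0.5)` and `("a", "c", 0.5)`, vertex `a` has expected degree 1.0, and an instance keeping only `("a", "b")` matches `a` exactly while being off by 0.5 at each of `b` and `c`.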
Publisher
Association for Computing Machinery (ACM)
Cited by
29 articles.
1. Generic network sparsification via degree- and subgraph-based edge sampling;Information Sciences;2024-09
2. Network Sparsification via Degree- and Subgraph-based Edge Sampling;2022 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM);2022-11-10
3. An Uncertain Graph Privacy Preserving Scheme Based on Node Similarity in Social Networks;2022 IEEE 19th International Conference on Mobile Ad Hoc and Smart Systems (MASS);2022-10
4. Sage;Proceedings of the VLDB Endowment;2022-09
5. A survey on mining and analysis of uncertain graphs;Knowledge and Information Systems;2022-06-28