Affiliation:
1. Northeastern University, China
2. Northeastern University, China and Wuhan University, Wuhan, Hubei, China
3. Microsoft Research Asia
4. Hong Kong University of Science and Technology, Hong Kong, China
Abstract
Retrieving graphs containing a query graph from a large graph database is a key task in many graph-based applications, including chemical compounds discovery, protein complex prediction, and structural pattern recognition. However, graph data handled by these applications is often noisy, incomplete, and inaccurate because of the way the data is produced. In this paper, we study subgraph queries over uncertain graphs. Specifically, we consider the problem of answering threshold-based probabilistic queries over a large uncertain graph database with the possible world semantics. We prove that problem is #P-complete, therefore, we adopt a
filtering-and-verification
strategy to speed up the search. In the
filtering
phase, we use a probabilistic inverted index, PIndex, based on subgraph features obtained by an optimal feature selection process. During the
verification
phase, we develop exact and bound algorithms to validate the remaining candidates. Extensive experimental results demonstrate the effectiveness of the proposed algorithms.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
34 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献