Efficient query processing on graph databases-Reference-Cited by-同舟云学术

Efficient query processing on graph databases

Published:2009-04 Issue:1 Volume:34 Page:1-48
ISSN:0362-5915
Container-title:ACM Transactions on Database Systems
language:en
Short-container-title:ACM Trans. Database Syst.

Author:

Cheng James¹,Ke Yiping²,Ng Wilfred³

Affiliation:

1. Nanyang Technological University, Singapore

2. The Chinese University of Hong Kong, New Territories, Hong Kong

3. The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong

Abstract

We study the problem of processing subgraph queries on a database that consists of a set of graphs. The answer to a subgraph query is the set of graphs in the database that are supergraphs of the query. In this article, we propose an efficient index, FG*-index , to solve this problem. The cost of processing a subgraph query using most existing indexes mainly consists of two parts: the index probing cost and the candidate verification cost. Index probing is to find the query in the index, or to find the graphs from which we can generate a candidate answer set for the query. Candidate verification is to test whether each graph in the candidate set is indeed a supergraph of the query. We design FG*-index to minimize these two costs as follows. FG*-index consists of three components: the FG-index , the feature-index , and the FAQ-index . First, the FG-index employs the concept of Frequent subGraph ( FG ) to allow the set of queries that are FGs to be answered without candidate verification. We call this set of queries FG-queries . We can enlarge the set of FG-queries so that more queries can be answered without candidate verification; however, a larger set of FG-queries implies a larger FG-index and hence the index probing cost also increases. We propose the feature-index to reduce the index probing cost. The feature-index uses features to filter false results that are matched in the FG-index, so that we can quickly find the truly matching graphs for a query. For processing non-FG-queries, we propose the FAQ-index, which is dynamically constructed from the set of Frequently Asked non-FG-Queries ( FAQs ). Using the FAQ-index, verification is not required for processing FAQs and only a small number of candidates need to be verified for processing non-FG-queries that are not frequently asked . Finally, a comprehensive set of experiments verifies that query processing using FG*-index is up to orders of magnitude more efficient than state-of-the-art indexes and it is also more scalable.

Funder

Research Grants Council, University Grants Committee, Hong Kong

Publisher

Association for Computing Machinery (ACM)

Subject

Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/1508857.1508859

Reference39 articles.

1. D(k)-index

2. \delta-Tolerance Closed Frequent Itemsets

3. Effective elimination of redundant association rules

4. Maintaining frequent closed itemsets over a sliding window

Cited by 35 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Fast Subgraph Search with Graph Code Indices;Lecture Notes in Computer Science;2024

2. Distributed Subgraph Query Processing Using Filtering Scores on Spark;Electronics;2023-08-29

3. SubTempora: A Hybrid Approach for Optimising Subgraph Searching;Communications in Computer and Information Science;2023

4. GO-DEVS: Storage and Retrieval System for DEVS Models Using Graph and Ontology Representation;Sensors;2021-10-12

5. Similar Supergraph Search Based on Graph Edit Distance;Algorithms;2021-07-27