Affiliation:
1. Fudan University
2. Harbin Institute of Technology
3. Microsoft Research Asia
Abstract
The ability to handle large scale graph data is crucial to an increasing number of applications. Much work has been dedicated to supporting basic graph operations such as subgraph matching, reachability, regular expression matching, etc. In many cases, graph indices are employed to speed up query processing. Typically, most indices require either super-linear indexing time or super-linear indexing space. Unfortunately, for very large graphs, super-linear approaches are almost always infeasible. In this paper, we study the problem of subgraph matching on billion-node graphs. We present a novel algorithm that supports efficient subgraph matching for graphs deployed on a distributed memory store. Instead of relying on super-linear indices, we use efficient graph exploration and massive parallel computing for query processing. Our experimental results demonstrate the feasibility of performing subgraph matching on web-scale graph data.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
226 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. ArcMatch: high-performance subgraph matching for labeled graphs by exploiting edge domains;Data Mining and Knowledge Discovery;2024-08-07
2. GPU-accelerated relaxed graph pattern matching algorithms;The Journal of Supercomputing;2024-06-16
3. Understanding High-Performance Subgraph Pattern Matching: A Systems Perspective;Proceedings of the 7th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA);2024-06-09
4. From Motif to Path: Connectivity and Homophily;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13
5. LearnSC: An Efficient and Unified Learning-Based Framework for Subgraph Counting Problem;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13