Fast Continuous Subgraph Matching over Streaming Graphs via Backtracking Reduction

Author:

Yang Rongjian (1), Zhang Zhijie (1), Zheng Weiguo (1), Yu Jeffrey Xu (2)

Affiliation:

1. Fudan University, Shanghai, China

2. The Chinese University of Hong Kong, Hong Kong, China

Abstract

Streaming graphs are drawing increasing attention in both academic and industrial communities, as many graphs in real applications evolve over time. Continuous subgraph matching (CSM for short) aims to report the incremental matches of a query graph in such streaming graphs. Answering CSM involves two major steps: candidate maintenance and incremental match generation. Over the course of continuous subgraph matching, incremental match generation, which backtracks over the search space, dominates the total cost. However, most previous approaches focus on techniques for efficient candidate maintenance, while incremental match generation receives less attention despite its importance in CSM. Aiming to minimize the overall cost, we propose two techniques to reduce backtracking. First, we present a cost-effective index, CaLiG, that yields tighter candidate maintenance and thereby shrinks the search space of backtracking. Second, we develop a novel incremental matching paradigm, KSS, that decomposes the query vertices into conditional kernel vertices and shell vertices. Given the matches of the kernel vertices, the incremental matches can be produced immediately by joining the candidates of the shell vertices without any backtracking. Owing to the reduced backtracking, the elapsed time of CSM decreases significantly. Extensive experiments over real graphs show that our method runs orders of magnitude faster than the state-of-the-art algorithm.
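The kernel-and-shell idea can be illustrated with a minimal sketch. It is not the authors' KSS implementation. It assumes that shell vertices are pairwise non-adjacent in the query, so each shell vertex is constrained only by its matched kernel neighbors; the abstract does not spell out this condition. The function names, the adjacency-dictionary layout, and the toy graphs are all hypothetical. Under that assumption, the candidates of a shell vertex are the common data-graph neighbors of its matched kernel neighbors, and full matches come from a product over those candidate lists with a distinctness check instead of a recursive search.

```python
from itertools import product

def shell_candidates(query_adj, data_adj, labels_q, labels_g, kernel_match, shell):
    """Candidates per shell vertex, given a match of the kernel vertices.

    Assumption (not from the source): every query neighbor of a shell
    vertex is a kernel vertex, i.e. shell vertices are pairwise non-adjacent.
    """
    used = set(kernel_match.values())
    cands = {}
    for u in shell:
        # Data neighbors of each matched kernel neighbor of u.
        neighbor_sets = [data_adj[kernel_match[k]] for k in query_adj[u]]
        common = set.intersection(*neighbor_sets) if neighbor_sets else set()
        cands[u] = sorted(
            v for v in common if labels_g[v] == labels_q[u] and v not in used
        )
    return cands

def expand_kernel_match(query_adj, data_adj, labels_q, labels_g, kernel_match, shell):
    """Join shell candidates into full matches without recursive backtracking."""
    cands = shell_candidates(query_adj, data_adj, labels_q, labels_g, kernel_match, shell)
    order = sorted(shell)
    matches = []
    for combo in product(*(cands[u] for u in order)):
        if len(set(combo)) == len(combo):  # keep the mapping injective
            full = dict(kernel_match)
            full.update(zip(order, combo))
            matches.append(full)
    return matches

# Toy query: u0 (label A) connected to u1 (label B) and u2 (label C).
query_adj = {"u0": {"u1", "u2"}, "u1": {"u0"}, "u2": {"u0"}}
labels_q = {"u0": "A", "u1": "B", "u2": "C"}

# Toy data graph: v0 (A) connected to v1 (B), v2 (B) and v3 (C).
data_adj = {"v0": {"v1", "v2", "v3"}, "v1": {"v0"}, "v2": {"v0"}, "v3": {"v0"}}
labels_g = {"v0": "A", "v1": "B", "v2": "B", "v3": "C"}

# Kernel = {u0}, shell = {u1, u2}; the kernel match u0 -> v0 is given.
matches = expand_kernel_match(
    query_adj, data_adj, labels_q, labels_g, {"u0": "v0"}, ["u1", "u2"]
)
for m in matches:
    print(m)
```

In this toy case the single kernel match expands into two full matches, one for each B-labeled neighbor of v0. The cost of the join depends on the sizes of the shell candidate lists, which is why tighter candidate maintenance, as the abstract attributes to CaLiG, would also benefit this step.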

Funder

The Research Grants Council of Hong Kong, China

Publisher

Association for Computing Machinery (ACM)

