GraphMineSuite

Author:

Besta Maciej1,Vonarburg-Shmaria Zur1,Schaffner Yannick1,Schwarz Leonardo1,Kwasniewski Grzegorz1,Gianinazzi Lukas1,Beranek Jakub2,Janda Kacper3,Holenstein Tobias1,Leisinger Sebastian1,Tatkowski Peter1,Ozdemir Esref1,Balla Adrian1,Copik Marcin1,Lindenberger Philipp1,Konieczny Marek3,Mutlu Onur1,Hoefler Torsten1

Affiliation:

1. ETH Zurich, Zurich, Switzerland

2. VSB, Ostrava, Czech Republic

3. AGH-UST, Krakow, Poland

Abstract

We propose GraphMineSuite (GMS): the first benchmarking suite for graph mining that facilitates evaluating and constructing high-performance graph mining algorithms. First, GMS comes with a benchmark specification based on extensive literature review, prescribing representative problems, algorithms, and datasets. Second, GMS offers a carefully designed software platform for seamless testing of different fine-grained elements of graph mining algorithms, such as graph representations or algorithm subroutines. The platform includes parallel implementations of more than 40 considered baselines, and it facilitates developing complex and fast mining algorithms. High modularity is possible by harnessing set algebra operations such as set intersection and difference, which enables breaking complex graph mining algorithms into simple building blocks that can be separately experimented with. GMS is supported with a broad concurrency analysis for portability in performance insights, and a novel performance metric to assess the throughput of graph mining algorithms, enabling more insightful evaluation. As use cases, we harness GMS to rapidly redesign and accelerate state-of-the-art baselines of core graph mining problems: degeneracy reordering (by >2X), maximal clique listing (by >9×), k -clique listing (by up to 1.1×), and subgraph isomorphism (by 2.5×), also obtaining better theoretical performance bounds.

Publisher

VLDB Endowment

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Cited by 7 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Differentiating Set Intersections in Maximal Clique Enumeration by Function and Subproblem Size;Proceedings of the 38th ACM International Conference on Supercomputing;2024-05-30

2. Distributed Memory Implementation of Bron-Kerbosch Algorithm;IEEE Access;2024

3. Graph Pattern Mining Paradigms: Consolidation and Renewed Bearing;2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics (HiPC);2023-12-18

4. Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph Queries;ACM Computing Surveys;2023-09-15

5. ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations;SC22: International Conference for High Performance Computing, Networking, Storage and Analysis;2022-11

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3