Demystifying Graph Sparsification Algorithms in Graph Properties Preservation

Published: 2023-11
Issue: 3
Volume: 17
Pages: 427-440
ISSN: 2150-8097
Container title: Proceedings of the VLDB Endowment
Language: en
Short container title: Proc. VLDB Endow.

Author:

Chen Yuhan¹, Ye Haojie¹, Vedula Sanketh², Bronstein Alex², Dreslinski Ronald¹, Mudge Trevor¹, Talati Nishil¹

Affiliation:

1. University of Michigan

2. Technion

Abstract

Graph sparsification is a technique that approximates a given graph by a sparse graph containing a subset of its vertices and/or edges. An effective sparsification algorithm maintains the graph properties relevant to the downstream task while minimizing the graph's size. Graph algorithms often suffer from long execution times because of the irregularity and large size of real-world graphs. Graph sparsification can greatly reduce the run time of graph algorithms by substituting the full graph with a much smaller sparsified graph, without significantly degrading output quality. However, the interaction between the many available sparsifiers and graph properties has not been widely explored, and the potential of graph sparsification is not fully understood. In this work, we cover 16 widely used graph metrics, 12 representative graph sparsification algorithms, and 14 real-world input graphs spanning various categories and exhibiting diverse characteristics, sizes, and densities. We developed a framework to extensively assess the performance of these sparsification algorithms against the graph metrics, and we provide insights into the results. Our study shows that no single sparsifier performs best at preserving all graph properties; for example, sparsifiers that preserve distance-related graph properties (eccentricity) struggle to perform well on Graph Neural Networks (GNNs). This paper presents a comprehensive experimental study evaluating the performance of sparsification algorithms in preserving essential graph metrics. The insights inform future research on pairing graph algorithms with matching sparsification methods to maximize benefits while minimizing quality degradation. Furthermore, we provide a framework to facilitate the future evaluation of evolving sparsification algorithms, graph metrics, and ever-growing graph data.
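As a rough illustration of the idea the abstract describes (replace a graph with a sparser one, then check how well a property survives), the following minimal sketch uses uniform random edge sampling as a stand-in sparsifier. It is not the authors' framework and not one of the paper's evaluated configurations; the helper names `random_edge_sparsify`, `connected_components`, and `mean_degree`, as well as the toy graph, are assumptions made only for this example.

```python
import random
from collections import defaultdict, deque


def random_edge_sparsify(edges, keep_ratio, seed=0):
    """Hypothetical helper: keep a uniformly random fraction of the edges."""
    rng = random.Random(seed)
    k = round(keep_ratio * len(edges))
    return rng.sample(edges, k)


def connected_components(num_vertices, edges):
    """Count connected components of an undirected graph with BFS."""
    adj = defaultdict(list)
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    seen = set()
    count = 0
    for start in range(num_vertices):
        if start in seen:
            continue
        count += 1
        seen.add(start)
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for w in adj[u]:
                if w not in seen:
                    seen.add(w)
                    queue.append(w)
    return count


def mean_degree(num_vertices, edges):
    """Average vertex degree of an undirected graph."""
    return 2 * len(edges) / num_vertices


# Toy graph (assumed for illustration): an 8-vertex ring plus two chords.
n = 8
full = [(i, (i + 1) % n) for i in range(n)] + [(0, 4), (2, 6)]
sparse = random_edge_sparsify(full, keep_ratio=0.5)

# Compare two simple properties before and after sparsification.
for name, graph in (("full", full), ("sparse", sparse)):
    print(name, len(graph), mean_degree(n, graph), connected_components(n, graph))
```

Uniform edge sampling halves the mean degree by construction but gives no guarantee about connectivity, which hints at why a sparsifier that suits one property may not suit another.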

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.14778/3632093.3632106

Cited by 2 articles.

1. Efficient Topology-aware Data Augmentation for High-Degree Graph Neural Networks;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

2. Analysis of Pruned Deep Models Trained with Neuroevolution;Proceedings of the Genetic and Evolutionary Computation Conference Companion;2024-07-14