A Hierarchical Parallel Graph Summarization Approach Based on Ranking Nodes
-
Published:2023-04-07
Issue:8
Volume:13
Page:4664
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Liu Qiang1ORCID, Wei Jiaxing1, Liu Hao1, Ji Yimu1
Affiliation:
1. School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
Abstract
Graph summarization techniques are vital in simplifying and extracting enormous quantities of graph data. Traditional static graph structure-based summarization algorithms generally follow a minimum description length (MDL) style, and concentrate on minimizing the graph storage overhead. However, these methods also suffer from incomprehensive summary dimensions and inefficiency problems. In addition, the need for graph summarization techniques often varies among different graph applications, but an ideal summary method should generally retain the important characteristics of the key nodes in the final summary graph. This paper proposes a novel method based on ranking nodes, called HRNS, that follows a hierarchical parallel graph summarization approach. The HRNS first preprocesses the node ranking using a hybrid weighted importance strategy, and introduces the node importance factor into traditional MDL-based summarization algorithms; it then leverages a hierarchical parallel process to accelerate the summary computation. The experimental results obtained using both real and simulated datasets show that HRNS can efficiently extract nodes with high importance, and that the average importance over six datasets ranges from 0.107 to 0.167; thus, HRNS can achieve a significant performance gain on speedups, as the sum error ratios are also lower than the methods traditionally used.
Funder
NUPTSF National Natural Science Foundation of China Postgraduate Research and Practice Innovation Program of Jiangsu Province Natural Science Foundation of Jiangsu Province Jiangsu Planned Projects for Postdoctoral Research Funds Open Research Project of Zhejiang Lab
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference40 articles.
1. China Internet Network Information Center (2022, April 24). The 49th Statistical Report on China’s Internet Development. Available online: https://www.cnnic.com.cn/. 2. Shin, K., Ghoting, A., Kim, M., and Raghavan, H. (2019, January 13–17). SWeG: Lossless and Lossy Summarization of Web-Scale Graphs. Proceedings of the World Wide Web Conference, San Francisco, CA, USA. 3. Lee, K., Jo, H., Ko, J., Lim, S., and Shin, K. (2020, January 6–10). Ssumm: Sparse summarization of massive graphs. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Virtual Events, CA, USA. 4. Dixon, S. (2022, April 28). Number of Monthly Active Facebook Users Worldwide as of 1st Quarter 2022 (In Millions). Available online: https://www.statista.com. 5. Graph summarization methods and applications: A survey;Liu;ACM Comput. Surv. (CSUR),2018
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Amazon Product Dataset Community Detection Metrics and Algorithms;Advanced Interdisciplinary Applications of Machine Learning Python Libraries for Data Science;2023-06-30
|
|