Abstract
Graph data are pervasive worldwide, e.g., social networks, citation networks, and web graphs. A real-world graph can be huge and requires heavy computational and storage resources for processing. Various graph compression techniques have been presented to accelerate the processing time and utilize memory efficiently. SOTA approaches decompose a graph into fixed-size submatrices and compress it by applying the existing graph compression algorithm. This approach is promising if the input graph is dense. Otherwise, an optimal graph compression ratio cannot be achieved. Graphs such as those used by social networks exhibit a power-law distribution. Thus, applying compression to the fixed-size block of a matrix could lead to the empty cell processing of that matrix. In this paper, we solve the problem of ordered matrix compression on a deep level, dividing the block into sub-blocks to achieve the best compression ratio. We observe that the ordered matrix compression ratio could be improved by adopting variable-shape regions, considering both horizontal- and vertical-shaped regions. In our empirical evaluation, the proposed approach achieved a 93.8% compression ratio on average, compared with existing SOTA graph compression techniques.
Funder
Institute for Information & communications Technology Promotion(IITP) grant funded by the Korea government
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference39 articles.
1. Graph Summarization Methods and Applications: A Survey;Liu;ACM Comput. Surv.,2018
2. itri: Index-based triangle listing in massive graphs;Rasel;Inf. Sci.,2016
3. Dhulipala, L., Kabiljo, I., Karrer, B., Ottaviano, G., Pupyrev, S., and Shalita, A. (2016, January 13–17). Compressing graphs and indexes with recursive graph bisection. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA.
4. Alam, A., Umair, M., Dolgorsuren, B., Akhond, M.R., Ali, M.A., Qudus, U., and Lee, Y.K. (2018). Distributed In-Memory Granularity-Based Time-Series Graph Compression, Korean Society of Information Science and Technology. Korean Society of Information Science and Technology Academic Papers.
5. StarZIP: Streaming graph compression technique for data archiving;Dolgorsuren;IEEE Access,2019