Affiliation:
1. University of Oklahoma, Norman, OK, USA
2. Loyola University of Chicago, Chicago, IL, USA
3. California State University Dominguez Hills, Carson, CA, USA
Abstract
Time-evolving web and social network graphs are modeled as a set of pages/individuals (nodes) and their arcs (links/relationships) that change over time. Due to their popularity, they have become increasingly massive in terms of their number of nodes, arcs, and lifetimes. However, these graphs are extremely sparse throughout their lifetimes. For example, it is estimated that Facebook has over a billion vertices, yet at any point in time, it has far less than 0.001% of all possible relationships. The space required to store these large sparse graphs may not fit in most main memories using underlying representations such as a series of adjacency matrices or adjacency lists.
We propose building a compressed data structure that has a compressed binary tree corresponding to each row of each adjacency matrix of the time-evolving graph. We do not explicitly construct the adjacency matrix, and our algorithms take the time-evolving arc list representation as input for its construction. Our compressed structure allows for directed and undirected graphs, faster arc and neighborhood queries, as well as the ability for arcs and frames to be added and removed directly from the compressed structure (streaming operations). We use publicly available network data sets such as Flickr, Yahoo!, and Wikipedia in our experiments and show that our new technique performs as well or better than our benchmarks on all datasets in terms of compression size and other vital metrics.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications
Reference22 articles.
1. Jérôme Kunegis. KONECT – The Koblenz Network Collection. 2020. Point contact Wikipedia articles edited. http://konect.uni-koblenz.de/.
2. The Max Planck Institute for Software Systems. 2020. User-to-user link crawled on Flickr Social Network. http://socialnetworks.mpi-sws.org/data-www2009.html.
3. Yahoo! Research. 2020. Yahoo! Network Flows Data version 1.0. http://webscope.sandbox.yahoo.com/catalog.php?datatype=g.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献