TEA+ : A Novel Temporal Graph Random Walk Engine with Hybrid Storage Architecture

Author:

Huan Chengying1ORCID,Liu Yongchao2ORCID,Zhang Heng3ORCID,Song Shuaiwen4ORCID,Pandey Santosh5ORCID,Chen Shiyang5ORCID,Fang Xiangfei3ORCID,Jin Yue2ORCID,Lepers Baptiste6ORCID,Wu Yanjun7ORCID,Liu Hang5ORCID

Affiliation:

1. Institute of Software Chinese Academy of Sciences, Beijing, China and Rutgers University, New Brunswick, USA, and Tsinghua University, Beijing, China, and Tsinghua University, Beijing China

2. Ant Group, Hangzhou, China

3. Institute of Software Chinese Academy of Sciences, Beijing China

4. Sydney University, Sydney Australia

5. Rutgers University, New Brunswick, United States

6. Université de Neuchâtel, Neuchatel, Switzerland

7. Institute of Software, Chinese Academy of Sciences, Beijing China

Abstract

Many real-world networks are characterized by being temporal and dynamic, wherein the temporal information signifies the changes in connections, such as the addition or removal of links between nodes. Employing random walks on these temporal networks is a crucial technique for understanding the structural evolution of such graphs over time. However, existing state-of-the-art sampling methods are designed for traditional static graphs, and as such, they struggle to efficiently handle the dynamic aspects of temporal networks. This deficiency can be attributed to several challenges, including increased sampling complexity, extensive index space, limited programmability, and a lack of scalability. In this article, we introduce TEA+ , a robust, fast, and scalable engine for conducting random walks on temporal graphs. Central to TEA+ is an innovative hybrid sampling method that amalgamates two Monte Carlo sampling techniques. This fusion significantly diminishes space complexity while maintaining a fast sampling speed. Additionally, TEA+ integrates a range of optimizations that significantly enhance sampling efficiency. This is further supported by an effective graph updating strategy, skilled in managing dynamic graph modifications and adeptly handling the insertion and deletion of both edges and vertices. For ease of implementation, we propose a temporal-centric programming model, designed to simplify the development of various random walk algorithms on temporal graphs. To ensure optimal performance across storage constraints, TEA+ features a degree-aware hybrid storage architecture, capable of adeptly scaling in different memory environments. Experimental results showcase the prowess of TEA+ , as it attains up to three orders of magnitude speedups compared to current random walk engines on extensive temporal graphs.

Funder

Natural Science Foundation of China

Youth Innovation Promotion Association CAS

National Science Foundation CRII Award

NSF

CAREER Award

SOAR fellowship, University of Sydney Faculty Startup funding, Australia Research Council (ARC) Discovery Project

Ant Group through Ant Research Intern Program

Publisher

Association for Computing Machinery (ACM)

Reference51 articles.

1. Yahoo Webscope. n.d. Home Page. Retrieved March 19 2024 from http://webscope.sandbox.yahoo.com

2. Web Data Commons. n.d. The 2012 Common Crawl Graph. Retrieved March 19 2024 from http://webdatacommons.org

3. CommonGraph: Graph Analytics on Evolving Data

4. Scale-free networks are rare

5. R-MAT: A Recursive Model for Graph Mining

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3