From "think like a vertex" to "think like a graph"-Reference-Cited by-同舟云学术

From "think like a vertex" to "think like a graph"

Published:2013-11 Issue:3 Volume:7 Page:193-204
ISSN:2150-8097
Container-title:Proceedings of the VLDB Endowment
language:en
Short-container-title:Proc. VLDB Endow.

Author:

Tian Yuanyuan¹,Balmin Andrey²,Corsten Severin Andreas³,Tatikonda Shirish¹,McPherson John¹

Affiliation:

1. IBM Almaden Research Center

2. GraphSQL

3. IBM Deutschland GmbH, Germany

Abstract

To meet the challenge of processing rapidly growing graph and network data created by modern applications, a number of distributed graph processing systems have emerged, such as Pregel and GraphLab. All these systems divide input graphs into partitions, and employ a "think like a vertex" programming model to support iterative graph computation. This vertex-centric model is easy to program and has been proved useful for many graph algorithms. However, this model hides the partitioning information from the users, thus prevents many algorithm-specific optimizations. This often results in longer execution time due to excessive network messages (e.g. in Pregel) or heavy scheduling overhead to ensure data consistency (e.g. in GraphLab). To address this limitation, we propose a new "think like a graph" programming paradigm. Under this graph-centric model, the partition structure is opened up to the users, and can be utilized so that communication within a partition can bypass the heavy message passing or scheduling machinery. We implemented this model in a new system, called Giraph++, based on Apache Giraph, an open source implementation of Pregel. We explore the applicability of the graph-centric model to three categories of graph algorithms, and demonstrate its flexibility and superior performance, especially on well-partitioned data. For example, on a web graph with 118 million vertices and 855 million edges, the graph-centric version of connected component detection algorithm runs 63X faster and uses 204X fewer network messages than its vertex-centric counterpart.

Publisher

VLDB Endowment

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Link

https://dl.acm.org/doi/pdf/10.14778/2732232.2732238

Cited by 243 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Survey of Distributed Graph Algorithms on Massive Graphs;ACM Computing Surveys;2024-09-05

2. GraphScope Flex: LEGO-like Graph Computing Stack;Companion of the 2024 International Conference on Management of Data;2024-06-09

3. GraFlex: Flexible Graph Processing on FPGAs through Customized Scalable Interconnection Network;Proceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays;2024-04

4. Ingress: an automated incremental graph processing system;The VLDB Journal;2024-02-20

5. CGgraph: An Ultra-Fast Graph Processing System on Modern Commodity CPU-GPU Co-processor;Proceedings of the VLDB Endowment;2024-02