Executing Dynamic Data-Graph Computations Deterministically Using Chromatic Scheduling

Author:

Kaler Tim1,Hasenplaugh William1,Schardl Tao B.1,Leiserson Charles E.1

Affiliation:

1. MIT CSAIL, Cambridge, MA

Abstract

A data-graph computation—popularized by such programming systems as Galois, Pregel, GraphLab, PowerGraph, and GraphChi—is an algorithm that performs local updates on the vertices of a graph. During each round of a data-graph computation, an update function atomically modifies the data associated with a vertex as a function of the vertex’s prior data and that of adjacent vertices. A dynamic data-graph computation updates only an active subset of the vertices during a round, and those updates determine the set of active vertices for the next round. This article introduces P rism , a chromatic-scheduling algorithm for executing dynamic data-graph computations. P rism uses a vertex coloring of the graph to coordinate updates performed in a round, precluding the need for mutual-exclusion locks or other nondeterministic data synchronization. A multibag data structure is used by P rism to maintain a dynamic set of active vertices as an unordered set partitioned by color. We analyze P rism using work-span analysis. Let G = ( V , E ) be a degree-Δ graph colored with χ colors, and suppose that QV is the set of active vertices in a round. Define size ( Q )= | Q | + ∑ vQ deg( v ), which is proportional to the space required to store the vertices of Q using a sparse-graph layout. We show that a P -processor execution of P rism performs updates in Q using O (χ (lg ( Q/χ ) + lg Δ ) + lg P span and Θ( size ( Q ) + P ) work. These theoretical guarantees are matched by good empirical performance. To isolate the effect of the scheduling algorithm on performance, we modified GraphLab to incorporate P rism and studied seven application benchmarks on a 12-core multicore machine. P rism executes the benchmarks 1.2 to 2.1 times faster than GraphLab’s nondeterministic lock-based scheduler while providing deterministic behavior. This article also presents P rism -R, a variation of P rism that executes dynamic data-graph computations deterministically even when updates modify global variables with associative operations. P rism -R satisfies the same theoretical bounds as P rism , but its implementation is more involved, incorporating a multivector data structure to maintain a deterministically ordered set of vertices partitioned by color. Despite its additional complexity, P rism -R is only marginally slower than P rism . On the seven application benchmarks studied, P rism -R incurs a 7% geometric mean overhead relative to P rism .

Funder

NSF Graduate Research Fellowship

National Science Foundation

Intel Corporation and Foxconn Technology Group

Publisher

Association for Computing Machinery (ACM)

Subject

Computational Theory and Mathematics,Computer Science Applications,Hardware and Architecture,Modelling and Simulation,Software

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. High-performance and balanced parallel graph coloring on multicore platforms;The Journal of Supercomputing;2022-11-07

2. BiPart;Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming;2021-02-17

3. Graph Coloring on the GPU;2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2019-05

4. Brief Announcement;Proceedings of the 30th on Symposium on Parallelism in Algorithms and Architectures;2018-07-11

5. A Multicore Path to Connectomics-on-Demand;Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming;2017-01-26

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3