Affiliation:
1. University of Edinburgh University and Beihang University
2. Southwest Jiaotong University
3. University of Edinburgh and Washington State University
4. Tsinghua University
Abstract
This paper studies fundamental problems for distributed graph simulation. Given a pattern query
Q
and a graph
G
that is fragmented and distributed, a graph simulation algorithm
A
is to compute the matches
Q
(
G
) of
Q
in
G
. We say that
A
is
parallel scalable
in (a)
response time
if its parallel computational cost is determined by the largest fragment
F
m
of
G
and the size |
Q
| of query
Q
, and (b)
data shipment
if its total amount of data shipped is determined by |
Q
| and the number of fragments of
G, independent
of the size of graph
G
. (1) We prove an
impossibility theorem
: there exists
no
distributed graph simulation algorithm that is parallel scalable in
either
response time
or
data shipment. (2) However, we show that distributed graph simulation is
partition bounded, i.e.
, its response time depends only on |
Q
|, |
F
m
| and the number |
V
f
| of nodes in
G
with edges across different fragments; and its data shipment depends on |
Q
| and the number |
E
f
| of crossing edges only. We provide the first algorithms with these performance guarantees. (3) We also identify special cases of patterns and graphs when parallel scalability is possible. (4) We experimentally verify the scalability and efficiency of our algorithms.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
36 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献