Affiliation:
1. Research Directorate, National Security Agency, Fort Meade, MD 20755
Abstract
Triangle centrality is introduced for finding important vertices in a graph based on the concentration of triangles surrounding each vertex. It has the distinct feature of allowing a vertex to be central if it is in many triangles or none at all.
Given a simple, undirected graph
\(G=(V,E)\)
, with
\(n=|V|\)
vertices and
\(m=|E|\)
edges, let
\(\triangle(v)\)
and
\(\triangle(G)\)
denote the respective triangle counts of
\(v\)
and
\(G\)
. Let
\(N(v)\)
be the neighborhood set of
\(v\)
. Respectively,
\(N_{\triangle}(v)\)
and
\(N_{\triangle}[v]=\{v\}\cup N_{\triangle}(v)\)
denote the set of neighbors that are in triangles with
\(v\)
and the closed set including
\(v\)
. Then the triangle centrality for a vertex
\(v\)
is
\(TC(v) = \frac{\frac{1}{3}\sum_{u\in N_\triangle[v]} \triangle(u) + \sum_{w\in \{N(v)\setminus N_\triangle(v)\}} \triangle(w)}{\triangle(G)}.\)
We show experimentally that triangle centrality is broadly applicable to many different types of networks. Our empirical results demonstrate that 30% of the time triangle centrality identified central vertices that differed with those found by five well-known centrality measures, which suggests novelty without being overly specialized. It is also asymptotically faster to compute on sparse graphs than all but the most trivial of these other measures.
We introduce optimal algorithms that compute triangle centrality in
\(O(m\bar{\delta})\)
time and
\(O(m+n)\)
space, where
\(\bar{\delta}\leq O(\sqrt{m})\)
is the
average degeneracy
introduced by Burkhardt, Faber, and Harris (2020). In practical applications
\(\bar{\delta}\)
is much smaller than
\(\sqrt{m}\)
so triangle centrality can be computed in nearly linear-time. On a Concurrent Read Exclusive Write (CREW) Parallel Random Access Memory (PRAM) machine, we give a near work-optimal parallel algorithm that takes
\(O(\log n)\)
time using
\(O(m\sqrt{m})\)
CREW PRAM processors. In MapReduce, we show it takes four rounds using
\(O(m\sqrt{m})\)
communication bits, and is therefore optimal. We also derive a linear algebraic formulation of triangle centrality which can be computed in
\(O(m\bar{\delta})\)
time on sparse graphs.
Publisher
Association for Computing Machinery (ACM)