Affiliation:
1. MPI for Software Systems, Kaiserslautern, Germany
2. Georgia Institute of Technology, GA, USA
3. MPI for Intelligent Systems, Tübingen, Germany
Abstract
If a piece of information is released from a set of media sites, can it spread, in 1 month, to a million web pages? Can we efficiently find a small set of media sites among millions that can maximize the spread of the information, in 1 month? The two problems are called influence estimation and maximization problems respectively, which are very challenging since both the time-sensitive nature of the problems and the issue of scalability need to be addressed simultaneously. In this article, we propose two algorithms for influence estimation in continuous-time diffusion networks. The first one uses continuous-time Markov chains to estimate influence
exactly
on networks with exponential, or, more generally, phase-type transmission functions, but does not scale to large-scale networks, and the second one is a highly efficient randomized algorithm, which estimates the influence of every node in a network with general transmission functions, |ν| nodes and |ε| edges to an accuracy of ϵ using
n
=
O
(1/ϵ
2
) randomizations and up to logarithmic factors
O
(
n
|ε|+
n
|ν| computations. We then show that finding the set of most influential source nodes in a continuous time diffusion network is an NP-hard problem and develop an efficient greedy algorithm with provable near-optimal performance. When used as subroutines in the influence maximization algorithm, the exact influence estimation algorithm is guaranteed to find a set of
C
nodes with an influence of at least (1 − 1/
e
)OPT and the randomized algorithm is guaranteed to find a set with an influence of at least 1 − 1/
e
)OPT − 2
C
ε, where OPT is the optimal value. Experiments on both synthetic and real-world data show that the proposed algorithms significantly improve over previous state-of-the-art methods in terms of the accuracy of the estimated influence and the quality of the selected nodes to maximize the influence, and the randomized algorithm can easily scale up to networks of millions of nodes.
Funder
International Conference on Machine Learning
Neural Information Processing Systems
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Science Applications,General Business, Management and Accounting,Information Systems
Reference54 articles.
1. Fitting phase-type distributions via the EM algorithm;Asmussen S.;Scandinavian Journal of Statistics,1996
2. Topic-aware social influence propagation models
3. S. Bharathi D. Kempe and M. Salek. 2007. Competitive influence maximization in social networks. Internet and Network Economics (2007) 306--311. S. Bharathi D. Kempe and M. Salek. 2007. Competitive influence maximization in social networks. Internet and Network Economics (2007) 306--311.
4. Maximizing influence in a competitive social network
Cited by
64 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献