Affiliation:
1. Boston University, Boston MA, USA
2. Max Planck Institut für Informatik, Germany
Abstract
Consider a social network and suppose that we are only given the number of common friends between each pair of users. Can we reconstruct the underlying network? Similarly, consider a set of documents and the words that appear in them. If we only know the number of common words for every pair of documents, as well as the number of common documents for every pair of words, can we infer which words appear in which documents? In this article, we develop a general methodology for answering questions like these.
We formalize these questions in what we call the Reconstruct problem: given information about the common neighbors of nodes in a network, our goal is to reconstruct the hidden binary matrix that indicates the presence or absence of relationships between individual nodes. In fact, we propose two different variants of this problem: one where the number of connections of every node (i.e., the degree of every node) is known and a second one where it is unknown. We call these variants the degree-aware and the degree-oblivious versions of the Reconstruct problem, respectively.
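To make the input of the two variants concrete, the following minimal sketch builds the pairwise common-neighbor counts for a small, made-up document-word matrix. The matrix `hidden`, the variable names, and the use of NaN to mark unknown degrees are illustrative assumptions, not notation from the article.

```python
import numpy as np

# Hypothetical hidden binary matrix: rows are documents, columns are words.
hidden = np.array([
    [1, 1, 0],
    [0, 1, 1],
    [1, 1, 1],
    [1, 0, 0],
])

# Pairwise common-neighbor counts, the only information given to the solver.
row_common = hidden @ hidden.T   # common words for every pair of documents
col_common = hidden.T @ hidden   # common documents for every pair of words

# Degree-aware variant: the diagonal entries are the node degrees.
row_degrees = np.diag(row_common)

# Degree-oblivious variant: the diagonal is unknown.
# Marking it with NaN is a convention chosen for this sketch only.
row_common_oblivious = row_common.astype(float)
np.fill_diagonal(row_common_oblivious, np.nan)
```

For a binary matrix, each diagonal entry of `row_common` equals the corresponding row sum, which is why knowing the degrees amounts to knowing the diagonal.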
Our algorithms for both variants exploit the properties of the singular value decomposition of the hidden binary matrix. More specifically, we show that using the available neighborhood information, we can reconstruct the hidden matrix by finding the components of its singular value decomposition and then combining them appropriately. Our extensive experimental study suggests that our methods are able to reconstruct binary matrices of different characteristics with up to 100% accuracy.
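The idea of recovering the singular value decomposition from neighborhood information can be sketched for the degree-aware case as follows. This is an illustration under stated assumptions, not the authors' algorithm: it assumes both common-neighbor matrices are fully known (diagonals included), that the nonzero singular values are distinct, and that the matrix is small enough to try every sign combination of the singular vectors by brute force. The function names `top_eigenpairs` and `reconstruct` and the example matrix are hypothetical.

```python
import itertools
import numpy as np

def top_eigenpairs(gram, tol=1e-9):
    """Eigenvalues above tol (descending) and their eigenvectors for a symmetric PSD matrix."""
    vals, vecs = np.linalg.eigh(gram)
    order = np.argsort(vals)[::-1]
    vals, vecs = vals[order], vecs[:, order]
    keep = vals > tol
    return vals[keep], vecs[:, keep]

def reconstruct(row_common, col_common):
    """Brute-force sketch: combine eigenvectors of both matrices, searching over signs."""
    vals_u, U = top_eigenpairs(row_common)   # left singular vectors, up to sign
    vals_v, V = top_eigenpairs(col_common)   # right singular vectors, up to sign
    r = min(len(vals_u), len(vals_v))
    sigma = np.sqrt(vals_u[:r])              # singular values of the hidden matrix
    U, V = U[:, :r], V[:, :r]

    best, best_err = None, np.inf
    # Each eigenvector is only determined up to sign, so try all 2^r combinations
    # and keep the candidate that is closest to a binary matrix.
    for signs in itertools.product((1.0, -1.0), repeat=r):
        cand = (U * (sigma * np.array(signs))) @ V.T
        rounded = np.clip(np.rint(cand), 0, 1)
        err = np.abs(cand - rounded).sum()
        if err < best_err:
            best, best_err = rounded.astype(int), err
    return best

# Hypothetical hidden matrix, used here only to generate the input.
A = np.array([
    [1, 1, 0],
    [0, 1, 1],
    [1, 1, 1],
    [1, 0, 0],
])
L = A @ A.T   # common neighbors between rows
R = A.T @ A   # common neighbors between columns
B = reconstruct(L, R)
```

The returned matrix `B` is binary and reproduces both input matrices. The exhaustive sign search grows as 2^r in the rank r, so it is only meant to show why the components must be combined appropriately, not to scale.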
Funder
Division of Information and Intelligent Systems
Division of Computer and Network Systems
Publisher
Association for Computing Machinery (ACM)
Cited by
13 articles.