Affiliation:
1. Department of Statistics Universidad Nacional de Colombia Bogotá D. C. Colombia
2. Department of Statistics University of Washington Washington D. C. USA
Abstract
AbstractIn this article, we introduce a novel Bayesian approach for linking multiple social networks in order to discover the same real world person having different accounts across networks. In particular, we develop a latent model that allows us to jointly characterize the network and linkage structures relying on both relational and profile data. In contrast to other existing approaches in the machine learning literature, our Bayesian implementation naturally provides uncertainty quantification via posterior probabilities for the linkage structure itself or any function of it. Our findings clearly suggest that our methodology can produce accurate point estimates of the linkage structure even in the absence of profile information, and also, in an identity resolution setting, our results confirm that including relational data into the matching process improves the linkage accuracy. We illustrate our methodology using real data from popular social networks such as Twitter, Facebook, and YouTube.
Funder
National Science Foundation of Sri Lanka
Subject
Management Science and Operations Research,General Business, Management and Accounting,Modeling and Simulation