Linking Multiple User Identities of Multiple Services from Massive Mobility Traces
-
Published:2021-08-31
Issue:4
Volume:12
Page:1-28
-
ISSN:2157-6904
-
Container-title:ACM Transactions on Intelligent Systems and Technology
-
language:en
-
Short-container-title:ACM Trans. Intell. Syst. Technol.
Author:
Wang Huandong1,
Li Yong1,
Wang Gang2,
Jin Depeng1
Affiliation:
1. Beijing National Research Center for Information Science and Technology (BNRist), Department of Electronic Engineering, Tsinghua University, China
2. University of Illinois at Urbana-Champaign (UIUC), USA
Abstract
Understanding the linkability of online user identifiers (IDs) is critical to both service providers (for business intelligence) and individual users (for assessing privacy risks). Existing methods are designed to match IDs across
two services
but face key challenges of matching multiple services in practice, particularly when users have multiple IDs per service. In this article, we propose a novel system to link IDs across multiple services by exploring the spatial-temporal features of user activities, of which the core idea is that the same user's online IDs are more likely to repeatedly appear at the same location. Specifically, we first utilize a
contact graph
to capture the “co-location” of all IDs across multiple services. Based on this graph, we propose a set-wise matching algorithm to discover candidate ID sets and use Bayesian inference to generate confidence scores for candidate ranking, which is proved to be optimal. We evaluate our system using two real-world ground-truth datasets from an Internet service provider (4 services, 815K IDs) and Twitter-Foursquare (2 services, 770 IDs). Extensive results show that our system significantly outperforms the state-of-the-art algorithms in accuracy (AUC is higher by 0.1–0.2), and it is highly robust against data quality, matching order, and number of services.
Funder
National Key Research and Development Program of China
National Nature Science Foundation of China
Beijing Natural Science Foundation
Beijing National Research Center for Information Science and Technology
Tsinghua University - Tencent Joint Laboratory for Internet Innovation Technology
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence,Theoretical Computer Science
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献