Affiliation:
1. University of Notre Dame, IN, USA
2. Tsinghua University, Beijing, P. R. China
3. Zhejiang University, Hangzhou, P. R. China
4. Northwestern University, IL, USA
Abstract
Users with demographic profiles in social networks offer the potential to understand the social principles that underpin our highly connected world, from individuals, to groups, to societies. In this article, we harness the power of network and data sciences to model the interplay between user demographics and social behavior and further study to what extent users’ demographic profiles can be inferred from their mobile communication patterns. By modeling over 7 million users and 1 billion mobile communication records, we find that during the active dating period (i.e., 18--35 years old), users are active in broadening social connections with males and females alike, while after reaching 35 years of age people tend to keep small, closed, and same-gender social circles. Further, we formalize the demographic prediction problem of inferring users’ gender and age simultaneously. We propose a factor graph-based
WhoAmI
method to address the problem by leveraging not only the correlations between network features and users’ gender/age, but also the interrelations between gender and age. In addition, we identify a new problem—coupled network demographic prediction across multiple mobile operators—and present a coupled variant of the
WhoAmI
method to address its unique challenges. Our extensive experiments demonstrate the effectiveness, scalability, and applicability of the
WhoAmI
methods. Finally, our study finds a greater than 80% potential predictability for inferring users’ gender from phone call behavior and 73% for users’ age from text messaging interactions.
Funder
Army Research Laboratory
National Basic Research Program of China
National Science Foundation
National High-tech R8D Program
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Science Applications,General Business, Management and Accounting,Information Systems
Reference59 articles.
1. Talayeh Aledavood Eduardo López Sam G. B. Roberts Felix Reed-Tsochas Esteban Moro Egido Robin I. M. Dunbar and Jari Saramäki. 2015. Channel-specific daily patterns in mobile phone communication. CoRR abs/1507.04596 (2015). Talayeh Aledavood Eduardo López Sam G. B. Roberts Felix Reed-Tsochas Esteban Moro Egido Robin I. M. Dunbar and Jari Saramäki. 2015. Channel-specific daily patterns in mobile phone communication. CoRR abs/1507.04596 (2015).
2. Bin Bi Milad Shokouhi Michal Kosinski and Thore Graepel. 2013. Inferring the demographics of search users: Social data meets search queries. In WWW’13. 131--140. 10.1145/2488388.2488401 Bin Bi Milad Shokouhi Michal Kosinski and Thore Graepel. 2013. Inferring the demographics of search users: Social data meets search queries. In WWW’13. 131--140. 10.1145/2488388.2488401
3. Vincent D. Blondel Adeline Decuyper and Gautier Krings. 2015. A survey of results on mobile phone datasets analysis. arXiv:1502.03406 (2015). Vincent D. Blondel Adeline Decuyper and Gautier Krings. 2015. A survey of results on mobile phone datasets analysis. arXiv:1502.03406 (2015).
4. F. Calabrese L. Ferrari and V. Blondel. 2014. Urban sensing using mobile phones network data: A survey of research. ACM Comput. Surv. (2014). 10.1145/2655691 F. Calabrese L. Ferrari and V. Blondel. 2014. Urban sensing using mobile phones network data: A survey of research. ACM Comput. Surv. (2014). 10.1145/2655691
5. Deepayan Chakrabarti Stanislav Funiak Jonathan Chang and Sofus A. Macskassy. 2014. Joint inference of multiple label types in large networks. In ICML’14. 874--882. Deepayan Chakrabarti Stanislav Funiak Jonathan Chang and Sofus A. Macskassy. 2014. Joint inference of multiple label types in large networks. In ICML’14. 874--882.
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献