Taxonomy and Evaluation for Microblog Popularity Prediction

Author:

Gao Xiaofeng1ORCID,Cao Zhenhao2,Li Sha3,Yao Bin4,Chen Guihai4,Tang Shaojie5

Affiliation:

1. Shanghai Jiao Tong University, Shanghai, China

2. Shanghai Jiao Tong University, Shanghai

3. University of Illinois at Urbana-Champaign, Illinois, USA

4. Shanghai Jiao Tong University, Shanghai, P.R. China

5. University of Texas at Dallas, TX, U.S

Abstract

As social networks become a major source of information, predicting the outcome of information diffusion has appeared intriguing to both researchers and practitioners. By organizing and categorizing the joint efforts of numerous studies on popularity prediction, this article presents a hierarchical taxonomy and helps to establish a systematic overview of popularity prediction methods for microblog. Specifically, we uncover three lines of thoughts: the feature-based approach, time-series modelling, and the collaborative filtering approach and analyse them, respectively. Furthermore, we also categorize prediction methods based on their underlying rationale: whether they attempt to model the motivation of users or monitor the early responses. Finally, we put these prediction methods to test by performing experiments on real-life data collected from popular social networks Twitter and Weibo. We compare the methods in terms of accuracy, efficiency, timeliness, robustness, and bias. As far as we are concerned, there is no precedented survey aimed at microblog popularity prediction at the time of submission. By establishing a taxonomy and evaluation for the first time, we hope to provide an in-depth review of state-of-the-art prediction methods and point out directions for further research. Our evaluations show that time-series modelling has the advantage of high accuracy and the ability to improve over time. The feature-based methods using only temporal features performs nearly as well as using all possible features, producing average results. This suggests that temporal features do have strong predictive power and that power is better exploited with time-series models. On the other hand, this implies that we know little about the future popularity of an item before it is posted, which may be the focus of further research.

Funder

the National Natural Science Foundation of China

the National Key R&D Program of China

the Shanghai Science and Technology Fund

Huawei Innovation Research Program

the State Key Laboratory of Air Traffic Management System and Technology

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Reference104 articles.

1. Mohamed Ahmed Stella Spagna Felipe Huici and Saverio Niccolini. 2013. A peek into the future: Predicting the evolution of popularity in user generated content. In WSDM. 607--616. 10.1145/2433396.2433473 Mohamed Ahmed Stella Spagna Felipe Huici and Saverio Niccolini. 2013. A peek into the future: Predicting the evolution of popularity in user generated content. In WSDM. 607--616. 10.1145/2433396.2433473

2. Sultan Alzahrani Saud Alashri Anvesh Koppela Hasan Davulcu and Ismail Toroslu. 2015. A network-based model for predicting hashtag breakouts in Twitter. In SBP. 3--12. Sultan Alzahrani Saud Alashri Anvesh Koppela Hasan Davulcu and Ismail Toroslu. 2015. A network-based model for predicting hashtag breakouts in Twitter. In SBP. 3--12.

3. Predicting the Future with Social Media

4. Eytan Bakshy Jake Hofman Winter Mason and Duncan Watts. 2011. Everyone’s an influencer: Quantifying influence on Twitter. In WSDM. 65--74. 10.1145/1935826.1935845 Eytan Bakshy Jake Hofman Winter Mason and Duncan Watts. 2011. Everyone’s an influencer: Quantifying influence on Twitter. In WSDM. 65--74. 10.1145/1935826.1935845

5. Peng Bao. 2016. Modeling and predicting popularity dynamics via an influence-based self-excited Hawkes process. In CIKM. 1897--1900. 10.1145/2983323.2983868 Peng Bao. 2016. Modeling and predicting popularity dynamics via an influence-based self-excited Hawkes process. In CIKM. 1897--1900. 10.1145/2983323.2983868

Cited by 41 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3