Abstract
PurposeThis paper aims to estimate the population in a specific space from the numbers of posted tweets and their senders, using Twitter's real-time property and location information data.Design/methodology/approachThe population to be estimated was set to be the attendance at each game among the six baseball teams of the Japan Professional Baseball Pacific League held at the main stadium of each team. The relation between the attendance and Twitter data was analyzed, and regression models using Twitter data were used to estimate the attendances.FindingsThe correlation coefficient tended to be larger for the attendance and tweeting users than for the attendance and that of the number of tweets. Furthermore, the comparison and evaluation of several regression models combining Twitter data, game data and weather data for estimating the attendance showed the usefulness of Twitter data, and that using the number of tweeting users improved the accuracy of population estimation.Originality/valueWhile there are many studies on event detection or location identification using Twitter data, no study has been reported on the estimation of the population in a specific space using “time information” and “location information” characteristic of Twitter data. Using Twitter data, which contains users' messages, for estimating the population can be extended to various types of analyses, such as the analysis of feelings and opinions of the groups in the space.
Subject
Library and Information Sciences,Information Systems
Reference9 articles.
1. Twitter catches the flu: detecting influenza epidemics using Twitter,2011
2. Twitter mood predicts the stock market;Journal of Computational Science,2011
3. A study of effectively finding tweets with location information,2011
4. Behavioral analysis of social media users based on DNS queries,2014
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献