Abstract
AbstractGoogle search is ubiquitous, and Google Trends (GT) is a potentially useful access point for big data on many topics the world over. We propose a new ‘variance-in-time’ method for forecasting events using GT. By collecting multiple and overlapping samples of GT data over time, our algorithm leverages variation both in the mean and the variance of a search term in order to accommodate some idiosyncracies in the GT platform. To elucidate our approach, we use it to forecast protests in the United States. We use data from the Crowd Counting Consortium between 2017 and 2019 to build a sample of true protest events as well as a synthetic control group where no protests occurred. The model’s out-of-sample forecasts predict protests with higher accuracy than extant work using structural predictors, high frequency event data, or other sources of big data such as Twitter. Our results provide new insights into work specifically on political protests, while providing a general approach to GT that should be useful to researchers of many important, if rare, phenomena.
Publisher
Cambridge University Press (CUP)
Subject
Political Science and International Relations,Sociology and Political Science
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献