Affiliation:
1. Biomedical Informatics and Digital Health, School of Medical Sciences, Faculty of Medicine and Health, The University of Sydney, Australia
Abstract
Franzén (2023 : 1) has warned of ‘big problems’ when researchers attempt to use Google Trends (GT) data. The evidence she provides is examined, additional evidence is obtained and analysed, and a new set of conclusions are derived. The anomalies previously encountered are due to a combination of factors, but can be explained by noting that Google samples its data to provide GT results, these data are also scaled, which can exacerbate variation between samples, and Denmark is a small country and Jakob Scharf a low-probability search term, both of which would increase variation in search probabilities provided by GT. When multiple samples are obtained and aggregated (medians are best for low-probability search terms), this variation is controlled, and a stable time series is derived. Researchers should not see GT as an easy source of data, but should do the work required to understand the data, and should use it, and interpret their results, within the limitations inherent in these data. It is important to aggregate multiple samples (preferably with the median for each time point) in order to obtain more stable estimates from GT.
Subject
Sociology and Political Science
Reference10 articles.
1. Anonymous (2019) Google Trends – exact same keywords and timeline – inconsistent trends provided. Available at: https://support.google.com/google-ads/thread/8389370?msgid=26184434 (accessed 23 February 2023).
2. Experimental Evaluation of Scale, and Patterns of Systematic Inconsistencies in Google Trends Data
3. Predicting the Present with Google Trends