Affiliation:
1. Heidelberg University, Germany
2. Ulm University, Germany
3. University of Stuttgart, Germany
Abstract
Studying political opinions of citizens stands as a fundamental pursuit for both policymakers and researchers. While traditional surveys remain the primary method to investigate individual political opinions, the advent of social media data (SMD) offers novel prospects. However, the number of studies using SMD to extract individuals’ political opinions are limited and differ greatly in their methodological approaches and levels of success. Recent studies highlight the benefits of analyzing individuals’ social media network structure to estimate political opinions. Nevertheless, current methodologies exhibit limitations, including the use of simplistic linear models and a predominant focus on samples from the United States. Addressing these issues, we employ an unsupervised Variational Autoencoder (VAE) machine learning model to extract individual opinion estimates from SMD of N = 276 008 German Twitter (now called ’X’) users, compare its performance to a linear model and validate model estimates on self-reported opinion measures. Our findings suggest that the VAE captures Twitter users’ network structure more precisely, leading to higher accuracy in following decision predictions and associations with self-reported political ideology and voting intentions. Our study emphasizes the need for advanced analytical approaches capable to capture complex relationships in social media networks when studying political opinion, at least in non-US contexts.