Author:
Chen Jinghui,Mizuno Takayuki,Doi Shohei
Abstract
Traditional monolingual word embedding models transform words into high-dimensional vectors which represent semantics relations between words as relationships between vectors in the high-dimensional space. They serve as productive tools to interpret multifarious aspects of the social world in social science research. Building on the previous research which interprets multifaceted meanings of words by projecting them onto word-level dimensions defined by differences between antonyms, we extend the architecture of establishing word-level cultural dimensions to the sentence level and adopt a Language-agnostic BERT model (LaBSE) to detect position similarities in a multi-language environment. We assess the efficacy of our sentence-level methodology using Twitter data from US politicians, comparing it to the traditional word-level embedding model. We also adopt Latent Dirichlet Allocation (LDA) to investigate detailed topics in these tweets and interpret politicians' positions from different angles. In addition, we adopt Twitter data from Spanish politicians and visualize their positions in a multi-language space to analyze position similarities across countries. The results show that our sentence-level methodology outperform traditional word-level model. We also demonstrate that our methodology is effective dealing with fine-sorted themes from the result that political positions towards different topics vary even within the same politicians. Through verification using American and Spanish political datasets, we find that the positioning of American and Spanish politicians on our defined liberal-conservative axis aligns with social common sense, political news, and previous research. Our architecture improves the standard word-level methodology and can be considered as a useful architecture for sentence-level applications in the future.
Reference39 articles.
1. Debate reaction ideal points: political ideology measurement using real-time reaction data;Argyle;Stat. Politics Policy,2021
2. BaskaM.
Top Republican Mitch McConnell Steps Down – Here's Why That's Good News for LGBTQ+ People2024
3. Efficacy of BERT embeddings on predicting disaster from twitter data;Chanda;arXiv [Preprint]. arXiv,2021
4. ChengI.
Arkansas Senator Wants to Ban Cell Phones in Schools.2023
5. CioffiC.
The Equality Act has Languished in McConnell's Senate but Sponsor Says it's Still Historic.2020