Stance Detection on Short Turkish Text: A Case Study of Russia-Ukraine War-Reference-Cited by-同舟云学术

Stance Detection on Short Turkish Text: A Case Study of Russia-Ukraine War

Published:2024-06-08 Issue:3 Volume:24 Page:602-619
ISSN:2149-3367
Container-title:Afyon Kocatepe University Journal of Sciences and Engineering
language:tr
Short-container-title:

Author:

Fırat Eray¹^ORCID,Arslan Serdar¹^ORCID

Affiliation:

1. ÇANKAYA ÜNİVERSİTESİ

Abstract

In recent years, social media has emerged as a crucial source of information for gauging public sentiment on a variety of topics. As a result, the need for automated data extraction from these platforms has grown. Stance detection, a subtask in natural language processing, plays a pivotal role in this process by automatically determining users' opinions regarding specific subjects, events, or individuals. To address this, we developed a labeled Turkish dataset focused on determining users' stances on the Russia-Ukraine War using social media content. The dataset, comprising 8215 tweets from Twitter, was meticulously cleaned and annotated for two key targets: Russia and Ukraine. We evaluated several machine learning methods, including Support Vector Machines, Random Forest, k-Nearest Neighbor, XGBoost, Long-Short Term Memory (LSTM), and Gated Recurrent Unit (GRU), with word embeddings from GloVe and FastText. Additionally, we incorporated a transformer-based approach for stance detection. Given the dataset's imbalance between targets, we applied undersampling and oversampling techniques alongside these algorithms. Our experiment results indicate that BERT-based models outperformed all other methods, with LSTM and GRU producing similarly strong outcomes. The newly established Turkish corpus stands as a valuable resource in this field, with potential for future use in conjunction with transformer-based approaches. In summary, this study advances the field of stance detection research in the context of Turkish text.

Publisher

Afyon Kocatepe Universitesi Fen Ve Muhendislik Bilimleri Dergisi

Reference30 articles.

1. ALDayel, Abeer, and Walid Magdy. 2021. “Stance Detection on Social Media: State of the Art and Trends.” Information Processing and Management 58(4):102597. https://www.doi.org/10.1016/j.ipm.2021.102597.

2. Allaway, Emily, and Kathleen McKeown. 2020. “Zero-Shot Stance Detection: A Dataset and Model Using Generalized Topic Representations.” EMNLP 2020 - 2020 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference 8913–31. https://www.doi.org/10.18653/v1/2020.emnlp-main.717.

3. Bojanowski, Piotr, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. “Enriching Word Vectors with Subword Information.”

4. Breiman, Leo. 2001. “Random Forests.” Machine Learning 45(1):5–32. https://www.doi.org/10.1023/A:1010933404324.

5. Chawla, N. V, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer. 2002. “{SMOTE}: Synthetic Minority Over-Sampling Technique.” Journal of Artificial Intelligence Research 16:321–57. https://www.doi.org/10.1613/jair.953.