Urdu Sentiment Analysis-Reference-Cited by-同舟云学术

Urdu Sentiment Analysis

Published:2022-06-01 Issue:1 Volume:27 Page:30-42
ISSN:2255-8691
Container-title:Applied Computer Systems
language:en
Short-container-title:

Author:

Rehman Iffraah¹,Soomro Tariq Rahim¹^ORCID

Affiliation:

1. CCSIS, Institute of Business Management (IoBM) , Karachi , Pakistan

Abstract

Abstract The world is heading towards more modernized and digitalized data and therefore a significant growth is observed in the active number of social media users with each passing day. Each post and comment can give an insight into valuable information about a certain topic or issue, a product or a brand, etc. Similarly, the process to uncover the underlying information from the opinion that a person keeps about any entity is called a sentiment analysis. The analysis can be carried out through two main approaches, i.e., either lexicon-based or machine learning algorithms. A significant amount of work in the different domains has been done in numerous languages for sentiment analysis, but minimal research has been conducted on the national language of Pakistan, which is Urdu. Twitter users who are familiar with Urdu update the tweets in two different textual formats either in Urdu Script (Nastaleeq) or in Roman Urdu. Thus, the paper is an attempt to perform the sentiment analysis on the Urdu language by extracting the tweets (Nastaleeq and Roman Urdu both) from Twitter using Tweepy API. A machine learning-based approach has been adopted for this study and the tool opted for the purpose is WEKA. The best algorithm was identified based on evaluation metrics, which comprise the number of correctly and incorrectly classified instances, accuracy, precision, and recall. SMO was found to be the most suitable machine learning algorithm for performing the sentiment analysis on Urdu (Nastaleeq) tweets, while the Roman Urdu Random Forest algorithm was identified as the best one.

Publisher

Walter de Gruyter GmbH

Link

https://www.sciendo.com/pdf/10.2478/acss-2022-0004

Reference47 articles.

1. [1] J. Serrano-Guerrero, J. A. Olivas, F. P. Romero, and E. Herrera-Viedma, “Sentiment analysis: A review and comparative analysis of web,” Information Sciences, vol. 311, pp. 18–38, Aug. 2015. https://doi.org/10.1016/j.ins.2015.03.040

2. [2] L. Zhang, S. Wang, and B. Liu, “Deep learning for sentiment analysis: A survey,” WIRES data mining and knowledge discovery, vol. 8, no. 4, July 2018. https://doi.org/10.1002/widm.1253

3. [3] M. Giatsogloua, M. G. Vozalis, K. Diamantaras, A. Vakali, G. Sarigiannidis, and K. C. Chatzisavvas, “Sentiment analysis leveraging emotions and word embeddings,” Expert Systems with Applications, vol. 69, pp. 214–224, Mar. 2017. https://doi.org/10.1016/j.eswa.2016.10.043

4. [4] K. K. Mohbey, B. Bakariya, and V. Kalal, “A study and comparison of sentiment analysis techniques using demonetization: Case study,” in Sentiment Analysis and Knowledge Discovery in Contemporary Business, 2018, pp. 1–14. https://doi.org/10.4018/978-1-5225-4999-4.ch001

5. [5] C. S. Khoo and S. B. Johnkhan, “Lexicon-based sentiment analysis: Comparative Evaluation of Six Sentiment Lexicons,” Journal of Information Science, vol. 44, no. 4, pp. 491–511, 19 Apr. 2017. https://doi.org/10.1177/0165551517703514

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A hybrid dependency-based approach for Urdu sentiment analysis;Scientific Reports;2023-12-12

2. Sentiment Analysis Based on Urdu Reviews Using Hybrid Deep Learning Models;Applied Computer Systems;2023-12-01

3. Innovations in Urdu Sentiment Analysis Using Machine and Deep Learning Techniques for Two-Class Classification of Symmetric Datasets;Symmetry;2023-05-05

4. BERT-Based Sentiment Analysis for Low-Resourced Languages: A Case Study of Urdu Language;IEEE Access;2023