Affiliation:
1. Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
Abstract
YouTube is a popular video-sharing platform that offers a diverse range of content. Assessing the quality of a video without watching it poses a significant challenge, especially considering the recent removal of the dislike count feature on YouTube. Although comments have the potential to provide insights into video content quality, navigating through the comments section can be time-consuming and overwhelming work for both content creators and viewers. This paper proposes an NLP-based model to classify Arabic comments as positive or negative. It was trained on a novel dataset of 4212 labeled comments, with a Kappa score of 0.818. The model uses six classifiers: SVM, Naïve Bayes, Logistic Regression, KNN, Decision Tree, and Random Forest. It achieved 94.62% accuracy and an MCC score of 91.46% with NB. Precision, Recall, and F1-measure for NB were 94.64%, 94.64%, and 94.62%, respectively. The Decision Tree had a suboptimal performance with 84.10% accuracy and an MCC score of 69.64% without TF-IDF. This study provides valuable insights for content creators to improve their content and audience engagement by analyzing viewers’ sentiments toward the videos. Furthermore, it bridges a literature gap by offering a comprehensive approach to Arabic sentiment analysis, which is currently limited in the field.
Subject
Artificial Intelligence,Computer Science Applications,Information Systems,Management Information Systems
Reference35 articles.
1. Tiwari, S., Trivedi, M.C., Kolhe, M.L., Mishra, K.K., and Singh, B.K. (2022). Advances in Data and Information Sciences, Proceedings of ICDIS 2021, Springer Nature Singapore Pte Ltd.
2. AlOtaibi, S., and Khan, M.B. (2017). Sentiment analysis challenges of informal Arabic language. Int. J. Adv. Comput. Sci. Appl., 8.
3. Sentiment Analysis of English Text with Multilevel Features;Rao;Sci. Program.,2022
4. Implementation Naïve Bayes Classification for Sentiment Analysis on Internet Movie Database;Samsir;Build. Inform. Technol. Sci.,2022
5. Prediction of the academic performance of slow learners using efficient machine learning algorithm;Geetha;Adv. Comput. Intell.,2021
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献