ArSentBERT: fine-tuned bidirectional encoder representations from transformers model for Arabic sentiment classification

Published: 2023-04-01 Issue: 2 Volume: 12 Page: 1196-1202
ISSN: 2302-9285
Container-title: Bulletin of Electrical Engineering and Informatics
Short-container-title: Bulletin EEI

Author:

Abdelfattah Mohamed Fawzy, Fakhr Mohamed Waleed, Rizka Mohamed Abo

Abstract

Sentiment analysis in Arabic is challenging because of the language's linguistic complexity, which shows up at the level of words, sentences, and paragraphs. Moreover, most Arabic documents mix multiple dialects, writing alphabets, and styles (e.g., Franco-Arab). Nevertheless, fine-tuned bidirectional encoder representations from transformers (BERT) models can provide reasonable prediction accuracy for Arabic sentiment classification tasks. This paper presents an approach to fine-tuning BERT models for classifying sentiment in Arabic text. It uses pre-trained Arabic BERT models and tokenizers and consists of three stages. The first stage is text preprocessing and data cleaning. The second stage applies transfer learning from the pre-trained models' weights and trains all encoder layers. The third stage uses a fully connected layer and a dropout layer for classification. We tested our fine-tuned models on five datasets of Arabic reviews written in different dialects and compared the results with 11 state-of-the-art models. The experimental results show that our models provide better prediction accuracy than the competing models. We also show that choosing an appropriate pre-trained BERT model and tokenizer type improves the accuracy of Arabic sentiment classification.
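
The abstract does not list the cleaning steps used in the first stage, so the following is only a minimal sketch of what Arabic text preprocessing commonly involves, not the authors' pipeline. The function name `clean_arabic_text` and every rule in it (dropping URLs and mentions, removing diacritics and tatweel, unifying alef variants, collapsing elongated letters) are assumptions chosen for illustration. It uses only the Python standard library and does not attempt to handle Franco-Arab text.

```python
import re

# All rules below are illustrative assumptions; the abstract does not
# specify which cleaning steps the authors applied.
_URL_OR_MENTION = re.compile(r"(https?://\S+|www\.\S+|@\w+)")
# Arabic diacritics (tashkeel, U+064B..U+0652) plus superscript alef (U+0670).
_DIACRITICS = re.compile(r"[\u064B-\u0652\u0670]")
# Tatweel (kashida), a purely typographic stretching character.
_TATWEEL = "\u0640"
# Alef with madda, hamza above, or hamza below, mapped to bare alef (U+0627).
_ALEF_VARIANTS = re.compile(r"[\u0622\u0623\u0625]")
# Any character repeated three or more times in a row (elongation).
_ELONGATION = re.compile(r"(.)\1{2,}")
_WHITESPACE = re.compile(r"\s+")


def clean_arabic_text(text: str) -> str:
    """Normalize a raw Arabic review before tokenization (hypothetical helper)."""
    text = _URL_OR_MENTION.sub(" ", text)         # drop links and @mentions
    text = _DIACRITICS.sub("", text)              # strip short-vowel marks
    text = text.replace(_TATWEEL, "")             # strip kashida
    text = _ALEF_VARIANTS.sub("\u0627", text)     # unify alef forms
    text = _ELONGATION.sub(r"\1", text)           # collapse 3+ repeats to one
    return _WHITESPACE.sub(" ", text).strip()     # tidy whitespace


if __name__ == "__main__":
    raw = "\u0627\u0644\u0641\u0650\u064A\u0644\u0652\u0645\u064F \u062C\u0645\u064A\u064A\u064A\u064A\u0644 http://t.co/x"
    print(clean_arabic_text(raw))
```

The cleaned string would then be passed to the tokenizer that matches the chosen pre-trained Arabic BERT model. Whether a given rule helps likely depends on that tokenizer's own normalization, so each step should be treated as optional.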

Publisher

Institute of Advanced Engineering and Science

Subject

Electrical and Electronic Engineering, Control and Optimization, Computer Networks and Communications, Hardware and Architecture, Instrumentation, Information Systems, Control and Systems Engineering, Computer Science (miscellaneous)

Cited by 5 articles.

1. Improving Arabic sentiment analysis across context-aware attention deep model based on natural language processing;Language Resources and Evaluation;2024-04-27

2. Enhancing Deep Learning Models for Sentiment Analysis Integrating Texts and Emojis: A Comprehensive Survey;2024 10th International Conference on Communication and Signal Processing (ICCSP);2024-04-12

3. Hybrid Approach for Multi-Classification of News Documents Using Artificial Intelligence;2024 5th International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV);2024-03-11

4. Multilingual, monolingual and mono-dialectal transfer learning for Moroccan Arabic sentiment classification;Social Network Analysis and Mining;2023-12-06

5. Enhancing Arabic Aspect-Based Sentiment Analysis Using End-to-End Model;IEEE Access;2023