Affiliation:
1. Faculty of Information Science and Technology, Multimedia University, Melaka 75450, Malaysia
Abstract
This paper proposes a novel hybrid model for sentiment analysis. The model leverages the strengths of both the Transformer model, represented by the Robustly Optimized BERT Pretraining Approach (RoBERTa), and the Recurrent Neural Network, represented by Gated Recurrent Units (GRU). The RoBERTa model provides the capability to project the texts into a discriminative embedding space through its attention mechanism, while the GRU model captures the long-range dependencies of the embedding and addresses the vanishing gradients problem. To overcome the challenge of imbalanced datasets in sentiment analysis, this paper also proposes the use of data augmentation with word embeddings by over-sampling the minority classes. This enhances the representation capacity of the model, making it more robust and accurate in handling the sentiment classification task. The proposed RoBERTa-GRU model was evaluated on three widely used sentiment analysis datasets: IMDb, Sentiment140, and Twitter US Airline Sentiment. The results show that the model achieved an accuracy of 94.63% on IMDb, 89.59% on Sentiment140, and 91.52% on Twitter US Airline Sentiment. These results demonstrate the effectiveness of the proposed RoBERTa-GRU hybrid model in sentiment analysis.
Funder
Fundamental Research Grant Scheme of the Ministry of Higher Education
Multimedia University Internal Research Grant
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference26 articles.
1. A comprehensive survey on sentiment analysis: Approaches, challenges and trends;Birjali;Knowl.-Based Syst.,2021
2. Bahdanau, D., Cho, K., and Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv.
3. Advanced classification method of twitter data using sentiment analysis for airline service;Hemakala;Int. J. Comput. Sci. Eng.,2018
4. Makhmudah, U., Bukhori, S., Putra, J.A., and Yudha, B.A.B. (2019, January 16–17). Sentiment Analysis Of Indonesian Homosexual Tweets Using Support Vector Machine Method. Proceedings of the 2019 International Conference on Computer Science, Information Technology, and Electrical Engineering (ICOMITEE), Jember, Indonesia.
5. AlSalman, H. (2020, January 19–21). An improved approach for sentiment analysis of arabic tweets in twitter social media. Proceedings of the 2020 3rd International Conference on Computer Applications & Information Security (ICCAIS), Riyadh, Saudi Arabia.
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献