Affiliation:
1. JomoKenyatta University of Agriculture and Technology
Abstract
Abstract
Sentiment analysis has become an important area of research in natural language processing. This technique has a wide range of applications such as comprehending user preferences in ecommerce feedback portals, politics, and in governance. However, accurate sentiment analysis requires robust text representation techniques that can convert words into precise vectors that represent the input text. There are two categories of text representation techniques: lexicon-based techniques and machine learning-based techniques. From research, both techniques have limitations. For instance, pre-trained word embeddings such as Word2Vec, Glove and Bidirectional Encoder Representations from Transformers (BERT) generate vectors by considering word distances, similarities and occurrences ignoring other aspects such as word sentiment orientation. Aiming at such limitations, this paper presents a sentiment classification model (named LeBERT) combining Sentiment Lexicon, N-grams, BERT and CNN. In the model, Sentiment Lexicon, N-grams and BERT are used to vectorize words selected from a section of the input text. CNN is used as the deep neural network classifier for feature mapping and giving the output sentiment class. The proposed model is evaluated on Yelp’s three datasets (movie, restaurant and products’ reviews) using accuracy, precision and F-measure as performance metrics. The experimental results indicate that the proposed LeBERT model outperform the existing state-of-the-art models with an F-measure score of 88.73% in binary sentiment classification.
Publisher
Research Square Platform LLC
Reference39 articles.
1. Jiang Z., Gao B., He Y., Han Y., Doyle P., and Zhu Q. (2021). Text Classification Using Novel Term Weighting Scheme-Based Improved TF-IDF for Internet Media Reports. Mathematical Problems in Engineering Volume 2021, https://doi.org/10.1155/2021/6619088.
2. Ensemble of Classifiers and Term Weighting Schemes for Sentiment Analysis in Turkish;Onan A;Scientific Research Communications,2021
3. Kalarani P. & Brunda S. S. (2015). An overview on Research Challenges in Opinion Mining and Sentiment Analysis. International Journal of Innovative Research in Computer and Communication Engineering. Vol. 3, Issue 10, October 2015, ISSN: 2320–9801.
4. Yang J., Xiu P., Sun L., Ying L., Muthu B. (2022). Social media data analytics for business decision making system to competitive analysis, Information Processing & Management, Volume 59, Issue 1, 2022, 102751, ISSN 0306–4573, https://doi.org/10.1016/j.ipm.2021.102751.
5. Li Rao. (2022). Sentiment Analysis of English Text with Multilevel Features, Scientific Programming, vol. 2022, Article ID 7605125, 10 pages, 2022. https://doi.org/10.1155/2022/7605125.