Abstract
PurposeThis paper purposed a multi-facet sentiment analysis system.Design/methodology/approachHence, This paper uses multidomain resources to build a sentiment analysis system. The manual lexicon based features that are extracted from the resources are fed into a machine learning classifier to compare their performance afterward. The manual lexicon is replaced with a custom BOW to deal with its time consuming construction. To help the system run faster and make the model interpretable, this will be performed by employing different existing and custom approaches such as term occurrence, information gain, principal component analysis, semantic clustering, and POS tagging filters.FindingsThe proposed system featured by lexicon extraction automation and characteristics size optimization proved its efficiency when applied to multidomain and benchmark datasets by reaching 93.59% accuracy which makes it competitive to the state-of-the-art systems.Originality/valueThe construction of a custom BOW. Optimizing features based on existing and custom feature selection and clustering approaches.
Subject
Computer Science Applications,Information Systems,Software
Reference68 articles.
1. Sentiment analysis and opinion mining;Synth Lectures Hum Lang Tech,2012
2. Conceptual and empirical comparison of dimensionality reduction algorithms (PCA, KPCA, LDA, MDS, SVD, LLE, ISOMAP, LE, ICA, t-SNE);Computer Sci Rev,2021
3. Recent trends in dimension reduction methods;ICIDSSD,2021
4. A comprehensive review of dimensionality reduction techniques for feature selection and feature extraction;J Appl Sci Technology Trends,2020
5. Overview and comparative study of dimensionality reduction techniques for high dimensional data;Inf Fusion,2020
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献