Author:
Pujo Ariesanto Akhmad Ekka,Adi Kusworo,Puji Widodo Aris
Abstract
Customers typically provide both online and physical services they use ratings and reviews. However, the volume of reviews might grow very quickly. The power of machine learning to recognize this kind of data is astounding. Numerous algorithms that could be employed for job of sentiment analysis have been developed to categorize tweets about airline sentiment into positive, neutral, or negative categories, this study compares the effectiveness algorithm for machine learning Naive Bayes (NB), Logistic Regression (LR), Decision Tree (DT), Support Vector Machine (SVM), Adaboost, Extreme Gradient Boosting (XGB), Light Gradient Boosting Machine (LGBM), and Random Forest (RF) dividing the Twitter airline sentiment data into positive, neutral, or negative categories using the TF IDF model. The experiment involved two phases of activity: a classification algorithm utilizing SMOTE and sans SMOTE with Stratified K-Fold CV algorithm. With the RF model, the greatest performance accuracy for SMOTE is 97.56%. Without SMOTE, the RF with a value of 92.21% provides the maximum performance accuracy. The findings demonstrate that SMOTE oversampling can improve sentiment analysis accuracy.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Comprehensive review and comparative analysis of transformer models in sentiment analysis;Knowledge and Information Systems;2024-09-06
2. Exploring Tech-Enabled Solutions;Advances in Hospitality, Tourism, and the Services Industry;2024-06-30