Tweets Classification on the Base of Sentiments for US Airline Companies-Reference-Cited by-同舟云学术

Tweets Classification on the Base of Sentiments for US Airline Companies

Published:2019-11-04 Issue:11 Volume:21 Page:1078
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Rustam Furqan,Ashraf Imran^ORCID,Mehmood Arif,Ullah Saleem^ORCID,Choi Gyu

Abstract

The use of data from social networks such as Twitter has been increased during the last few years to improve political campaigns, quality of products and services, sentiment analysis, etc. Tweets classification based on user sentiments is a collaborative and important task for many organizations. This paper proposes a voting classifier (VC) to help sentiment analysis for such organizations. The VC is based on logistic regression (LR) and stochastic gradient descent classifier (SGDC) and uses a soft voting mechanism to make the final prediction. Tweets were classified into positive, negative and neutral classes based on the sentiments they contain. In addition, a variety of machine learning classifiers were evaluated using accuracy, precision, recall and F1 score as the performance metrics. The impact of feature extraction techniques, including term frequency (TF), term frequency-inverse document frequency (TF-IDF), and word2vec, on classification accuracy was investigated as well. Moreover, the performance of a deep long short-term memory (LSTM) network was analyzed on the selected dataset. The results show that the proposed VC performs better than that of other classifiers. The VC is able to achieve an accuracy of 0.789, and 0.791 with TF and TF-IDF feature extraction, respectively. The results demonstrate that ensemble classifiers achieve higher accuracy than non-ensemble classifiers. Experiments further proved that the performance of machine learning classifiers is better when TF-IDF is used as the feature extraction method. Word2vec feature extraction performs worse than TF and TF-IDF feature extraction. The LSTM achieves a lower accuracy than machine learning classifiers.

Funder

National Research Foundation of Korea

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/21/11/1078/pdf

Reference52 articles.

1. 2.5 Quintillion Bytes of Data Created Every Day. How Does CPG & Retail Manage It;Jacobson,2013

2. Introduction for the Special Issue on Beyond the Hypes of Geospatial Big Data: Theories, Methods, Analytics, and Applications

3. Opinion Mining and Sentiment Analysis

4. Election 2006 online;Rainie,2007

Cited by 126 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. ArSa-Tweets: A novel Arabic sarcasm detection system based on deep learning model;Heliyon;2024-09

2. Ensemble Learning with Pre-Trained Transformers for Crash Severity Classification: A Deep NLP Approach;Algorithms;2024-06-30

3. Fuzzy-CNN: Improving personal human identification based on IRIS recognition using LBP features;Journal of Information Security and Applications;2024-06

4. Scientific text citation analysis using CNN features and ensemble learning model;PLOS ONE;2024-05-28

5. ASIF: attention-based sentiment inquiry framework for profound product recommendations;Multimedia Tools and Applications;2024-05-11