Abstract
Social networks such as twitter have emerged as social platforms that can impart a massive knowledge base for people to share their unique ideas and perspectives on various topics and issues with friends and families. Sentiment analysis based on machine learning has been successful in discovering the opinion of the people using redundantly available data. However, recent studies have pointed out that imbalanced data can have a negative impact on the results. In this paper, we propose a framework for improved sentiment analysis through various ordered preprocessing steps with the combination of resampling of minority classes to produce greater performance. The performance of the technique can vary depending on the dataset as its initial focus is on feature selection and feature combination. Multiple machine learning algorithms are utilized for the classification of tweets into positive, negative, or neutral. Results have revealed that random minority oversampling can provide improved performance and it can tackle the issue of class imbalance.
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference39 articles.
1. The Evolution of Social Commerce: The People, Management, Technology, and Information Dimensions
2. Language-Independent Bayesian Sentiment Mining of Twitter;Davies;Proceedings of the Fifth International Workshop on Social Network Mining and Analysis (SNAKDD 2011),2011
3. Opinion Mining and Sentiment Analysis
4. Lexicon-Based Methods for Sentiment Analysis
http://direct.mit.edu/coli/article-pdf/37/2/267/1798865/coli_a_00049.pdf
5. A systematic literature review on machine learning applications for consumer sentiment analysis using online reviews
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献