Affiliation:
1. King Saud University, Saudi Arabia
Abstract
Twitter lets people freely express their opinions and ideas in no more than 140 characters, which has made it one of the most prevalent social networking websites in the world. Because Twitter is popular in Saudi Arabia, we believe that tweets are a good source for capturing the public’s sentiment, especially since the country is in a fractious region. We review the challenges and difficulties that Arabic tweets present, using Saudi Arabia as a basis, and propose our solution. A typical problem is the practice of tweeting in dialectal Arabic. Based on our observations, we recommend a hybrid approach that combines semantic orientation and machine learning techniques. In this approach, the lexicon-based classifier labels the training data, a time-consuming task that is often done manually. The output of the lexical classifier is then used as training data for the SVM machine learning classifier. The experiments show that our hybrid approach improved the F-measure of the lexical classifier by 5.76% while the accuracy rose by 16.41%, achieving an overall F-measure and accuracy of 84% and 84.01%, respectively.
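The hybrid pipeline described in the abstract (a lexicon-based classifier labels tweets, and its output trains an SVM) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the tiny English word lists, the toy tweets, the bag-of-words features, and the hand-rolled linear SVM trained by sub-gradient descent on the hinge loss. The paper's actual lexicon, Arabic preprocessing, features, and SVM configuration are not given in this record.

```python
# Hypothetical toy lexicon (assumption; not the paper's lexicon).
POSITIVE = {"good", "great", "love", "happy"}
NEGATIVE = {"bad", "awful", "hate", "sad"}


def lexicon_label(tweet):
    """Return +1 or -1 from polarity-word counts, or None if undecided."""
    tokens = tweet.lower().split()
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    if score == 0:
        return None
    return 1 if score > 0 else -1


def featurize(tweet, vocab):
    """Binary bag-of-words vector over a fixed vocabulary."""
    tokens = set(tweet.lower().split())
    return [1.0 if word in tokens else 0.0 for word in vocab]


def train_linear_svm(X, y, epochs=20, lr=0.1, lam=0.01):
    """Sub-gradient descent on the L2-regularized hinge loss."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:
                # Margin violated: regularization step plus hinge step.
                w = [wj - lr * (lam * wj - yi * xj) for wj, xj in zip(w, xi)]
                b += lr * yi
            else:
                # Margin satisfied: regularization step only.
                w = [wj - lr * lam * wj for wj in w]
    return w, b


def predict(tweet, vocab, w, b):
    x = featurize(tweet, vocab)
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


# Step 1: the lexicon classifier labels raw tweets (toy data, assumption).
raw_tweets = [
    "love the fast delivery",
    "great fast service",
    "hate the slow delivery",
    "awful slow service",
    "the delivery arrived",  # no polarity word: left unlabeled and dropped
]
labeled = [(t, lexicon_label(t)) for t in raw_tweets]
labeled = [(t, lab) for t, lab in labeled if lab is not None]

# Step 2: the automatically labeled tweets become SVM training data.
vocab = sorted({tok for t, _ in labeled for tok in t.lower().split()})
X = [featurize(t, vocab) for t, _ in labeled]
y = [lab for _, lab in labeled]
w, b = train_linear_svm(X, y)

# Step 3: the SVM can score tweets that contain no lexicon word at all.
for tweet in ["fast shipping today", "slow shipping today"]:
    print(tweet, "| lexicon:", lexicon_label(tweet), "| svm:", predict(tweet, vocab, w, b))
```

The point of the design is that the SVM generalizes beyond the lexicon: words that merely co-occur with polarity words in the auto-labeled tweets (here "fast" and "slow") acquire weights of their own, so tweets the lexicon cannot decide may still be classified.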
Subject
Library and Information Sciences, Information Systems
Cited by
92 articles.