Affiliation:
1. Department of Computer Science and Engineering, NMAM Institute of Technology, Nitte, Karkala, India
Abstract
Twitter is a popular microblogging social media, using which its users can share useful information. Keeping a track of user postings and common hashtags allows us to understand what is happening around the world and what are people’s opinions on it. As such, a Twitter trend analysis analyzes Twitter data and hashtags to determine what topics are being talked about the most on Twitter. Feature extraction and trend detection can be performed using machine learning algorithms. Big data tools and techniques are needed to extract relevant information from continuous steam of data originating from Twitter. The objectives of this research work are to analyze the relative popularity of different hashtags and which field has the maximum share of voice. Along with this, the common interests of the community can also be determined. Twitter trends plan an important role in the business field, marketing, politics, sports, and entertainment activities. The proposed work implemented the Twitter trend analysis using latent Dirichlet allocation, cosine similarity, K means clustering, and Jaccard similarity techniques and compared the results with Big Data Apache SPARK tool implementation. The LDA technique for trend analysis resulted in an accuracy of 74% and Jaccard with an accuracy of 83% for static data. The results proved that the real-time tweets are analyzed comparatively faster in the Big Data Apache SPARK tool than in the normal execution environment.
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Information Systems
Reference22 articles.
1. Sentiment analysis in twitter using lexicon based and polarity multiplication;M. Mashuri,2019
2. Sentiment analysis using naive Bayes algorithm of the data crawler: Twitter;M. Wongkar,2019
3. Text mining of Twitter data using a latent Dirichlet allocation topic model and sentiment analysis;S. Yang;International Journal of Computer and Information Engineering,2018
4. Topic modelling Twitter data with latent Dirichlet allocation method;E. S. Negara
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献