Abstract
Topic extraction and categorization is an important task because it reveals which topics users discuss most in their tweets or opinions and which therefore need to be analyzed. In this work, topics are extracted from positive and negative opinions and then categorized into groups. First, a collection of opinions is divided into two sets, positive opinions and negative opinions, using a sentiment analyzer. A method is then proposed to find the most discussed topics in each set. To extract topics from a set of opinions, the nouns in those opinions are extracted. Similar topics are then combined using the synonymy relation, and the frequent topic words are represented using GloVe embeddings. Finally, the topics are categorized by applying a clustering algorithm to the frequent topic words. The proposed method is evaluated on tweets from a Twitter User dataset. The experimental results are promising and yield interesting and meaningful clusters of topics. An analysis of the results for both positive and negative opinions is also presented.
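The pipeline described above can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the abstract does not name the sentiment analyzer, the part-of-speech tagger, the synonym resource, or the clustering algorithm, so everything here is an assumption made for illustration. The word lists `POSITIVE`, `NEGATIVE`, `NOUNS`, the `SYNONYMS` map, the 2-D `VECTORS` (standing in for pretrained GloVe embeddings), the helper names, the `min_count` threshold, and the choice of k-means are all hypothetical.

```python
from collections import Counter
import math

# Toy stand-ins (all hypothetical). A real pipeline would use a sentiment
# analyzer, a POS tagger, a synonym resource, and pretrained GloVe vectors.
POSITIVE = {"love", "great", "good"}
NEGATIVE = {"hate", "bad", "slow"}
NOUNS = {"battery", "screen", "display", "delivery", "shipping", "price"}
SYNONYMS = {"display": "screen", "shipping": "delivery"}  # variant -> canonical
VECTORS = {  # tiny 2-D vectors standing in for GloVe embeddings
    "battery": (0.9, 0.1), "screen": (0.8, 0.2),
    "delivery": (0.1, 0.9), "price": (0.2, 0.8),
}

def polarity(text):
    """Label an opinion by counting lexicon hits (toy sentiment analyzer)."""
    words = text.lower().split()
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    return "positive" if score >= 0 else "negative"

def frequent_topics(opinions, min_count=2):
    """Extract noun words, merge synonyms, and keep the frequent ones."""
    counts = Counter(
        SYNONYMS.get(w, w)
        for text in opinions
        for w in text.lower().split()
        if w in NOUNS
    )
    return sorted(t for t, c in counts.items() if c >= min_count)

def kmeans(words, k, iterations=10):
    """Plain k-means over the word vectors; seeds are the first k words."""
    k = min(k, len(words))
    centroids = [VECTORS[w] for w in words[:k]]
    clusters = []
    for _ in range(iterations):
        # Assignment step: each word joins its nearest centroid.
        clusters = [[] for _ in range(k)]
        for w in words:
            nearest = min(range(k), key=lambda i: math.dist(VECTORS[w], centroids[i]))
            clusters[nearest].append(w)
        # Update step: move each centroid to the mean of its members.
        for i, members in enumerate(clusters):
            if members:
                centroids[i] = tuple(
                    sum(VECTORS[w][d] for w in members) / len(members)
                    for d in range(2)
                )
    return clusters

opinions = [
    "love the battery and the screen",
    "great display and good battery",
    "good price and great delivery",
    "love the shipping and the price",
    "hate the slow delivery",
    "bad battery and slow shipping",
    "bad battery",
]

# Step 1: split opinions by sentiment.
positive = [o for o in opinions if polarity(o) == "positive"]
negative = [o for o in opinions if polarity(o) == "negative"]

# Steps 2-4: frequent topic words per set, then clusters of those words.
positive_clusters = kmeans(frequent_topics(positive), k=2)
negative_clusters = kmeans(frequent_topics(negative), k=2)
print(positive_clusters)
print(negative_clusters)
```

On this toy data the positive opinions separate into a device-related group and a purchase-related group, which mirrors the kind of topic categories the method aims to produce. With real tweets, each toy component would be replaced by the corresponding tool.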
Publisher
Research Square Platform LLC