Novel cluster set optimization model with unique identifier tagging for twitter data analysis

Author:

Vanam Harika1,JebersonRetna Raj R2,Janga Vijaykumar3

Affiliation:

1. Computer Science & Engineering, Satyabhama Institute of Science and Technology (Deemed to be University) Chennai, India

2. Department of Information Technology, Satyabhama Institute of Science and Technology (Deemed to be University) Chennai, India

3. Department of AI & ML, Balaji Institute of Technology & Science, Narsampet, Warangal, Telangana, India

Abstract

Blogs, internet forums, social networks, and micro-blogging sites are some of the growing number of places where users can voice their opinions. Opinions on any given product, issue, service, or idea are contained in data, making them a valuable resource in their own right. Popular social networking services like Twitter, Facebook, and Google+ allows expressing views on a variety of topics, participating in discussions, or sending messages to a global user. Twitter sentiment analysis has received a lot of attention recently.Sentiment analysis is finding how a person feels about a topic from their written response about it and it can be separated into positive and negative through its use. Doing so enables to classify the tweets made by a user in to appropriate classification category based on which some decisions can be made. The literature proposed approaches to develop the classifiers on the Twitter datasets. Operations, including tokenization, stop-word removal, and stemming will be performed. NLP converts the text to a machine-readable representation. Artificial Intelligence (AI) combines NLP data to evaluate if a situation is positive or negative. The document’s subjectivity can be identified using ML and NLP techniques to categorize them in to positive, neutral, or negative. Performing sentiment analysis in Twitter data can be tedious due to limited size, unstructured nature, misspellings, slang, and abbreviations. For this task, a Tweet Analyzing Model for Cluster Set Optimization with Unique Identifier Tagging (TAM-CSO-UIT) was built using prospects to determine positive or negative sentiment in tweets obtained from Twitter. This approach assigns a +ve/-ve value to each entry in the Tweet database based on probability assignment using n-gram model. To perform this effectively the tweet dataset is considered as a sliding window of length L. The proposed model accurately analyses and classifies the tweets.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference14 articles.

1. A framework for Arabic sentiment analysis using supervised classification;Duwairi;Int J Data Mining, Modelling and Management,2016

2. Sentiment Analysis for Modern Standard Arabic And Colloquial;Hossam Ibrahim;International Journal on Natural Language Computing (IJNLC),2015

3. The use of psycho-physiological interaction analysis with FMRI-data in is research-a guideline;Hubert;Commun Assoc Inf Syst (CAIS),2017

4. Social information discovery enhanced by sentiment analysis techniques;Diamantini;FutGenerComput Syst,2019

5. A Topic based Approach for Sentiment Analysis on Twitter Data

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3