A Performance Comparison of Unsupervised Techniques for Event Detection from Oscar Tweets-Reference-Cited by-同舟云学术

A Performance Comparison of Unsupervised Techniques for Event Detection from Oscar Tweets

Published:2022-05-24 Issue: Volume:2022 Page:1-14
ISSN:1687-5273
Container-title:Computational Intelligence and Neuroscience
language:en
Short-container-title:Computational Intelligence and Neuroscience

Author:

Malik Muzamil¹^ORCID,Aslam Waqar¹^ORCID,Aslam Zahid¹,Alharbi Abdullah²,Alouffi Bader³^ORCID,Rauf Hafiz Tayyab⁴

Affiliation:

1. Department of Computer Science & Information Technology, Islamia University of Bahawalpur, Bahawalpur, Pakistan

2. Department of Information Technology, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia

3. Department of Computer Science, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia

4. Centre for Smart Systems, AI and Cybersecurity, Staffordshire University, Stoke-on-Trent, UK

Abstract

People’s lives are influenced by social media. It is an essential source for sharing news, awareness, detecting events, people’s interests, etc. Social media covers a wide range of topics and events to be discussed. Extensive work has been published to capture the interesting events and insights from datasets. Many techniques are presented to detect events from social media networks like Twitter. In text mining, most of the work is done on a specific dataset, and there is the need to present some new datasets to analyse the performance and generic nature of Topic Detection and Tracking methods. Therefore, this paper publishes a dataset of real-life event, the Oscars 2018, gathered from Twitter and makes a comparison of soft frequent pattern mining (SFPM), singular value decomposition and k-means (K-SVD), feature-pivot (Feat-p), document-pivot (Doc-p), and latent Dirichlet allocation (LDA). The dataset contains 2,160,738 tweets collected using some seed words. Only English tweets are considered. All of the methods applied in this paper are unsupervised. This area needs to be explored on different datasets. The Oscars 2018 is evaluated using keyword precision (K-Prec), keyword recall (K-Rec), and topic recall (T-Rec) for detecting events of greater interest. The highest K-Prec, K-Rec, and T-Rec were achieved by SFPM, but they started to decrease as the number of clusters increased. The lowest performance was achieved by Feat-p in terms of all three metrics. Experiments on the Oscars 2018 dataset demonstrated that all the methods are generic in nature and produce meaningful clusters.

Funder

Taif University

Publisher

Hindawi Limited

Subject

General Mathematics,General Medicine,General Neuroscience,General Computer Science

Link

http://downloads.hindawi.com/journals/cin/2022/5980043.pdf

Reference34 articles.

1. A rule dynamics approach to event detection in Twitter with its application to sports and politics

2. Mining Streaming Tweets for Real-Time Event Credibility Prediction in Twitter

3. Detecting life events from twitter based on temporal semantic features

4. Real-time event detection from the Twitter data stream using the TwitterNews+ Framework

5. Sensing Trending Topics in Twitter