Natural Disaster on Twitter: Role of Feature Extraction Method of Word2Vec and Lexicon Based for Determining Direct Eyewitness-Reference-Cited by-同舟云学术

Natural Disaster on Twitter: Role of Feature Extraction Method of Word2Vec and Lexicon Based for Determining Direct Eyewitness

Published:2021-11-15 Issue:23 Volume:18 Page:680
ISSN:2774-0226
Container-title:Trends in Sciences
language:
Short-container-title:Trends Sci

Author:

Faisal Mohammad Reza,Nugroho Radityo Adi,Ramadhani Rahmat,Abadi Friska,Herteno Rudy,Saragih Triando Hamonangan

Abstract

Researchers have collected Twitter data to study a wide range of topics, one of which is a natural disaster. A social network sensor was developed in existing research to filter natural disaster information from direct eyewitnesses, none eyewitnesses, and non-natural disaster information. It can be used as a tool for early warning or monitoring when natural disasters occur. The main component of the social network sensor is the text tweet classification. Similar to text classification research in general, the challenge is the feature extraction method to convert Twitter text into structured data. The strategy commonly used is vector space representation. However, it has the potential to produce high dimension data. This research focuses on the feature extraction method to resolve high dimension data issues. We propose a hybrid approach of word2vec-based and lexicon-based feature extraction to produce new features. The Experiment result shows that the proposed method has fewer features and improves classification performance with an average AUC value of 0.84, and the number of features is 150. The value is obtained by using only the word2vec-based method. In the end, this research shows that lexicon-based did not influence the improvement in the performance of social network sensor predictions in natural disasters. HIGHLIGHTS Implementation of text classification is generally only used to perform sentiment analysis, it is still rare to use it to perform text classification for use in determining direct eyewitnesses in cases of natural disasters One of the common problems in text mining research is the extracted features from the vector space representation method generate high dimension data A hybrid approach of word2vec-based and lexicon-based feature extraction experiment was conducted in order to find a method that can generate new features with low dimensions and also improve the classification performance GRAPHICAL ABSTRACT

Publisher

College of Graduate Studies, Walailak University

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Harvesting Natural Disaster Reports from Social Media with 1D Convolutional Neural Network and Long Short-Term Memory;2023 Eighth International Conference on Informatics and Computing (ICIC);2023-12-08

2. An Effective Preprocessing Data on Performance of Machine Learning for ECG-Based Personal Authentication;Proceedings of the 8th International Conference on Sustainable Information Engineering and Technology;2023-10-24

3. A Social Community Sensor for Natural Disaster Monitoring in Indonesia Using Hybrid 2D CNN LSTM;Proceedings of the 8th International Conference on Sustainable Information Engineering and Technology;2023-10-24

4. The Effect of Channel Size on Performance of 1D CNN Architecture for Automatic Detection of Self-Reported COVID-19 Symptoms on Twitter;2023 International Seminar on Intelligent Technology and Its Applications (ISITIA);2023-07-26

5. A comparison of word embedding-based extraction feature techniques and deep learning models of natural disaster messages classification;Journal of Computer Sciences Institute;2023-06-30