Abstract
Since the recent outbreak of COVID-19, many scientists have started working on distinct challenges related to mining the available large datasets from social media as an effective asset to understand people’s responses to the pandemic. This study presents a comprehensive social data mining approach to provide in-depth insights related to the COVID-19 pandemic and applied to the Arabic language. We first developed a technique to infer geospatial information from non-geotagged Arabic tweets. Secondly, a sentiment analysis mechanism at various levels of spatial granularities and separate topic scales is introduced. We applied sentiment-based classifications at various location resolutions (regions/countries) and separate topic abstraction levels (subtopics and main topics). In addition, a correlation-based analysis of Arabic tweets and the official health providers’ data will be presented. Moreover, we implemented several mechanisms of topic-based analysis using occurrence-based and statistical correlation approaches. Finally, we conducted a set of experiments and visualized our results based on a combined geo-social dataset, official health records, and lockdown data worldwide. Our results show that the total percentage of location-enabled tweets has increased from 2% to 46% (about 2.5M tweets). A positive correlation between top topics (lockdown and vaccine) and the COVID-19 new cases has also been recorded, while negative feelings of Arab Twitter users were generally raised during this pandemic, on topics related to lockdown, closure, and law enforcement.
Subject
Earth and Planetary Sciences (miscellaneous),Computers in Earth Sciences,Geography, Planning and Development
Reference52 articles.
1. Digital 2021: Global Overview Report
https://datareportal.com/reports/digital-2021-global-overview-report
2. A proposed approach for emotion lexicon enrichment;Mohsen;Int. J. Comput. Electr. Autom. Control. Inf. Eng.,2016
3. Advanced Automatic Lexicon with Sentiment Analysis Algorithms for Arabic Reviews
4. Internet World Users by Language: Top 10 Languages
https://datareportal.com/reports/digital-2021-global-overview-report
5. Middle East Internet Statistics, Population, Facebook and Telecommunications Reports
https://eipss-eg.org/wp-content/uploads/2015/10/stats5.htm
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献