A Pipeline for Rapid Post-Crisis Twitter Data Acquisition, Filtering and Visualization-Reference-Cited by-同舟云学术

A Pipeline for Rapid Post-Crisis Twitter Data Acquisition, Filtering and Visualization

Published:2019-04-02 Issue:2 Volume:7 Page:33
ISSN:2227-7080
Container-title:Technologies
language:en
Short-container-title:Technologies

Author:

Kejriwal Mayank^ORCID,Gu Yao

Abstract

Due to instant availability of data on social media platforms like Twitter, and advances in machine learning and data management technology, real-time crisis informatics has emerged as a prolific research area in the last decade. Although several benchmarks are now available, especially on portals like CrisisLex, an important, practical problem that has not been addressed thus far is the rapid acquisition, benchmarking and visual exploration of data from free, publicly available streams like the Twitter API in the immediate aftermath of a crisis. In this paper, we present such a pipeline for facilitating immediate post-crisis data collection, curation and relevance filtering from the Twitter API. The pipeline is minimally supervised, alleviating the need for feature engineering by including a judicious mix of data preprocessing and fast text embeddings, along with an active learning framework. We illustrate the utility of the pipeline by describing a recent case study wherein it was used to collect and analyze millions of tweets in the immediate aftermath of the Las Vegas shootings in 2017.

Publisher

MDPI AG

Link

https://www.mdpi.com/2227-7080/7/2/33/pdf

Reference61 articles.

1. Crisis informatics—New data for extraordinary times

2. Design Challenges/Solutions for Environments Supporting the Analysis of Social Media Data in Crisis Informatics Research

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An Agent-Based Data Acquisition Pipeline for Image Data;IEEE Access;2024

2. Strengthening Post-Disaster Management Activities by Rating Social Media Corpus;Research Anthology on Managing Crisis and Risk Communications;2022-07-01

3. A Feasibility Study of Open-Source Sentiment Analysis and Text Classification Systems on Disaster-Specific Social Media Data;2021 IEEE Symposium Series on Computational Intelligence (SSCI);2021-12-05

4. A social Beaufort scale to detect high winds using language in social media posts;Scientific Reports;2021-02-11

5. Visual Exploration and Debugging of Machine Learning Classification over Social Media Data;Lecture Notes in Social Networks;2021