Abstract
Background
At the time of this writing, the coronavirus disease (COVID-19) pandemic has already put tremendous strain on citizens, resources, and economies in many countries around the world. Social distancing measures, travel bans, self-quarantines, and business closures are changing the very fabric of societies worldwide. With people forced out of public spaces, much of the conversation about these phenomena now occurs online on social media platforms like Twitter.
Objective
In this paper, we describe a multilingual COVID-19 Twitter data set that we are making available to the research community via our COVID-19-TweetIDs GitHub repository.
Methods
We started this ongoing data collection on January 28, 2020, leveraging Twitter’s streaming application programming interface (API) and Tweepy to follow certain keywords and accounts that were trending at the time data collection began. We used Twitter’s search API to query for past tweets, resulting in the earliest tweets in our collection dating back to January 21, 2020.
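As a rough illustration of keyword-based collection, the sketch below filters tweet texts against a list of tracked phrases. It does not call Twitter's API or Tweepy; it only approximates the phrase-matching idea (a phrase matches when all of its terms appear in the tweet, ignoring case and order). The phrase list and function names are hypothetical and are not the keywords or code used for this data set.

```python
import re

# Hypothetical tracked phrases, for illustration only;
# these are NOT the keywords actually followed for the data set.
TRACKED_PHRASES = ["coronavirus", "covid-19", "wuhan outbreak"]


def tokenize(text):
    """Lowercase the text and split it into a set of word-like tokens."""
    return set(re.findall(r"[\w-]+", text.lower()))


def matches_any(text, phrases=TRACKED_PHRASES):
    """Return True if every term of at least one phrase occurs in the text."""
    tokens = tokenize(text)
    return any(
        all(term in tokens for term in phrase.lower().split())
        for phrase in phrases
    )


sample_texts = [
    "New Coronavirus cases reported today",
    "Outbreak reported in Wuhan",
    "Nice weather this afternoon",
]
# Keep only the texts that match a tracked phrase.
matched = [t for t in sample_texts if matches_any(t)]
```

In a live collection the filtering would be done server-side by the streaming endpoint; a local check like this is mainly useful for verifying or re-filtering tweets that have already been collected.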
Results
Since the inception of our collection, we have actively maintained and updated our GitHub repository on a weekly basis. We have published over 123 million tweets, with over 60% of the tweets in English. This paper also presents basic statistics showing that Twitter activity responds to COVID-19-related events.
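A language breakdown such as the English share reported above can be computed with a simple tally. The following is a minimal sketch, assuming each tweet is available as a dictionary with a `lang` field; the sample records and the `language_shares` helper are made up for illustration and are not drawn from the data set.

```python
from collections import Counter


def language_shares(tweets):
    """Return the fraction of tweets per language code.

    Tweets without a "lang" field are counted as "und" (undetermined).
    """
    counts = Counter(t.get("lang", "und") for t in tweets)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {lang: n / total for lang, n in counts.items()}


# Made-up sample records, for illustration only.
sample = [
    {"id": 1, "lang": "en"},
    {"id": 2, "lang": "en"},
    {"id": 3, "lang": "es"},
    {"id": 4, "lang": "en"},
    {"id": 5},
]
shares = language_shares(sample)
```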
Conclusions
It is our hope that our contribution will enable the study of online conversation dynamics in the context of a planetary-scale epidemic outbreak of unprecedented proportions and implications. This data set could also help track COVID-19-related misinformation and unverified rumors or enable the understanding of fear and panic, among many other uses.
Subject
Public Health, Environmental and Occupational Health, Health Informatics