Extracting Self-Reported COVID-19 Symptom Tweets and Twitter Movement Mobility Origin/Destination Matrices to Inform Disease Models

Author:

Rosato Conor1ORCID,Moore Robert E.1,Carter Matthew1ORCID,Heap John2,Harris John3ORCID,Storopoli Jose4ORCID,Maskell Simon1ORCID

Affiliation:

1. Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool L69 3GJ, UK

2. Computational Biology Facility, University of Liverpool, Liverpool L69 3GJ, UK

3. Public Health England, London NW9 5EQ, UK

4. Department of Computer Science, Universidade Nove de Julho—UNINOVE, Sao Paulo 03155-000, Brazil

Abstract

The emergence of the novel coronavirus (COVID-19) generated a need to quickly and accurately assemble up-to-date information related to its spread. In this research article, we propose two methods in which Twitter is useful when modelling the spread of COVID-19: (1) machine learning algorithms trained in English, Spanish, German, Portuguese and Italian are used to identify symptomatic individuals derived from Twitter. Using the geo-location attached to each tweet, we map users to a geographic location to produce a time-series of potential symptomatic individuals. We calibrate an extended SEIRD epidemiological model with combinations of low-latency data feeds, including the symptomatic tweets, with death data and infer the parameters of the model. We then evaluate the usefulness of the data feeds when making predictions of daily deaths in 50 US States, 16 Latin American countries, 2 European countries and 7 NHS (National Health Service) regions in the UK. We show that using symptomatic tweets can result in a 6% and 17% increase in mean squared error accuracy, on average, when predicting COVID-19 deaths in US States and the rest of the world, respectively, compared to using solely death data. (2) Origin/destination (O/D) matrices, for movements between seven NHS regions, are constructed by determining when a user has tweeted twice in a 24 h period in two different locations. We show that increasing and decreasing a social connectivity parameter within an SIR model affects the rate of spread of a disease.

Funder

EPSRC

ESRC Centre for Doctoral Training on Quantification and Management of Risk and Uncertainty in Complex Systems Environments

AWE

EPSRC Centre for Doctoral Training in Distributed Algorithms

EPSRC through the Big Hypotheses

Publisher

MDPI AG

Subject

Information Systems

Reference56 articles.

1. (2023, March 03). Coronavirus Disease 2019. Available online: https://www.google.com/search?q=covid-19+cases+worldwide&rlz=1C1CHBF_enGB763GB763&sxsrf=AJOqlzVAHRTMaItK2GPe9r5WtVyiju1d9g%3A1677849490518&ei=kvMBZO6lH4SW8gL377G4Dg&ved=0ahUKEwjutvm27L_9AhUEi1wKHfd3DOcQ4dUDCA8&uact=5&oq=covid-19+cases+worldwide&gs_lcp=Cgxnd3Mtd2l6LXNlcnAQAzIFCAAQgAQyBQgAEIAEMgYIABAWEB4yBggAEBYQHjIGCAAQFhAeMgYIABAWEB4yBggAEBYQHjIGCAAQFhAeMgYIABAWEB4yBggAEBYQHjoKCAAQRxDWBBCwAzoECAAQQ0oECEEYAFDLBFjOEWCFEmgBcAB4AIABWIgB8QSSAQE5mAEAoAEByAEIwAEB&sclient=gws-wiz-serpt.

2. An interactive web-based dashboard to track COVID-19 in real time;Dong;Lancet Infect. Dis.,2020

3. A contribution to the mathematical theory of epidemics;Kermack;Proc. R. Soc. London. Ser. A Contain Pap. Math. Phys. Charact.,1927

4. (2021, October 01). Reproduction Number (R) and Growth Rate: Methodology, Available online: https://www.gov.uk/government/publications/reproduction-number-r-and-growth-rate-methodology/reproduction-number-r-and-growth-rate-methodology.

5. Real-time nowcasting and forecasting of COVID-19 dynamics in England: The first wave;Birrell;Philos. Trans. R. Soc. B,2021

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3