Physical activity, sedentary behaviour, and sleep on Twitter: A labelled dataset for public health research

Author:

Hossein Abad Zahra ShakeriORCID,Butler Gregory P.,Thompson Wendy,Lee JoonORCID

Abstract

ABSTRACTAdvances in automated data processing, together with the unprecedented growth in user-generated social media (SM) content, have made public health surveillance (PHS) one of the long-lasting SM applications. However, the existing PHS systems feeding on SM data have not been widely deployed in national surveillance systems, which appears to stem from the lack of practitioners’ trust in SM data. More robust datasets over which machine learning (ML) models can be trained/tested reliably is a significant step toward overcoming this hurdle. The health implications of physical activity, sedentary behaviour, and sleep (PASS) are widely studied through traditional data sources, which are often out-of-date, costly to collect, and thus limited in quantity and coverage. We present LPHEADA, a multicountry and fully Labelled digital Public HEAlth DAtaset of tweets originated in Australia/Canada/United Kingdom/United States between November 2018-June 2020. LPHEADA contains 366,405 labels for 122,135 PASS-related tweets and provides details about the place/time/demographics associated with each tweet. LPHEADA is publicly available and can be utilized to develop (un)supervised ML models for digital PASS surveillance.

Publisher

Cold Spring Harbor Laboratory

Reference40 articles.

1. Kemp, S. Digital 2020: July global statshot. DATAREPORTAL. Available online: https://datareportal.com/reports/digital-2020-july-global-statshot (accessed on 8 January 2021) (2020).

2. The use of social media in public health surveillance;West. Pac. surveillance response journal: WPSAR,2015

3. Digital disease detection—harnessing the web for public health surveillance;The New Engl. journal medicine,2009

4. Twitter as a tool for health research: a systematic review;Am. journal public health,2017

5. Social medicine: Twitter in healthcare;J. clinical medicine,2018

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3