Understanding COVID-19 Impacts on the Health Workforce: AI-Assisted Open-Source Media Content Analysis (Preprint)

Author:

Pienkowska AnitaORCID,Ravaut MathieuORCID,Mammadova MaleykaORCID,Ang Chin-SiangORCID,Wang HanyuORCID,Ong Qi ChwenORCID,Bojic IvaORCID,Qin Vicky MengqiORCID,Sumsuzzman Dewan MdORCID,Ajuebor OnyemaORCID,Boniol MathieuORCID,Bustamante Juana PaolaORCID,Campbell JamesORCID,Cometto GiorgioORCID,Fitzpatrick SiobhanORCID,Kane CatherineORCID,Joty ShafiqORCID,Car JosipORCID

Abstract

BACKGROUND

To investigate the impacts of the COVID-19 pandemic on the health workforce, we aimed to develop a framework that synergizes natural language processing (NLP) techniques and human-generated analysis to reduce, organize, classify, and analyze a vast volume of publicly available news articles to complement scientific literature and support strategic policy dialogue, advocacy, and decision-making.

OBJECTIVE

This study aimed to explore the possibility of systematically scanning intelligence from media that are usually not captured or best gathered through structured academic channels and inform on the impacts of the COVID-19 pandemic on the health workforce, contributing factors to the pervasiveness of the impacts, and policy responses, as depicted in publicly available news articles. Our focus was to investigate the impacts of the COVID-19 pandemic and, concurrently, assess the feasibility of gathering health workforce insights from open sources rapidly.

METHODS

We conducted an NLP-assisted media content analysis of open-source news coverage on the COVID-19 pandemic published between January 2020 and June 2022. A data set of 3,299,158 English news articles on the COVID-19 pandemic was extracted from the World Health Organization Epidemic Intelligence through Open Sources (EIOS) system. The data preparation phase included developing rules-based classification, fine-tuning an NLP summarization model, and further data processing. Following relevancy evaluation, a deductive-inductive approach was used for the analysis of the summarizations. This included data extraction, inductive coding, and theme grouping.

RESULTS

After processing and classifying the initial data set comprising 3,299,158 news articles and reports, a data set of 5131 articles with 3,007,693 words was devised. The NLP summarization model allowed for a reduction in the length of each article resulting in 496,209 words that facilitated agile analysis performed by humans. Media content analysis yielded results in 3 sections: areas of COVID-19 impacts and their pervasiveness, contributing factors to COVID-19–related impacts, and responses to the impacts. The results suggest that insufficient remuneration and compensation packages have been key disruptors for the health workforce during the COVID-19 pandemic, leading to industrial actions and mental health burdens. Shortages of personal protective equipment and occupational risks have increased infection and death risks, particularly at the pandemic’s onset. Workload and staff shortages became a growing disruption as the pandemic progressed.

CONCLUSIONS

This study demonstrates the capacity of artificial intelligence–assisted media content analysis applied to open-source news articles and reports concerning the health workforce. Adequate remuneration packages and personal protective equipment supplies should be prioritized as preventive measures to reduce the initial impact of future pandemics on the health workforce. Interventions aimed at lessening the emotional toll and workload need to be formulated as a part of reactive measures, enhancing the efficiency and maintainability of health delivery during a pandemic.

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3