Informed Random Forest to Model Associations of Epidemiological Priors, Government Policies, and Public Mobility

Author:

Thapelo Tsaone Swaabow1ORCID,Mpoeleng Dimane2,Hillhouse Gregory3

Affiliation:

1. Department of Computer Science and Information Systems, Botswana International University of Science and Technology, Palapye, Botswana

2. Director (Ag.) Research Innovation Technology, Research Development and Innovation, Department of Computer Science and Information Systems, Botswana International University of Science and Technology, Palapye, Botswana

3. Head of the Department of Physics and Astronomy, Botswana International University of Science and Technology, Palapye, Botswana

Abstract

Background. Infectious diseases constitute a significant concern worldwide due to their increasing prevalence, associated health risks, and the socioeconomic costs. Machine learning (ML) models and epidemic models formulated using deterministic differential equations are the most dominant tools for analyzing and modeling the transmission of infectious diseases. However, ML models can be inconsistent in extracting the dynamics of a disease in the presence of data drifts. Likewise, the capability of epidemic models is constrained to parameter dimensions and estimation. We aimed at creating a framework of informed ML that integrates a random forest (RF) with an adapted susceptible infectious recovered (SIR) model to account for accuracy and consistency in stochasticity within the dynamics of coronavirus disease 2019 (COVID-19). Methods. An adapted SIR model was used to inform a default RF on predicting new COVID-19 cases (NCCs) at given intervals. We validated the performance of the informed RF (IRF) using real data. We used Botswana’s pharmaceutical interventions (PIs) and non-PIs (NPIs) adopted between February 2020 and August 2022. The discrepancy between predictions and observations is modeled using loss functions, which are minimized, interpreted, and used to assess the IRF. Results. The findings on the real data have revealed the effectiveness of the default RF in modeling and predicting NCCs. The use of the effective reproductive rate to inform the RF yielded an excellent predictive power (84%) compared with 75% by the default RF. Conclusion. This research has potential to inform policy and decision makers in developing systems to evaluate interventions for infectious diseases. Highlights This framework is initiated by incorporating model outputs from an epidemic model to a machine learning model. An informed random forest (RF) is instantiated to model government and public responses to the COVID-19 pandemic. This framework does not require data transformations, and the epidemic model is shown to boost the RF’s performance. This is a baseline knowledge-informed learning framework for assessing public health interventions in Botswana.

Publisher

SAGE Publications

Subject

Public Health, Environmental and Occupational Health,Health Policy

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3