Ensemble learning methods of inference for spatially stratified infectious disease systems
Author:
Peitsch Jeffrey1, Pokharel Gyanendra2, Hossain Shakhawat2
Affiliation:
1. Department of Mathematics and Statistics , 2129 University of Calgary , Calgary , AB , Canada 2. Department of Mathematics and Statistics , 8665 University of Winnipeg , Winnipeg , MB , Canada
Abstract
Abstract
Individual level models are a class of mechanistic models that are widely used to infer infectious disease transmission dynamics. These models incorporate individual level covariate information accounting for population heterogeneity and are generally fitted in a Bayesian Markov chain Monte Carlo (MCMC) framework. However, Bayesian MCMC methods of inference are computationally expensive for large data sets. This issue becomes more severe when applied to infectious disease data collected from spatially heterogeneous populations, as the number of covariates increases. In addition, summary statistics over the global population may not capture the true spatio-temporal dynamics of disease transmission. In this study we propose to use ensemble learning methods to predict epidemic generating models instead of time consuming Bayesian MCMC method. We apply these methods to infer disease transmission dynamics over spatially clustered populations, considering the clusters as natural strata instead of a global population. We compare the performance of two tree-based ensemble learning techniques: random forest and gradient boosting. These methods are applied to the 2001 foot-and-mouth disease epidemic in the U.K. and evaluated using simulated data from a clustered population. It is shown that the spatially clustered data can help to predict epidemic generating models more accurately than the global data.
Publisher
Walter de Gruyter GmbH
Reference23 articles.
1. Nuzzo, J, Moss, B, Watson, C, Rutkow, L, Garibaldi, B, Blauer, B, et al.. Johns hopkins coronavirus resource center covid-19 dashboard. Baltimore, Marylands, USA: School of Medicine, Johns Hopkins University; 2022. 2. Chis Ster, I, Ferguson, N. Transmission parameters of the 2001 foot and mouth epidemic in great britain. PLoS One 2007;2:e502. https://doi.org/10.1371/journal.pone.0000502. 3. Deardon, R, Brooks, S, Grenfell, T, Keeling, M, Tildesley, M, Savill, N, et al.. Inference for individual-level models of infectious diseases in large populations. Stat Sin 2010;20:239–61. 4. Kwong, GPS, Deardon, R. Linearized forms of individual-level models for large-scale spatial infectious disease systems. Bull Math Biol 2012;74:1912–37. https://doi.org/10.1007/s11538-012-9739-8. 5. Pokharel, G, Deardon, R. Gaussian process emulators for spatial individual-level models of infectious disease. Can J Stat 2016;44:480–501. https://doi.org/10.1002/cjs.11304.
|
|