Accuracy in the prediction of disease epidemics when ensembling simple but highly correlated models

Published: 2021-03-15 Issue: 3 Volume: 17 Page: e1008831
ISSN: 1553-7358
Container-title: PLOS Computational Biology
Language: en
Short-container-title: PLoS Comput Biol

Author:

Shah Denis A., De Wolf Erick D., Paul Pierce A., Madden Laurence V.

Abstract

Ensembling combines the predictions made by individual component base models with the goal of achieving a predictive accuracy that is better than that of any one of the constituent member models. Diversity among the base models in terms of predictions is a crucial criterion in ensembling. However, there are practical instances when the available base models produce highly correlated predictions, because they may have been developed within the same research group or may have been built from the same underlying algorithm. We investigated, via a case study on Fusarium head blight (FHB) on wheat in the U.S., whether ensembles of simple yet highly correlated models for predicting the risk of FHB epidemics, all generated from logistic regression, provided any benefit to predictive performance, despite relatively low levels of base model diversity. Three ensembling methods were explored: soft voting, weighted averaging of smaller subsets of the base models, and penalized regression as a stacking algorithm. Soft voting and weighted model averages were generally better at classification than the base models, though not universally so. The performances of stacked regressions were superior to those of the other two ensembling methods we analyzed in this study. Ensembling simple yet correlated models is computationally feasible and is therefore worth pursuing for models of epidemic risk.
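The three ensembling methods named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the toy probabilities, and the observed labels are all hypothetical, and the stacking step assumes a ridge (L2) penalty fitted by plain gradient descent, whereas the abstract says only "penalized regression".

```python
import math


def soft_vote(prob_lists):
    """Equal-weight mean of the base models' predicted probabilities."""
    n_models = len(prob_lists)
    return [sum(col) / n_models for col in zip(*prob_lists)]


def weighted_average(prob_lists, weights):
    """Weighted mean of predicted probabilities; weights are rescaled to sum to 1."""
    total = sum(weights)
    return [sum(w * p for w, p in zip(weights, col)) / total
            for col in zip(*prob_lists)]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_stacker(prob_lists, labels, l2=0.1, lr=0.5, n_iter=2000):
    """Ridge-penalized logistic regression on the base-model probabilities,
    fitted by gradient descent (assumed penalty; see the note above).
    Returns (intercept, coefficients)."""
    rows = list(zip(*prob_lists))
    n, k = len(rows), len(prob_lists)
    intercept, coef = 0.0, [0.0] * k
    for _ in range(n_iter):
        grad_b, grad_c = 0.0, [0.0] * k
        for x, y in zip(rows, labels):
            err = sigmoid(intercept + sum(c * xi for c, xi in zip(coef, x))) - y
            grad_b += err
            for j in range(k):
                grad_c[j] += err * x[j]
        intercept -= lr * grad_b / n
        # The L2 term shrinks the coefficients, which stabilizes the fit
        # when the base-model predictions are highly correlated.
        coef = [c - lr * (g / n + l2 * c) for c, g in zip(coef, grad_c)]
    return intercept, coef


def predict_stacker(model, prob_lists):
    """Apply a fitted stacker to base-model probabilities."""
    intercept, coef = model
    return [sigmoid(intercept + sum(c * xi for c, xi in zip(coef, x)))
            for x in zip(*prob_lists)]


# Hypothetical epidemic probabilities from three highly correlated base models
base_probs = [
    [0.10, 0.30, 0.70, 0.85, 0.20, 0.90],
    [0.15, 0.35, 0.65, 0.80, 0.25, 0.95],
    [0.05, 0.40, 0.60, 0.90, 0.30, 0.85],
]
observed = [0, 0, 1, 1, 0, 1]  # hypothetical outcomes: 1 = epidemic

voted = soft_vote(base_probs)
weighted = weighted_average(base_probs, [0.5, 0.3, 0.2])
stacker = fit_stacker(base_probs, observed)
stacked = predict_stacker(stacker, base_probs)
```

In practice the stacker would be fitted on out-of-sample base-model predictions (for example, from cross-validation) rather than on the data used to fit the base models, so that it does not simply reward overfitted components.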

Funder

U.S. Wheat & Barley Scab Initiative

Publisher

Public Library of Science (PLoS)

Subject

Computational Theory and Mathematics; Cellular and Molecular Neuroscience; Genetics; Molecular Biology; Ecology; Modelling and Simulation; Ecology, Evolution, Behavior and Systematics

References: 72 articles.

1. Ensemble-based classifiers;L. Rokach;Artificial Intelligence Review,2010

2. Ensemble Methods

3. Individual model forecasts can be misleading, but together they are useful;CO Buckee;Eur J Epidemiol,2020

4. Prediction of infectious disease epidemics via weighted density ensembles;EL Ray;PLoS Comput Biol,2018

5. Accuracy of real-time multi-model ensemble forecasts for seasonal influenza in the U.S.;NG Reich;PLoS Comput Biol,2019

Cited by 13 articles.

1. Predicting plant disease epidemics using boosted regression trees;Infectious Disease Modelling;2024-12

2. Risk quantification as an epidemiological analysis strategy: Analysis and application to bud rot in oil palm;Plant Pathology;2024-05-30

3. Weather-based models for forecasting Fusarium head blight risks in wheat and barley: A review;Plant Pathology;2023-12-02

4. Into the Trees: Random Forests for Predicting Fusarium Head Blight Epidemics of Wheat in the United States;Phytopathology®;2023-08

5. Effects of climate change on the distribution of Fusarium spp. in Italy;Science of The Total Environment;2023-07