Scoring epidemiological forecasts on transformed scales

Author:

Bosse Nikos I., Abbott Sam, Cori Anne, van Leeuwen Edwin, Bracher Johannes, Funk Sebastian

Abstract

Forecast evaluation is essential for the development of predictive epidemic models and can inform their use for public health decision-making. Common scores to evaluate epidemiological forecasts are the Continuous Ranked Probability Score (CRPS) and the Weighted Interval Score (WIS), which can be seen as measures of the absolute distance between the forecast distribution and the observation. However, applying these scores directly to predicted and observed incidence counts may not be the most appropriate choice, due to the exponential nature of epidemic processes and the varying magnitudes of observed values across space and time. In this paper, we argue that transforming counts before applying scores such as the CRPS or WIS can effectively mitigate these difficulties and yield epidemiologically meaningful and easily interpretable results. Using the CRPS on log-transformed values as an example, we list three attractive properties. Firstly, it can be interpreted as a probabilistic version of a relative error. Secondly, it reflects how well models predicted the time-varying epidemic growth rate. Lastly, using arguments on variance-stabilizing transformations, it can be shown that under the assumption of a quadratic mean-variance relationship, the logarithmic transformation leads to expected CRPS values which are independent of the order of magnitude of the predicted quantity. Applying a transformation of log(x + 1) to data and forecasts from the European COVID-19 Forecast Hub, we find that it changes model rankings regardless of stratification by forecast date, location or target types. When scoring transformed forecasts as opposed to untransformed ones, situations in which models missed the beginning of upward swings are more strongly emphasised, while failing to predict a downturn following a peak is less severely penalised. We conclude that appropriate transformations, of which the natural logarithm is only one particularly attractive option, should be considered when assessing the performance of different models in the context of infectious disease incidence.
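
The idea of scoring on a transformed scale can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' implementation: it assumes forecasts are available as predictive samples, uses the standard sample-based CRPS estimate E|X - y| - 0.5 E|X - X'|, and invents two hypothetical locations whose forecasts have the same shape but differ 100-fold in magnitude. The function names and all numbers are assumptions made for the example.

```python
import numpy as np


def crps_from_samples(samples, observation):
    """Sample-based CRPS estimate: E|X - y| - 0.5 * E|X - X'|."""
    samples = np.asarray(samples, dtype=float)
    # Average absolute distance between forecast samples and the observation
    term_obs = np.mean(np.abs(samples - observation))
    # Half the average absolute distance between all pairs of samples
    term_spread = 0.5 * np.mean(np.abs(samples[:, None] - samples[None, :]))
    return term_obs - term_spread


def crps_log1p(samples, observation):
    """CRPS after applying log(x + 1) to both the samples and the observation."""
    return crps_from_samples(np.log1p(samples), np.log1p(observation))


# Hypothetical data: same relative forecast error, magnitudes 100-fold apart
rng = np.random.default_rng(42)
small_samples = rng.lognormal(mean=np.log(100), sigma=0.3, size=1000)
large_samples = small_samples * 100
small_obs, large_obs = 130.0, 13000.0

natural_small = crps_from_samples(small_samples, small_obs)
natural_large = crps_from_samples(large_samples, large_obs)
log_small = crps_log1p(small_samples, small_obs)
log_large = crps_log1p(large_samples, large_obs)

print(f"natural scale: small={natural_small:.2f}, large={natural_large:.2f}")
print(f"log(x + 1) scale: small={log_small:.3f}, large={log_large:.3f}")
```

On the natural scale the score of the larger location is 100 times that of the smaller one, so it would dominate any average across locations. After the log(x + 1) transformation the two scores are nearly equal, because the relative forecast error is the same in both cases. For a point forecast the CRPS reduces to the absolute error, so on the log scale it becomes the absolute log ratio of prediction to observation, which is one way to read the "relative error" interpretation mentioned in the abstract.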

Publisher

Cold Spring Harbor Laboratory

