Equitability Revisited: Why the “Equitable Threat Score” Is Not Equitable

Published:2010-04-01 Issue:2 Volume:25 Page:710-726
ISSN:1520-0434
Container-title:Weather and Forecasting
language:en

Author:

Hogan Robin J.¹, Ferro Christopher A. T.², Jolliffe Ian T.², Stephenson David B.²

Affiliation:

1. Department of Meteorology, University of Reading, Reading, United Kingdom

2. School of Engineering, Mathematics and Physical Sciences, University of Exeter, Exeter, United Kingdom

Abstract

In the forecasting of binary events, verification measures that are “equitable” were defined by Gandin and Murphy to satisfy two requirements: 1) they award all random forecasting systems, including those that always issue the same forecast, the same expected score (typically zero), and 2) they are expressible as the linear weighted sum of the elements of the contingency table, where the weights are independent of the entries in the table, apart from the base rate. The authors demonstrate that the widely used “equitable threat score” (ETS), as well as numerous others, satisfies neither of these requirements and only satisfies the first requirement in the limit of an infinite sample size. Such measures are referred to as “asymptotically equitable.” In the case of ETS, the expected score of a random forecasting system is always positive and only falls below 0.01 when the number of samples is greater than around 30. Two other asymptotically equitable measures are the odds ratio skill score and the symmetric extreme dependency score, which are more strongly inequitable than ETS, particularly for rare events; for example, when the base rate is 2% and the sample size is 1000, random but unbiased forecasting systems yield an expected score of around −0.5, reducing in magnitude to −0.01 or smaller only for sample sizes exceeding 25 000. This presents a problem since these nonlinear measures have other desirable properties, in particular being reliable indicators of skill for rare events (provided that the sample size is large enough). A potential way to reconcile these properties with equitability is to recognize that Gandin and Murphy’s two requirements are independent, and the second can be safely discarded without losing the key advantages of equitability that are embodied in the first. This enables inequitable and asymptotically equitable measures to be scaled to make them equitable, while retaining their nonlinearity and other properties such as being reliable indicators of skill for rare events. It also opens up the possibility of designing new equitable verification measures.
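The claim that a random forecasting system earns a positive expected ETS at finite sample size can be checked numerically. The sketch below is illustrative only and is not taken from the paper: it uses the standard ETS definition for a 2×2 contingency table (a hits, b false alarms, c misses, d correct negatives) and averages the score over every possible table of size n, weighted by its multinomial probability when forecasts are independent of observations. The function names, the choice of base rate and forecast rate, and the decision to let undefined (0/0) tables contribute zero are all assumptions made here for illustration.

```python
from math import comb


def ets(a, b, c, d):
    """Equitable threat score for a 2x2 table.

    a = hits, b = false alarms, c = misses, d = correct negatives.
    Returns None when the score is undefined (0/0).
    """
    n = a + b + c + d
    # Integer check for a zero denominator: n*(a+b+c) == (a+b)*(a+c)
    if n * (a + b + c) == (a + b) * (a + c):
        return None
    a_r = (a + b) * (a + c) / n  # hits expected by chance given the marginals
    return (a - a_r) / (a + b + c - a_r)


def expected_random_ets(n, base_rate, forecast_rate):
    """Exact expected ETS of a random forecaster for sample size n.

    Forecasts are drawn independently of observations, so the table is
    multinomial with the four cell probabilities below.
    Assumption (not from the paper): undefined tables contribute zero.
    """
    p, q = base_rate, forecast_rate
    pa, pb, pc, pd = p * q, (1 - p) * q, p * (1 - q), (1 - p) * (1 - q)
    total = 0.0
    for a in range(n + 1):
        for b in range(n + 1 - a):
            for c in range(n + 1 - a - b):
                d = n - a - b - c
                score = ets(a, b, c, d)
                if score is None:
                    continue
                ways = comb(n, a) * comb(n - a, b) * comb(n - a - b, c)
                total += ways * pa**a * pb**b * pc**c * pd**d * score
    return total


if __name__ == "__main__":
    for n in (10, 20, 40):
        print(n, expected_random_ets(n, 0.5, 0.5))
```

Running the script prints the expected score for a few sample sizes at a base rate and forecast rate of 0.5; the values should be positive and shrink as n grows, which is the behaviour the abstract describes as asymptotic equitability. Other base rates, or a different treatment of undefined tables, will change the numbers.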

Publisher

American Meteorological Society

Subject

Atmospheric Science

Link

http://journals.ametsoc.org/waf/article-pdf/25/2/710/4019354/2009waf2222350_1.pdf

Cited by 109 articles.

1. Exploring the ability of regional extrapolation for precipitation nowcasting with deep learning;Meteorologische Zeitschrift;2024-08-13

2. Re(de)fining degree-heating week: coral bleaching variability necessitates regional and temporal optimization of global forecast model stress metrics;Coral Reefs;2024-06-12

3. DEUCE v1.0: a neural network for probabilistic precipitation nowcasting with aleatoric and epistemic uncertainties;Geoscientific Model Development;2024-05-14

4. Real-time flood forecasting using satellite precipitation product and machine learning approach in Bagmati river basin, India;Acta Geophysica;2024-04-07

5. Improving the Near-Surface Wind and Turbulence at the Edge of the Orographic Drag Gray Zone by Tuning the Roughness Length;Monthly Weather Review;2024-02