Comparing Forecast Skill-Reference-Cited by-同舟云学术

Comparing Forecast Skill

Published:2014-12-01 Issue:12 Volume:142 Page:4658-4678
ISSN:0027-0644
Container-title:Monthly Weather Review
language:en
Short-container-title:

Author:

DelSole Timothy¹,Tippett Michael K.²

Affiliation:

1. George Mason University, Fairfax, Virginia, and Center for Ocean–Land–Atmosphere Studies, Calverton, Maryland

2. Department of Applied Physics and Applied Mathematics, Columbia University, New York, New York, and Center of Excellence for Climate Change Research, Department of Meteorology, King Abdulaziz University, Jeddah, Saudi Arabia

Abstract

Abstract A basic question in forecasting is whether one prediction system is more skillful than another. Some commonly used statistical significance tests cannot answer this question correctly if the skills are computed on a common period or using a common set of observations, because these tests do not account for correlations between sample skill estimates. Furthermore, the results of these tests are biased toward indicating no difference in skill, a fact that has important consequences for forecast improvement. This paper shows that the magnitude of bias is characterized by a few parameters such as sample size and correlation between forecasts and their errors, which, surprisingly, can be estimated from data. The bias is substantial for typical seasonal forecasts, implying that familiar tests may wrongly judge that differences in seasonal forecast skill are insignificant. Four tests that are appropriate for assessing differences in skill over a common period are reviewed. These tests are based on the sign test, the Wilcoxon signed-rank test, the Morgan–Granger–Newbold test, and a permutation test. These techniques are applied to ENSO hindcasts from the North American Multimodel Ensemble and reveal that the Climate Forecast System, version 2, and the Canadian Climate Model, version 3 (CanCM3), outperform other models in the sense that their squared error is less than that of other single models more frequently. It should be recognized that while certain models may be superior in a certain sense for a particular period and variable, combinations of forecasts are often significantly more skillful than a single model alone. In fact, the multimodel mean significantly outperforms all single models.

Publisher

American Meteorological Society

Subject

Atmospheric Science

Link

http://journals.ametsoc.org/mwr/article-pdf/142/12/4658/4304426/mwr-d-14-00045_1.pdf

Reference30 articles.

1. Predictions of Nino3.4 SST in CFSv1 and CFSv2: A diagnostic comparison;Barnston;Climate Dyn.,2013

2. Skill of real-time seasonal ENSO model predictions during 2002–11: Is our capability increasing?;Barnston;Bull. Amer. Meteor. Soc.,2012

3. Approximately normal tests for equal predictive accuracy in nested models;Clark;J. Econom.,2007

Cited by 59 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Performance‐based evaluation of NMME and C3S models in forecasting the June–August Central African rainfall under the influence of the South Atlantic Ocean Dipole;International Journal of Climatology;2024-04-16

2. A Simple Statistical Postprocessing Scheme for Enhancing the Skill of Seasonal SST Predictions in the Tropics;Monthly Weather Review;2024-04

3. A Relative Sea Surface Temperature Index for Classifying ENSO Events in a Changing Climate;Journal of Climate;2024-02-15

4. Quantification of Long-Range Dependence in Hydroclimatic Time Series: A Method-Comparison Study;Journal of Applied Meteorology and Climatology;2023-12

5. Real-Time ENSO Forecast Skill Evaluated Over the Last Two Decades, with Focus on Onset of ENSO Events;2023-11-15