Affiliation:
1. DAMO Academy Alibaba Group Beijing China
2. Artificial Intelligence Innovation and Incubation Institute Fudan University Shanghai China
Abstract
AbstractThe data‐driven approaches for medium‐range weather forecasting are recently shown to be extraordinarily promising for ensemble forecasting due to their fast inference speed compared to the traditional numerical weather prediction models. However, their forecast accuracy can hardly match the state‐of‐the‐art operational ECMWF Integrated Forecasting System (IFS) model. Previous data‐driven approaches perform ensemble forecasting using some simple perturbation methods, like the initial condition perturbation and the Monte Carlo dropout. However, their ensemble performance is often limited arguably by the sub‐optimal ways of applying perturbation. We propose a Swin Transformer‐based Variational Recurrent Neural Network (SwinVRNN), which is a stochastic weather forecasting model combining a SwinRNN predictor with a perturbation module. SwinRNN is designed as a Swin Transformer‐based recurrent neural network, which predicts the future states deterministically. Furthermore, to model the stochasticity in the prediction, we design a perturbation module following the Variational Auto‐Encoder paradigm to learn the multivariate Gaussian distributions of a time‐variant stochastic latent variable from the data. Ensemble forecasting can be easily performed by perturbing the model features leveraging the noise sampled from the learned distribution. We also compare four categories of perturbation methods for ensemble forecasting, that is, fixed distribution perturbation, learned distribution perturbation, MC dropout, and multi model ensemble. Comparisons on the WeatherBench data set show that the learned distribution perturbation method using our SwinVRNN model achieves remarkably improved forecasting accuracy and reasonable ensemble spread due to the joint optimization of the two targets. More notably, SwinVRNN surpasses operational IFS on the surface variables of the 2‐m temperature and the 6‐hourly total precipitation at all lead times up to 5 days (Code is available at https://github.com/tpys/wwprediction).
Publisher
American Geophysical Union (AGU)
Subject
General Earth and Planetary Sciences,Environmental Chemistry,Global and Planetary Change
Reference26 articles.
1. Babaeizadeh M. Finn C. Erhan D. Campbell R. H. &Levine S.(2017).Stochastic variational video prediction. In International Conference on Learning Representations. arXiv preprint arXiv:1710.11252.https://doi.org/10.48550/arXiv.1710.11252
2. A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction
3. Combining distribution‐based neural networks to predict weather forecast probabilities
4. Dosovitskiy A. Beyer L. Kolesnikov A. Weissenborn D. Zhai X. Unterthiner T. et al. (2020).An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations. arXiv preprint arXiv:2010.11929.https://doi.org/10.48550/arXiv.2010.11929
Cited by
22 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献