ClimateNet: an expert-labeled open dataset and deep learning architecture for enabling high-precision analyses of extreme weather

Published: 2021-01-08 Issue: 1 Volume: 14 Page: 107-124
ISSN: 1991-9603
Container-title: Geoscientific Model Development
Language: en
Short-container-title: Geosci. Model Dev.

Author:

Kashinath Karthik, Mudigonda Mayur, Kim Sol, Kapp-Schwoerer Lukas, Graubner Andre, Karaismailoglu Ege, von Kleist Leo, Kurth Thorsten, Greiner Annette, Mahesh Ankur, Yang Kevin, Lewis Colby, Chen Jiayi, Lou Andrew, Chandran Sathyavat, Toms Ben, Chapman Will, Dagon Katherine, Shields Christine A., O'Brien Travis, Wehner Michael, Collins William

Abstract

Identifying, detecting, and localizing extreme weather events is a crucial first step in understanding how they may vary under different climate change scenarios. Pattern recognition tasks such as classification, object detection, and segmentation (i.e., pixel-level classification) have remained challenging problems in the weather and climate sciences. While there exist many empirical heuristics for detecting extreme events, the disparities between the output of these different methods even for a single event are large and often difficult to reconcile. Given the success of deep learning (DL) in tackling similar problems in computer vision, we advocate a DL-based approach. DL, however, works best in the context of supervised learning – when labeled datasets are readily available. Reliable labeled training data for extreme weather and climate events is scarce. We create “ClimateNet” – an open, community-sourced human-expert-labeled curated dataset that captures tropical cyclones (TCs) and atmospheric rivers (ARs) in high-resolution climate model output from a simulation of a recent historical period. We use the curated ClimateNet dataset to train a state-of-the-art DL model for pixel-level identification – i.e., segmentation – of TCs and ARs. We then apply the trained DL model to historical and climate change scenarios simulated by the Community Atmosphere Model (CAM5.1) and show that the DL model accurately segments the data into TCs, ARs, or “the background” at a pixel level. Further, we show how the segmentation results can be used to conduct spatially and temporally precise analytics by quantifying distributions of extreme precipitation conditioned on event types (TC or AR) at regional scales. The key contribution of this work is that it paves the way for DL-based automated, high-fidelity, and highly precise analytics of climate data using a curated expert-labeled dataset – ClimateNet.
ClimateNet and the DL-based segmentation method provide several unique capabilities: (i) they can be used to calculate a variety of TC and AR statistics at a fine-grained level; (ii) they can be applied to different climate scenarios and different datasets without tuning as they do not rely on threshold conditions; and (iii) the proposed DL method is suitable for rapidly analyzing large amounts of climate model output. While our study has been conducted for two important extreme weather patterns (TCs and ARs) in simulation datasets, we believe that this methodology can be applied to a much broader class of patterns and applied to observational and reanalysis data products via transfer learning.
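
The conditioning step mentioned in the abstract (precipitation statistics per event type within a region) can be illustrated with a minimal sketch. This is not the authors' code: the label encoding (0 = background, 1 = TC, 2 = AR), the rectangular index-space region, the helper names, and the toy numbers are all assumptions made for illustration.

```python
import math

# Hypothetical label encoding for a per-pixel segmentation mask.
BACKGROUND, TC, AR = 0, 1, 2


def precip_by_event(labels, precip, region):
    """Group precipitation values by event class inside a rectangular region.

    labels, precip: 2-D lists of equal shape (rows = latitude index,
    columns = longitude index).
    region: (row_start, row_stop, col_start, col_stop), stop indices exclusive.
    """
    r0, r1, c0, c1 = region
    samples = {BACKGROUND: [], TC: [], AR: []}
    for i in range(r0, r1):
        for j in range(c0, c1):
            samples[labels[i][j]].append(precip[i][j])
    return samples


def percentile(values, q):
    """Nearest-rank percentile (q in 0..100) of a non-empty list."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]


# Toy 3x4 grid: a segmentation mask and a matching precipitation field.
labels = [
    [0, 0, 1, 1],
    [0, 2, 1, 1],
    [2, 2, 0, 0],
]
precip = [
    [0.1, 0.0, 30.0, 45.0],
    [0.2, 12.0, 50.0, 20.0],
    [8.0, 15.0, 0.5, 0.0],
]

samples = precip_by_event(labels, precip, region=(0, 3, 0, 4))
tc_p95 = percentile(samples[TC], 95)
ar_p95 = percentile(samples[AR], 95)
print("TC 95th percentile:", tc_p95)
print("AR 95th percentile:", ar_p95)
```

Because the grouping uses only the mask, no precipitation or wind threshold enters the calculation, which reflects the abstract's point that the approach does not rely on threshold conditions. A real analysis would operate on gridded model output and would likely weight grid cells by area.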

Funder

Office of Science

Lawrence Berkeley National Laboratory

National Center for Atmospheric Research

Publisher

Copernicus GmbH

Link

https://gmd.copernicus.org/articles/14/107/2021/gmd-14-107-2021.pdf

References: 53 articles.

1. Allen, M. and Ingram, W.: Constraints on Future Changes in Climate and the Hydrologic Cycle, Nature, 419, 224–32, https://doi.org/10.1038/nature01092, 2002.

2. Bonfanti, C., Stewart, J., Maksimovic, S., Hall, D., Govett, M., Trailovic, L., and Jankov, I.: Detecting Extratropical and Tropical Cyclone Regions of Interest (ROI) in Satellite Data using Deep Learning, available at: https://ui.adsabs.harvard.edu/abs/2018AGUFM.H31H1992B/abstract (last access: 14 December 2020), 2018a.

3. Bonfanti, C., Trailovic, L., Stewart, J., and Govett, M.: Machine Learning: Defining Worldwide Cyclone Labels for Training, 2018 21st International Conference on Information Fusion (FUSION), IEEE, https://doi.org/10.23919/ICIF.2018.8455276, 2018b.

4. Brenowitz, N. D. and Bretherton, C. S.: Prognostic validation of a neural network unified physics parameterization, Geophys. Res. Lett., 45, 6289–6298, 2018.

5. Chapman, W., Subramanian, A., Delle Monache, L., Xie, S., and Ralph, F.: Improving Atmospheric River Forecasts With Machine Learning, Geophys. Res. Lett., 46, 10627–10635, 2019.

Cited by 53 articles.

1. Machine learning–based extreme event attribution; Science Advances; 2024-08-23

2. Pushing the frontiers in climate modelling and analysis with machine learning; Nature Climate Change; 2024-08-23

3. Curating AI-Ready Datasets for Equity and Environmental Justice: A Data-Centric AI Case Study; IGARSS 2024 - 2024 IEEE International Geoscience and Remote Sensing Symposium; 2024-07-07

4. Increasing the Reproducibility and Replicability of Supervised AI/ML in the Earth Systems Science by Leveraging Social Science Methods; Earth and Space Science; 2024-07

5. Understanding the Low Predictability of the 2015/16 El Niño Event Based on a Deep Learning Model; Advances in Atmospheric Sciences; 2024-06-01