A machine learning methodology for the generation of a parameterization of the hydroxyl radical
-
Published:2022-08-17
Issue:16
Volume:15
Page:6341-6358
-
ISSN:1991-9603
-
Container-title:Geoscientific Model Development
-
language:en
-
Short-container-title:Geosci. Model Dev.
Author:
Anderson Daniel C.ORCID, Follette-Cook Melanie B., Strode Sarah A.ORCID, Nicely Julie M.ORCID, Liu Junhua, Ivatt Peter D., Duncan Bryan N.
Abstract
Abstract. We present a methodology that uses gradient-boosted regression trees (a
machine learning technique) and a full-chemistry simulation (i.e., training
dataset) from a chemistry–climate model (CCM) to efficiently generate a
parameterization of tropospheric hydroxyl radical (OH) that is a function of
chemical, dynamical, and solar irradiance variables. This surrogate model of
OH is designed to be integrated into a CCM and allow for
computationally efficient simulation of nonlinear feedbacks between OH and
tropospheric constituents that have loss by reaction with OH as their
primary sinks (e.g., carbon monoxide (CO), methane (CH4), volatile
organic compounds (VOCs)). Such a model framework is advantageous for
studies that require multi-decadal simulations of CH4 or multi-year
sensitivity simulations to understand the causes of trends and variations of
CO and CH4. To allow the user to easily target the training dataset
towards a desired application, we are outlining a methodology to generate a
parameterization of OH and not presenting an “off-the-shelf” version of a
parameterization to be incorporated into a CCM. This provides for the
relatively easy creation of a new parameterization in response to, for
example, changes in research goals or the underlying CCM chemistry and/or
dynamics schemes. We show that a sample parameterization of OH generated
from a CCM simulation is able to reproduce OH concentrations with a
normalized root-mean-square error of approximately 5 % and
capture the global mean methane lifetime within approximately 1 %. Our
calculated accuracy of the parameterization assumes inputs being within the
bounds of the training dataset. Large excursions from these bounds will
likely decrease the overall accuracy. However, we show that the sample
parameterization predicts large deviations in OH for an El Niño event
that was not part of the training dataset and that the spatial distribution
and strength of these deviations are consistent with the event. This result
gives confidence in the fidelity of a parameterization developed with our
methodology to simulate the spatial and temporal responses of OH to
perturbations from large variations in the chemical, dynamical, and solar
irradiance drivers of OH. In addition, we discuss how two machine learning
metrics, Gain feature importance and Shapley
additive explanations values, indicate that the behavior
of a parameterization of OH generally accords with our understanding of OH
chemistry, even though there are no physics- or chemistry-based constraints
on the parameterization.
Funder
National Aeronautics and Space Administration
Publisher
Copernicus GmbH
Reference66 articles.
1. Anderson, D. C., Duncan, B. N., Fiore, A. M., Baublitz, C. B., Follette-Cook, M. B., Nicely, J. M., and Wolfe, G. M.: Spatial and temporal variability in the hydroxyl (OH) radical: understanding the role of large-scale climate features and their influence on OH through its dynamical and photochemical drivers, Atmos. Chem. Phys., 21, 6481–6508, https://doi.org/10.5194/acp-21-6481-2021, 2021. 2. Anderson, D. C., Follette-Cook, M. B., Strode, S. A., Nicely, J. M., Liu, J., Ivatt, P. D., and Duncan, B. N.: Code for the development of a
parameterization of OH for CCMs, Zenodo [code],
https://doi.org/10.5281/zenodo.6046037, 2022a. 3. Anderson, D. C., Follette-Cook, M. B., Strode, S. A., Nicely, J .M., Liu, J., Ivatt, P. D., and Duncan, B. N.: Sample ECCOH OH Parameterization
(1.0), Zenodo [code and data set], https://doi.org/10.5281/zenodo.6604130, 2022b. 4. CEDA Archive: CCMI-1 Data Archive, CEDA Archive [data set], http://data.ceda.ac.uk/badc/wcrp-ccmi/data/CCMI-1/output, last access: 4 Aug. 2022. 5. Chen, T. and Guestrin, C.: XGBoost: A Scalable Tree Boosting System, KDD
'16: Proceedings of the 22nd ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016, 785–794, https://doi.org/10.1145/2939672.2939785, 2016.
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|