Improving the prediction of an atmospheric chemistry transport model using gradient-boosted regression trees

Published:2020-07-13 Issue:13 Volume:20 Page:8063-8082
ISSN:1680-7324
Container-title:Atmospheric Chemistry and Physics
language:en
Short-container-title:Atmos. Chem. Phys.

Author:

Ivatt Peter D., Evans Mathew J.

Abstract

Predictions from process-based models of environmental systems are biased, due to uncertainties in their inputs and parameterizations, reducing their utility. We develop a predictor for the bias in tropospheric ozone (O3, a key pollutant) calculated by an atmospheric chemistry transport model (GEOS-Chem), based on outputs from the model and observations of ozone from both the surface (EPA, EMEP, and GAW) and the ozone-sonde networks. We train a gradient-boosted decision tree algorithm (XGBoost) to predict model bias (model divided by observation), with model and observational data for 2010–2015, and then we test the approach using the years 2016–2017. We show that the bias-corrected model performs considerably better than the uncorrected model. The root-mean-square error is reduced from 16.2 to 7.5 ppb, the normalized mean bias is reduced from 0.28 to −0.04, and Pearson's R is increased from 0.48 to 0.84. Comparisons with observations from the NASA ATom flights (which were not included in the training) also show improvements but to a smaller extent, reducing the root-mean-square error (RMSE) from 12.1 to 10.5 ppb, reducing the normalized mean bias (NMB) from 0.08 to 0.06, and increasing Pearson's R from 0.76 to 0.79. We attribute the smaller improvements to the lack of routine observational constraints for much of the remote troposphere. We show that the method is robust to variations in the volume of training data, with approximately a year of data needed to produce useful performance. Data denial experiments (removing observational sites from the algorithm training) show that information from one location (for example Europe) can reduce the model bias over other locations (for example North America) which might provide insights into the processes controlling the model bias. We explore the choice of predictor (bias prediction versus direct prediction) and conclude both may have utility. We conclude that combining machine learning approaches with process-based models may provide a useful tool for improving these models.
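The evaluation statistics quoted in the abstract (RMSE, NMB, Pearson's R) and the ratio form of the bias can be illustrated with a minimal sketch. This is not the authors' code: the function names are hypothetical, the numbers are made up for illustration, and the predicted ratios stand in for the output of a trained regressor such as XGBoost, which is not shown here. It assumes the standard definitions of the three statistics and that a model value is corrected by dividing it by the predicted model-to-observation ratio.

```python
import math


def rmse(model, obs):
    """Root-mean-square error between model values and observations."""
    return math.sqrt(sum((m - o) ** 2 for m, o in zip(model, obs)) / len(obs))


def nmb(model, obs):
    """Normalized mean bias: sum of (model - obs) divided by sum of obs."""
    return sum(m - o for m, o in zip(model, obs)) / sum(obs)


def pearson_r(x, y):
    """Pearson correlation coefficient, written out with the stdlib only."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def apply_ratio_correction(model, predicted_ratio):
    """Assumption: bias is model / observation, so dividing the model value
    by the predicted ratio gives the bias-corrected estimate."""
    return [m / r for m, r in zip(model, predicted_ratio)]


# Made-up ozone values in ppb, for illustration only.
obs = [30.0, 40.0, 50.0, 35.0]
model = [45.0, 48.0, 70.0, 42.0]
# Hypothetical output of a trained bias predictor (not computed here).
predicted_ratio = [1.4, 1.25, 1.35, 1.2]

corrected = apply_ratio_correction(model, predicted_ratio)

for label, values in (("uncorrected", model), ("corrected", corrected)):
    print(label,
          "RMSE:", round(rmse(values, obs), 2),
          "NMB:", round(nmb(values, obs), 3),
          "R:", round(pearson_r(values, obs), 3))
```

In the study itself the statistics are computed on held-out data (2016–2017, after training on 2010–2015), so any real use of these functions would keep the training and test periods separate in the same way.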

Publisher

Copernicus GmbH

Subject

Atmospheric Science

Link

https://acp.copernicus.org/articles/20/8063/2020/acp-20-8063-2020.pdf

References (59 articles)

1. Anderson, G. J. and Lucas, D. D.: Machine Learning Predictions of a Multiresolution Climate Model Ensemble, Geophys. Res. Lett., 45, 4273–4280, https://doi.org/10.1029/2018gl077049, 2018.

2. Bauer, P., Thorpe, A., and Brunet, G.: The quiet revolution of numerical weather prediction, Nature, 525, 47–55, https://doi.org/10.1038/nature14956, 2015.

3. Bergstra, J. and Bengio, Y.: Random Search for Hyper-Parameter Optimization, J. Mach. Learn. Res., 13, 281–305, 2012.

4. Bey, I., Jacob, D. J., Yantosca, R. M., Logan, J. A., Field, B. D., Fiore, A. M., Li, Q. B., Liu, H. G. Y., Mickley, L. J., and Schultz, M. G.: Global modeling of tropospheric chemistry with assimilated meteorology: Model description and evaluation, J. Geophys. Res.-Atmos., 106, 23073–23095, https://doi.org/10.1029/2001jd000807, 2001.

5. Blockeel, H. and De Raedt, L.: Top-down induction of first-order logical decision trees, Artificial Intelligence, 101, 285–297, https://doi.org/10.1016/s0004-3702(98)00034-4, 1998.

Cited by 41 articles.

1. Investigating the causes and reduction approaches of nocturnal ozone increase events over Tai'an in the North China Plain; Atmospheric Research; 2024-09

2. Regression Forest Approaches to Gravity Wave Parameterization for Climate Projection; Journal of Advances in Modeling Earth Systems; 2024-07

3. Multidecadal ozone trends in China and implications for human health and crop yields: a hybrid approach combining a chemical transport model and machine learning; Atmospheric Chemistry and Physics; 2024-01-11

4. Detecting Thyroid Disease Using Optimized Machine Learning Model Based on Differential Evolution; International Journal of Computational Intelligence Systems; 2024-01-03

5. Improving Air Quality Zoning Through Deep Learning and Hyperlocal Measurements; IEEE Access; 2024