KGML-ag: a modeling framework of knowledge-guided machine learning to simulate agroecosystems: a case study of estimating N<sub>2</sub>O emission using data from mesocosm experiments
-
Published:2022-04-07
Issue:7
Volume:15
Page:2839-2858
-
ISSN:1991-9603
-
Container-title:Geoscientific Model Development
-
language:en
-
Short-container-title:Geosci. Model Dev.
Author:
Liu Licheng, Xu Shaoming, Tang Jinyun, Guan Kaiyu, Griffis Timothy J., Erickson Matthew D., Frie Alexander L., Jia Xiaowei, Kim TaegonORCID, Miller Lee T., Peng Bin, Wu Shaowei, Yang Yufeng, Zhou Wang, Kumar Vipin, Jin Zhenong
Abstract
Abstract. Agricultural nitrous oxide (N2O) emission accounts for a non-trivial
fraction of global greenhouse gas (GHG) budget. To date, estimating
N2O fluxes from cropland remains a challenging task because the related
microbial processes (e.g., nitrification and denitrification) are controlled
by complex interactions among climate, soil, plant and human activities.
Existing approaches such as process-based (PB) models have well-known
limitations due to insufficient representations of the processes or
uncertainties of model parameters, and due to leverage recent advances in
machine learning (ML) a new method is needed to unlock the “black box” to
overcome its limitations such as low interpretability, out-of-sample failure
and massive data demand. In this study, we developed a first-of-its-kind
knowledge-guided machine learning model for agroecosystems (KGML-ag) by
incorporating biogeophysical and chemical domain knowledge from an advanced PB
model, ecosys, and tested it by comparing simulating daily N2O fluxes with
real observed data from mesocosm experiments. The gated recurrent unit (GRU)
was used as the basis to build the model structure. To optimize the model
performance, we have investigated a range of ideas, including (1) using
initial values of intermediate variables (IMVs) instead of time series as
model input to reduce data demand; (2) building hierarchical structures to
explicitly estimate IMVs for further N2O prediction; (3) using multi-task
learning to balance the simultaneous training on multiple variables; and (4)
pre-training with millions of synthetic data generated from ecosys and fine-tuning
with mesocosm observations. Six other pure ML models were developed using
the same mesocosm data to serve as the benchmark for the KGML-ag model.
Results show that KGML-ag did an excellent job in reproducing the mesocosm
N2O fluxes (overall r2=0.81, and RMSE=3.6 mgNm-2d-1
from cross validation). Importantly, KGML-ag always outperforms
the PB model and ML models in predicting N2O fluxes, especially for
complex temporal dynamics and emission peaks. Besides, KGML-ag goes beyond
the pure ML models by providing more interpretable predictions as well as
pinpointing desired new knowledge and data to further empower the current
KGML-ag. We believe the KGML-ag development in this study will stimulate a
new body of research on interpretable ML for biogeochemistry and other
related geoscience processes.
Funder
Advanced Research Projects Agency - Energy National Science Foundation
Publisher
Copernicus GmbH
Reference57 articles.
1. Barton, L., Wolf, B., Rowlings, D., Scheer, C., Kiese, R., Grace, P., Grace, P., Stefanova, K., and Butterbach-Bahl, K.: Sampling frequency affects estimates of annual nitrous oxide fluxes, Scientific Reports, 5, 15912, https://doi.org/10.1038/srep15912, 2015. 2. Bauer, P., Dueben, P. D., Hoefler, T., Quintino, T., Schulthess, T. C., and Wedi, N. P.: The digital revolution of Earth-system science, Nature Computational Science, 1, 104–113, https://doi.org/10.1038/s43588-021-00023-0, 2021. 3. Beucler, T., Rasp, S., Pritchard, M., and Gentine, P.: Achieving conservation of energy in neural network emulators for climate modeling, arXiv [preprint], arXiv:1906.06622, 2019. 4. Beucler, T., Pritchard, M., Rasp, S., Ott, J., Baldi, P., and Gentine, P.: Enforcing analytic constraints in neural networks emulating physical systems, Phys. Rev. Lett., 126, 098302, https://doi.org/10.1103/PhysRevLett.126.098302, 2021. 5. Butterbach-Bahl, K., Baggs, E. M., Dannenmann, M., Kiese, R., and Zechmeister-Boltenstern, S.: Nitrous oxide emissions from soils: how well do we understand the processes and their controls?, Philos. T. Roy. Soc. B, 368, 20130122, https://doi.org/10.1098/rstb.2013.0122, 2013.
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|