Data worth analysis within a model-free data assimilation framework for soil moisture flow
-
Published:2023-07-19
Issue:14
Volume:27
Page:2661-2680
-
ISSN:1607-7938
-
Container-title:Hydrology and Earth System Sciences
-
language:en
-
Short-container-title:Hydrol. Earth Syst. Sci.
Author:
Wang Yakun,Hu Xiaolong,Wang Lijun,Li Jinmin,Lin Lin,Huang Kai,Shi Liangsheng
Abstract
Abstract. Conventional data worth (DW) analysis for soil water problems depends on physical dynamic models. The widespread occurrence of model structural errors and the strong nonlinearity of soil water flow may lead to biased or wrong worth assessment. By introducing the nonparametric data worth analysis (NP-DWA) framework coupled with the ensemble Kalman filter (EnKF), this real-world case study attempts to assess the worth of potential soil moisture observations regarding the reconstruction of fully data-driven soil water flow models prior to data gathering. The DW of real-time soil moisture observations after Gaussian process training and Kalman update was quantified with three representative information metrics, including the trace, Shannon entropy difference and relative entropy. The sequential NP-DWA framework was examined by a number of cases in terms of the variable of interest, spatial location, observation error, and prior data content. Our results indicated that, similarly to the traditional DW analysis based on physical models, the overall increasing trend of the DW from the sequential augmentation of additional observations within the NP-DWA framework was also susceptible to interruptions by localized surges due to never-experienced atmospheric conditions (i.e., rainfall events). The difference is that this biased DW in the former is caused by model
structural errors triggered by contrasting scenarios, which is difficult to be compensated for by assimilating more prior data, while this performance
degradation in the NP-DWA can be effectively alleviated by enriching training scenarios or the appropriate amplification of observational noise
under extreme meteorological conditions. Nevertheless, a substantial
expansion of the prior data content may cause an unexpected increase in the DW
of future potential observations due to the possible introduction of ensuing observation noises. Hence, high-quality and representative small data may be a better choice than unfiltered big data. Compared with the observations in the surface layer with the strongest time variability, the soil water content in the middle layer robustly exhibited remarkable superiority in the construction of model-free soil moisture models. We also demonstrated that the DW assessment performance was jointly determined by 3C, i.e., the capacity of potential observation realizations to capture actual observations, the correlation of potential observations with the variables of interest and the choice of DW indicators. Direct mapping from regular meteorological data to soil water content within the NP-DWA mitigated the adverse effects of nonlinearity-related interference, which thus facilitated the identification of the soil moisture covariance matrix, especially the cross-covariance.
Funder
National Natural Science Foundation of China
Publisher
Copernicus GmbH
Subject
General Earth and Planetary Sciences,General Engineering,General Environmental Science
Reference65 articles.
1. Akhtar, K., Wang, W., Khan, A., Ren, G., Afridi, M. Z., Feng, Y., and Yang,
G.: Wheat straw mulching offset soil moisture deficient for improving
physiological and growth performance of summer sown soybean, Agric. Water
Manage., 211, 16–25, https://doi.org/10.1016/j.agwat.2018.09.031, 2019. 2. Amro, A., Al-Akhras, M., Hindi, K. E., Habib, M., and Shawar, B. A.:
Instance Reduction for Avoiding Overfitting in Decision Trees, J. Intell.
Syst., 30, 438–459, https://doi.org/10.1515/jisys-2020-0061, 2021. 3. Brajard, J., Carrassi, A., Bocquet, M., and Bertino, L.: Combining data
assimilation and machine learning to emulate a dynamical model from sparse
and noisy observations: A case study with the Lorenz 96 model, J. Comput.
Sci., 44, 101171, https://doi.org/10.1016/j.jocs.2020.101171, 2020. 4. Brajard, J., Carrassi, A., Bocquet, M., and Bertino, L.: Combining data
assimilation and machine learning to infer unresolved scale parametrization,
Philos. T. Roy. Soc. A, 379, 20200086, https://doi.org/10.1098/rsta.2020.0086, 2021. 5. Bresler, E., Heller, J., Diner, N., Ben-Asher, I., Brandt, A., and Goldberg,
D.: Infiltration from a Trickle Source: II. Experimental Data and
Theoretical Predictions, Soil Sci. Soc. Am. J., 35, 683–689,
https://doi.org/10.2136/sssaj1971.03615995003500050019x, 1971.
|
|