Value and limitations of machine learning in high-frequency nutrient data for gap-filling, forecasting, and transport process interpretation
-
Published:2023-06-27
Issue:7
Volume:195
Page:
-
ISSN:0167-6369
-
Container-title:Environmental Monitoring and Assessment
-
language:en
-
Short-container-title:Environ Monit Assess
Author:
Barcala Victoria,Rozemeijer Joachim,Ouwerkerk Kevin,Gerner Laurens,Osté Leonard
Abstract
AbstractHigh-frequency monitoring of water quality in catchments brings along the challenge of post-processing large amounts of data. Moreover, monitoring stations are often remote and technical issues resulting in data gaps are common. Machine learning algorithms can be applied to fill these gaps, and to a certain extent, for predictions and interpretation. The objectives of this study were (1) to evaluate six different machine learning models for gap-filling in a high-frequency nitrate and total phosphorus concentration time series, (2) to showcase the potential added value (and limitations) of machine learning to interpret underlying processes, and (3) to study the limits of machine learning algorithms for predictions outside the training period. We used a 4-year high-frequency dataset from a ditch draining one intensive dairy farm in the east of The Netherlands. Continuous time series of precipitation, evapotranspiration, groundwater levels, discharge, turbidity, and nitrate or total phosphorus were used as predictors for total phosphorus and nitrate concentrations respectively. Our results showed that the random forest algorithm had the best performance to fill in data-gaps, with R2 higher than 0.92 and short computation times. The feature importance helped understanding the changes in transport processes linked to water conservation measures and rain variability. Applying the machine learning model outside the training period resulted in a low performance, largely due to system changes (manure surplus and water conservation) which were not included as predictors. This study offers a valuable and novel example of how to use and interpret machine learning models for post-processing high-frequency water quality data.
Funder
H2020 Marie Skłodowska-Curie Actions
Publisher
Springer Science and Business Media LLC
Subject
Management, Monitoring, Policy and Law,Pollution,General Environmental Science,General Medicine
Reference60 articles.
1. Aha, D., Kilbert, D., & Albert, M. (1991). Instance-based learning algorithms. Machine Learning, 6, 37–66. 2. Arriagada, P., Karelovic, B., & Link, O. (2021). Automatic gap-filling of daily streamflow time series in data-scarce regions using a machine learning algorithm. Journal of Hydrology, 598(May), 126454. https://doi.org/10.1016/j.jhydrol.2021.126454 3. Astuti, A. D., Aris, A., Salim, M. R., Azman, S., Salmiati, & Said, M. I. M. (2020). Artificial intelligence approach to predicting river water quality: A review. Journal of Environmental Treatment Techniques, 8(3), 1093–1100. 4. Baken, S., Verbeeck, M., Verheyen, D., Diels, J., & Smolders, E. (2015). Phosphorus losses from agricultural land to natural waters are reduced by immobilization in iron-rich sediments of drainage ditches. Water Research, 71, 160–170. https://doi.org/10.1016/j.watres.2015.01.008 5. Barcala, V., Rozemeijer, J., Osté, L., Van Der Grift, B., Gerner, L., & Behrends, T. (2020). Processes controlling the flux of legacy phosphorus to surface waters at the farm scale. Environmental Research Letters, 16(1). https://doi.org/10.1088/1748-9326/abcdd4
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|