ML-based water quality modeling at national level in a data-scarce region-Reference-Cited by-同舟云学术

ML-based water quality modeling at national level in a data-scarce region

Published:2022-06-11 Issue: Volume:3 Page:1-11
ISSN:2700-8150
Container-title:AGILE: GIScience Series
language:en
Short-container-title:AGILE GIScience Ser.

Author:

Virro Holger^ORCID,Kmoch Alexander^ORCID,Vainu Marko,Uuemaa Evelyn^ORCID

Abstract

Abstract. Water quality (WQ) modeling can be used for gaining insight into WQ issues in order to implement effective mitigation efforts. Process-based nutrient models are very complex, requiring a lot of input parameters and computationally expensive calibration. Recently, ML approaches have shown to achieve an accuracy comparable to the process-based models and even outperform them when describing nonlinear relationships. We used observations from 242 Estonian catchments, amounting to 469 yearly total nitrogen (TN) and 470 total phosphorus (TP) measurements covering the period 2016–2020 to train random forest (RF) models for predicting annual N and P concentrations. We used a total of 82 predictor variables, including land use and land cover (LULC), soil, climate and topography parameters and applied a feature selection strategy to reduce the number of dependent features in the models. The SHAP method was used for deriving the most relevant predictors. The performance of our models is comparable to previous process-based models used in the Baltic region. However, as input data used in our models is easier to obtain, the models offer superior applicability in areas, where data availability is insufficient for process-based approaches.

Publisher

Copernicus GmbH

Subject

General Medicine

Link

https://agile-giss.copernicus.org/articles/3/66/2022/agile-giss-3-66-2022.pdf