Spatial prediction of organic carbon in German agricultural topsoil using machine learning algorithms
Author:
Sakhaee Ali, Gebauer Anika, Ließ Mareike, Don AxelORCID
Abstract
Abstract. As the largest terrestrial carbon pool, soil organic carbon (SOC) has the
potential to influence and mitigate climate change; thus, SOC monitoring is of high importance
in the frameworks of various international treaties. Therefore, high-resolution SOC maps are required. Machine learning (ML) offers new
opportunities to develop these maps due to its ability to data mine large
datasets. The aim of this study was to apply three algorithms commonly used
in digital soil mapping – random forest (RF), boosted regression trees
(BRT), and support vector machine for regression (SVR) – on the first German
agricultural soil inventory to model the agricultural topsoil (0–30 cm) SOC
content and develop a two-model approach to address the high variability in
SOC in German agricultural soils. Model performance is often limited by the
size and quality of the soil dataset available for calibration and
validation. Therefore, the impact of enlarging the training dataset was tested
by including data from the European Land Use/Cover Area frame Survey
for agricultural sites in Germany. Nested cross-validation was implemented
for model evaluation and parameter tuning. Grid search and the differential
evolution algorithm were also applied to ensure that each algorithm was
appropriately tuned . The SOC content of the German agricultural soil
inventory was highly variable, ranging from 4 to 480 g kg−1. However, only 4 % of all soils contained more than 87 g kg−1 SOC and were considered organic or degraded organic soils. The
results showed that SVR produced the best performance, with a root-mean-square error (RMSE) of 32 g kg−1 when the algorithms were trained on the full dataset. However, the
average RMSE of all algorithms decreased by 34 % when mineral and organic
soils were modelled separately, with the best result from SVR presenting an RMSE of
21 g kg−1. The model performance was enhanced by up to 1 % for
mineral soils and by up to 2 % for organic soils. Despite the ability of machine
learning algorithms, in general, and SVR, in particular, to model SOC on a
national scale, the study showed that the most important aspect for
improving the model performance was to separate the modelling of mineral and
organic soils.
Publisher
Copernicus GmbH
Reference90 articles.
1. Al-Anazi, A. F. and Gates, I. D.: Support vector regression to predict
porosity and permeability: Effect of sample size, Comput. Geosci., 39,
64–76, https://doi.org/10.1016/j.cageo.2011.06.011, 2012. 2. Arrouays, D., Jolivet, C., Boulonne, L., Bodineau, G., Saby, N., and
Grolleau, E.: A new projection in France: a multi-institutional soil quality
monitoring network, Comptes Rendus l'Académie d'Agriculture Fr., 88,
93–103, 2002. 3. Awad, M. and Khanna, R.: Support Vector Regression, in: Efficient Learning
Machines, Apress, Berkeley, CA, 67–80,
https://doi.org/10.1007/978-1-4302-5990-9_4, 2015. 4. Ballabio, C., Panagos, P., and Monatanarella, L.: Mapping topsoil physical
properties at European scale using the LUCAS database, Geoderma, 261,
110–123, https://doi.org/10.1016/j.geoderma.2015.07.006, 2016. 5. Ballabio, C., Lugato, E., Fernández-Ugalde, O., Orgiazzi, A., Jones, A.,
Borrelli, P., Montanarella, L., and Panagos, P.: Mapping LUCAS topsoil
chemical properties at European scale using Gaussian process regression,
Geoderma, 355, 113912, https://doi.org/10.1016/j.geoderma.2019.113912, 2019.
Cited by
17 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|