Digital Mapping of Soil Organic Carbon Using Machine Learning Algorithms in the Upper Brahmaputra Valley of Northeastern India
Author:
Kumar Amit12ORCID, Moharana Pravash Chandra3ORCID, Jena Roomesh Kumar4ORCID, Malyan Sandeep Kumar5ORCID, Sharma Gulshan Kumar6ORCID, Fagodiya Ram Kishor7ORCID, Shabnam Aftab Ahmad1ORCID, Jigyasu Dharmendra Kumar1, Kumari Kasthala Mary Vijaya1, Doss Subramanian Gandhi2ORCID
Affiliation:
1. Central Muga Eri Research and Training Institute, Lahdoigarh, Jorhat 785700, Assam, India 2. Central Sericultural Research and Training Institute, Mysuru 570008, Karnataka, India 3. ICAR-National Bureau of Soil Survey and Land Use Planning, Nagpur 440033, Maharashtra, India 4. ICAR-Indian Institute of Water Management, Bhubaneswar 751023, Odisha, India 5. Department of Environmental Studies, Dyal Singh Evening College, University of Delhi, New Delhi 110003, India 6. ICAR-Indian Institute of Soil and Water Conservation, Research Centre, Kota 324002, Rajasthan, India 7. ICAR-Central Soil Salinity Research Institute, Karnal 132001, Haryana, India
Abstract
Soil Organic Carbon (SOC) is a crucial indicator of ecosystem health and soil quality. Machine learning (ML) models that predict soil quality based on environmental parameters are becoming more prevalent. However, studies have yet to examine how well each ML technique performs when predicting and mapping SOC, particularly at high spatial resolutions. Model predictors include topographic variables generated from SRTM DEM; vegetation and soil indices derived from Landsat satellite images predict SOC for the Lakhimpur district of the upper Brahmaputra Valley of Assam, India. Four ML models, Random Forest (RF), Cubist, Extreme Gradient Boosting (XGBoost), and Support Vector Machine (SVM), were utilized to predict SOC for the top layer of soil (0–15 cm) at a 30 m resolution. The results showed that the descriptive statistics of the calibration and validation sets were close enough to the total set data and calibration dataset, representing the complete samples. The measured SOC content varied from 0.10 to 1.85%. The RF model’s performance was optimal in the calibration and validation sets (R2c = 0.966, RMSEc = 0.159%, R2v = 0.418, RMSEv = 0.377%). The SVM model, on the other hand, had the next-lowest accuracy, explaining 47% of the variation (R2c = 0.471, RMSEc = 0.293, R2v = 0.081, RMSEv = 0.452), while the Cubist model fared the poorest in both the calibration and validation sets. The most-critical variable in the RF model for predicting SOC was elevation, followed by MAT and MRVBF. The essential variables for the Cubist model were slope, TRI, MAT, and Band4. AP and LS were the most-essential factors in the XGBoost and SVM models. The predicted OC ranged from 0.44 to 1.35%, 0.031 to 1.61%, 0.035 to 1.71%, and 0.47 to 1.36% in the RF, Cubist, XGBoost, and SVM models, respectively. Compared with different ML models, RF was optimal (high accuracy and low uncertainty) for predicting SOC in the investigated region. According to the present modeling results, SOC may be determined simply and accurately. In general, the high-resolution maps might be helpful for decision-makers, stakeholders, and applicants in sericultural management practices towards precision sericulture.
Subject
Nature and Landscape Conservation,Ecology,Global and Planetary Change
Reference38 articles.
1. A simple pipeline for the assessment of legacy soil datasets: An example and test with soil organic carbon from a highly variable area;Schillaci;Catena,2019 2. Assessment of soil organic carbon stocks under future climate and land cover changes in Europe;Yigini;Sci. Total Environ.,2016 3. Jena, R.K., Moharana, P.C., Dharumarajan, S., Sharma, G.K., Ray, P., Deb Roy, P., Ghosh, D., Das, B., Alsuhaibani, A.M., and Gaber, A. (2023). Spatial Prediction of Soil Particle-Size Fractions Using Digital Soil Mapping in the North Eastern Region of India. Land, 12. 4. Moharana, P.C., Meena, R.L., Nogiya, M., Jena, R.K., Sharma, G.K., Sahoo, S., Jha, P.K., Aditi, K., and Vara Prasad, P.V. (2022). Impacts of Land Use on Pools and Indices of Soil Organic Carbon and Nitrogen in the Ghaggar Flood Plains of Arid India. Land, 11. 5. Digital mapping of soil organic carbon at multiple depths using different data mining techniques in Baneh region, Iran;Nabiollahi;Geoderma,2016
|
|