Using Variable Selection and Wavelets to Exploit the Full Potential of Visible–Near Infrared Spectra for Predicting Soil Properties

Author:

Vohland Michael1,Ludwig Marie2,Harbich Monika3,Emmerling Christoph2,Thiele-Bruhn Sören2

Affiliation:

1. Geoinformatics and Remote Sensing, Institute for Geography, Leipzig University, Germany

2. Soil Science, Trier University, Germany

3. Environmental Remote Sensing and Geoinformatics, Trier University, Germany

Abstract

In soil spectroscopy a series of strategies exists to optimise multivariate calibrations. We explore this issue with a set of topsoil samples for which we estimated soil organic carbon (OC) and total nitrogen (N) from visible–near infrared (vis–NIR) spectra (350–2500 nm). In total, 172 samples were collected to cover the soil heterogeneity in the study area located in western Rhineland-Palatinate, Germany. There, soils with varying properties developed from very diverse parent materials, e.g., ranging from very acidic sandstone to dolomitic marl. We defined four sample sets each of a different size and heterogeneity. Each set was subdivided into a calibration and a validation set. The first strategy that we tested to improve prediction accuracies was spectral variable selection using competitive adaptive reweighted sampling (CARS) and iteratively retaining informative variables (IRIV), both in combination with partial least squares regression (PLSR). In addition, continuous wavelet transformation (CWT) with the Mexican Hat wavelet was applied to decompose the measured spectra into multiple scale components (dyadic scales 21−25) and thus to represent the high and low frequency features contained in the spectra. CARS was then applied to select wavelet coefficients from the different scales and to introduce them in the PLSR approach (CWT-CARS-PLSR). Regarding prediction power, CWT-CARS-PLSR outperformed the other approaches. For the smallest data set with 30 validation samples, prediction accuracy for OC increased from approximately quantitative with full spectrum-PLSR ( r2 = 0.81, residual prediction deviation ( RPD) = 2.27) to excellent when using wavelet decomposition and CARS-PLSR ( r2 = 0.93, RPD = 3.60). For N, predictions improved from unsuccessful ( r2 = 0.63, RPD = 1.36) to approximately quantitative ( r2 = 0.84, RPD = 2.03). In case of OC, predictions were worst for the largest dataset with 57 validation samples: CWT-CARS-PLSR achieved approximately quantitative predictions ( r2 = 0.82, RPD = 2.31), whereas full spectrum-PLSR provided estimates that allowed only separating between high and low values ( r2 = 0.72, RPD = 1.88). Accuracy of N estimation for this dataset using CWT-CARS-PLSR was also approximately quantitative. Concerning the tested spectral variable selection techniques, both methods provided similar results in the prediction. The application of IRIV was limited due to long processing times.

Publisher

SAGE Publications

Subject

Spectroscopy

Cited by 42 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3