Abstract
AbstractPartial least squares regression (PLSR) is a reference method in chemometrics. In agronomy, it is used for instance to predict components of chemical composition (response variablesy) of vegetal materials from spectral near infrared (NIR) dataXcollected from spectrometers. The principle of PLSR is to reduce the dimension of the spectral dataXby computing vectors that are then used as latent variables (LVs) in a multiple linear model. A difficulty is to determine the relevant dimensionality (number of LVs) of the model for the given available data. This step can also become time consuming when many different datasets have to be processed and/or the datasets are frequently updated. An alternative to determinate the relevant PLSR dimensionality is the ensemble learning method “PLSR averaging”. In the past, this method has been demonstrated to be efficient for complex biological materials such as mixed forages, and facilitates to automatize predictions (e.g. in user-friendly web interface platforms). This article presents the extension of the PLSR averaging to ak-nearest neighbors locally weighted PLSR pipeline (kNN-LWPLSR). The kNN-LWPLSR pipeline has the advantage to account for non-linearity betweenXandyexisting for instance in heterogeneous data (e.g. mixing of vegetal species, collection from different geographical areas, etc.). In the article, kNN-LWPLSR averaging is applied to an extensive NIR database built to predict the chemical composition of European and tropical forages and feed. The main finding of the study was the overall superiority of the averaging compared to the usual kNN-LWPLSR. Averaging may therefore be recommended in local PLSR pipelines to predict NIR forage and feed data.
Publisher
Cold Spring Harbor Laboratory
Reference30 articles.
1. PLS regression methods
2. H. Wold , Nonlinear iterative partial least squares (NIPALS) modeling: some current developments, in: Multivariate Analysis II, Krishnaiah, P. R., Wright State University, Dayton, Ohio, USA. June 19–24, 1972. New York: Academic Press, 1973: pp. 383–407.
3. PLS-regression: a basic tool of chemometrics
4. Multivariate Calibration and Chemometrics for near Infrared Spectroscopy: Which Method?, J. Near Infrared Spectrosc;JNIRS,2000
5. National calibration of soil organic carbon concentration using diffuse infrared reflectance spectroscopy