A novel two‐phase near‐infrared and midinfrared wavelength selection framework for sample classification

Published:2024-02-17 Issue:3 Volume:38 Page:
ISSN:0886-9383
Container-title:Journal of Chemometrics
language:en

Author:

Fontes Juliana¹, Anzanello Michel J.¹, Brito João B. G.¹, Bucco Guilherme B.²

Affiliation:

1. Department of Industrial Engineering, Federal University of Rio Grande do Sul, Porto Alegre, Brazil

2. School of Administration, Federal University of Rio Grande do Sul, Porto Alegre, Brazil

Abstract

Spectral data describing product samples are typically composed of a large number of noisy and irrelevant wavelengths that tend to undermine the performance of multivariate predictive techniques. This paper proposes a two‐phase framework that integrates a wavelength preselection step, guided by wavelength clustering, with a wrapper‐based strategy. The first phase prunes the data, removing the less informative wavelengths by means of spectral clustering, a technique deemed suitable for the Fourier transform infrared (FTIR) spectroscopy and near‐infrared (NIR) spectroscopy data at hand. The preselected wavelengths then undergo a second selection phase that combines different wavelength importance indices (i.e., Bhattacharyya distance, Chi‐square, ReliefF, and Gini) with classification techniques (i.e., support vector machine, k‐nearest neighbors, and random forest). When applied to 11 FTIR datasets from different domains, the recommended combination of importance index and classifier increased the average accuracy by 6.37% (from 0.863 to 0.918) while retaining, on average, 3.84% of the original spectra. The framework also improved the computational time of the selection process.
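To make the second phase more concrete, the sketch below ranks wavelengths by Bhattacharyya distance and then runs a wrapper search with a k‐nearest neighbors classifier. It is an illustration only, not the authors' implementation: it assumes two classes, treats each wavelength as univariate Gaussian within a class, scores subsets by leave‐one‐out accuracy, and skips the spectral‐clustering preselection of the first phase entirely. All function names and the toy data are hypothetical.

```python
import math
from statistics import mean, pvariance


def bhattacharyya(a, b, eps=1e-12):
    """Bhattacharyya distance between two samples, each assumed univariate Gaussian."""
    va, vb = pvariance(a) + eps, pvariance(b) + eps  # eps guards against zero variance
    mean_term = 0.25 * (mean(a) - mean(b)) ** 2 / (va + vb)
    var_term = 0.5 * math.log((va + vb) / (2.0 * math.sqrt(va * vb)))
    return mean_term + var_term


def rank_wavelengths(X, y):
    """Column indices sorted from most to least discriminative (two classes assumed)."""
    c0, c1 = sorted(set(y))
    scores = []
    for j in range(len(X[0])):
        a = [row[j] for row, lab in zip(X, y) if lab == c0]
        b = [row[j] for row, lab in zip(X, y) if lab == c1]
        scores.append(bhattacharyya(a, b))
    return sorted(range(len(scores)), key=lambda j: -scores[j])


def loo_knn_accuracy(X, y, cols, k=3):
    """Leave-one-out accuracy of k-NN restricted to the wavelengths in cols."""
    hits = 0
    for i, xi in enumerate(X):
        # Squared Euclidean distance to every other sample, paired with its label.
        dists = sorted(
            (sum((xi[j] - xr[j]) ** 2 for j in cols), y[r])
            for r, xr in enumerate(X)
            if r != i
        )
        votes = [lab for _, lab in dists[:k]]
        pred = max(set(votes), key=votes.count)  # majority vote
        hits += pred == y[i]
    return hits / len(X)


def wrapper_select(X, y, k=3):
    """Add wavelengths in ranked order; keep the smallest subset with the best accuracy."""
    order = rank_wavelengths(X, y)
    best_cols, best_acc = [], -1.0
    for n in range(1, len(order) + 1):
        acc = loo_knn_accuracy(X, y, order[:n], k)
        if acc > best_acc:  # strict '>' favors fewer retained wavelengths
            best_cols, best_acc = order[:n], acc
    return best_cols, best_acc


# Toy "spectra": 8 samples x 3 wavelengths; only wavelength 1 separates the classes.
X = [
    [0.50, 0.10, 0.9], [0.52, 0.12, 0.1], [0.48, 0.11, 0.5], [0.51, 0.09, 0.3],
    [0.49, 0.90, 0.8], [0.50, 0.88, 0.2], [0.52, 0.91, 0.6], [0.48, 0.89, 0.4],
]
y = [0, 0, 0, 0, 1, 1, 1, 1]

selected, acc = wrapper_select(X, y)
print(selected, acc)
```

In the framework described above, the ranking and wrapper search would run only on the wavelengths that survive the clustering‐based preselection, and the importance index and classifier would be swapped among the listed alternatives to find the recommended combination.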

Publisher

Wiley


Cited by 1 article.

1. Gaussian process regression coupled with mRMR to predict adulterant concentration in cocaine; Journal of Pharmaceutical and Biomedical Analysis; 2024-09