SPECTROSCOPY DATA CALIBRATION USING STACKED ENSEMBLE MACHINE LEARNING-Reference-Cited by-同舟云学术

SPECTROSCOPY DATA CALIBRATION USING STACKED ENSEMBLE MACHINE LEARNING

Published:2024-01-01 Issue:1 Volume:25 Page:208-224
ISSN:2289-7860
Container-title:IIUM Engineering Journal
language:
Short-container-title:IIUMEJ

Author:

Mahmud Iwan Solihin ,Yuan Chan Jin^ORCID,Hong Wan Siu^ORCID,Pui Liew Phing,Kit Ang Chun^ORCID,Hossain Wafa,Machmudah Affiani

Abstract

Near infrared spectroscopy (NIRS) is a widely used analytical technique for non-destructive analysis of various materials including food fraud detection. However, the accurate calibration of NIRS data can be challenging due to the complexity of the underlying relationships between the spectral data and the target variables of interest. Ensemble learning, which combines multiple models to make predictions, has been shown to improve the accuracy and robustness of predictive models in various domains. This paper proposes stacking ensemble machine learning (SEML) for calibration of NIRS data with two levels of learning involved. Eight (8) spectroscopy datasets from public repository and previously published works by the authors are used as the case study. The model well generalized the data in the respective regression tasks with of at least »0.8 in the test samples and in the respective classification tasks with classification accuracy (CA) of at least »0.8 also. In addition, the proposed SEML can improve, or at least reach par with, the accuracy of individual base learners in both train and test samples for all cases of regression and classification datasets. It shows superior performance in test samples for both regression and classification datasets with respectively ranging from 0.86 to nearly 1 and CA ranging from 0.89 to 1. ABSTRAK: Spektroskopi inframerah dekat (NIRS) adalah teknik analitikal yang banyak digunakan bagi analisa pelbagai bahan tanpa merosakkan bahan termasuk ketika mengesan penipuan makanan. Walau bagaimanapun, kalibrasi yang tepat bagi data NIRS adalah sangat mencabar kerana hubungan antara data spektral dan pemboleh ubah sasaran yang ingin dikaji bersifat kompleks. Gabungan pembelajaran (Ensemble learning), iaitu gabungan pelbagai model bagi membuat prediksi, telah terbukti dapat meningkatkan ketepatan dan kecekapan model prediksi dalam pelbagai bentuk. Kajian ini mencadangkan Turutan Gabungan Pembelajaran Mesin (Stacking Ensemble Machine Learning ) (SEML), bagi teknik penentu ukuran data NIRS melibatkan dua tahap pembelajaran. Lapan (8) set data spektroskopi dari repositori awam dan kajian terdahulu oleh pengarang telah digunakan sebagai kes kajian. Model ini menggeneralisasi data dalam tugas regresi masing-masing sebanyak ?0.8 bagi sampel ujian dan pengelasan tugas masing-masing dengan ketepatan klasifikasi (CA) sekurang-kurangnya ?0.8. Tambahan, SEML yang dicadangkan ini dapat membantu, atau sekurang-kurangnya setanding dengan ketepatan individu dalam pembelajaran berkumpulan dalam kedua-dua sampel latihan dan ujian bagi semua kes set data regresi dan klasifikasi. Ia menunjukkan prestasi terbaik dalam sampel ujian bagi kedua-dua kumpulan set data regresi dan klasifikasi dengan masing-masing antara 0.86 hingga hampir 1 dan antara julat 0.89 hingga 1 bagi CA.

Funder

Ministry of Higher Education, Malaysia

Publisher

IIUM Press

Reference36 articles.

1. Solihin MI, Shameem Y, Htut T, Ang CK, Hidayab M. (2019) Non-Invasive Blood Glucose Estimation using Handheld Near Infrared Device. Int. J. Recent Technol. Eng., 3: 16-19.

2. doi: 10.35940/ijrte.C1004.1083S19.

3. Chen CJ, Akowuah GA. (2023) Comparison of HPLC and ATR-FTIR Methods for the Determination of Rosmarinic Acid in Aqueous Leaf Extract of Orthosiphon stamineus. Nat. Prod. J., 13(1): 40-46. doi: 10.2174/2210315512666220429114935.

4. B. A. Sabbagh, P. V. Kumar, Y. L. Chew, J. H. Chin, and G. A. Akowuah. (2022) Determination of metformin in fixed-dose combination tablets by ATR-FTIR spectroscopy. Chem. Data Collect., 13: 100868. doi: 10.1016/J.CDC.2022.100868.

5. D. G. Abdullah Al-Sanabani, M. I. Solihin, L. P. Pui, W. Astuti, C. K. Ang, and L. W. Hong. (2019) Development of non-destructive mango assessment using Handheld Spectroscopy and Machine Learning Regression. Journal of Physics: Conference Series, 1367(1): 012030. doi: 10.1088/1742-6596/1367/1/012030.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Artificial intelligence for honey integrity in Ghana: A feasibility study on the use of smartphone images coupled with multivariate algorithms;Smart Agricultural Technology;2024-08

2. Functional network reorganization after endovascular thrombectomy in patients with anterior circulation stroke;NeuroImage: Clinical;2024