To Estimate Performance of Artificial Neural Network Model Based on Terahertz Spectrum: Gelatin Identification as an Example
-
Published:2022-07-14
Issue:
Volume:9
Page:
-
ISSN:2296-861X
-
Container-title:Frontiers in Nutrition
-
language:
-
Short-container-title:Front. Nutr.
Author:
Li Yizhang,Liu Lingyu,Wang Zhongmin,Chang Tianying,Li Ke,Xu Wenqing,Wu Yong,Yang Hua,Jiang Daoli
Abstract
It is a necessity to determine significant food or traditional Chinese medicine (TCM) with low cost, which is more likely to achieve high accurate identification by THz-TDS. In this study, feedforward neural networks based on terahertz spectra are employed to predict the animal origin of gelatins, whose adaption to the mission is examined by parallel models built by random sample partition and initialization. It is found that the generalization performance of feedforward ANNs in original data is not satisfactory although prediction on trained samples can be accurate. A multivariate scattering correction is conducted to enhance prediction accuracy, and 20 additional models verify the effectiveness of such dispose. A special partition of total dataset is conducted based on statistics of parallel models, whose influence on ANN performance is investigated with another 20 models. The performance of the models is unsatisfactory because of notable differences in training and test sets according to principal component analysis. By comparing the distribution of the first two principal components before and after multivariate scattering correction, we found that the reciprocal of the minimum number of line segments required for error-free classification in 2-D feature space can be viewed as an index to describe linear separability of data. The rise of proposed linear separability would have a lower requirement for harsh parameter tuning of ANN models and tolerate random initialization. The difference in principal components of samples between a training set and a data set determines whether partition is acceptable or whether a model would have generality. A rapid way to estimate the performance of an ANN before sufficient tuning on a classification mission is to compare differences between groups and differences within groups. Given that a representative peak missing curve is discussed in this article, an analysis based on gelatin THz spectra may be helpful for studies on some other feature-less species.
Funder
Major Scientific and Technological Innovation Project of Shandong Province
Shandong Academy of Sciences
Natural Science Foundation of Shandong Province
National Natural Science Foundation of China
Publisher
Frontiers Media SA
Subject
Nutrition and Dietetics,Endocrinology, Diabetes and Metabolism,Food Science
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献