Lightweight and Optimized Multi-Label Fruit Image Classification: A Combined Approach of Knowledge Distillation and Image Enhancement
-
Published:2024-08-17
Issue:16
Volume:13
Page:3267
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Zhang Juce1, Lu Yao2, Guo Yi1, Wu Chengkai3, Liu Hengjun1, Yu Zhuoyi4, Zhou Jiayi5
Affiliation:
1. School of Economics, Beijing Technology and Business University, Beijing 100048, China 2. School of Information Science and Technology, Shihezi University, Shihezi 832003, China 3. The School of Mathematics and Statistics, Beijing Jiaotong University, Beijing 100044, China 4. Maynooth International Engineering College, Fuzhou University, Fuzhou 350108, China 5. Institutes of Science and Development, Chinese Academy of Sciences, Beijing 100864, China
Abstract
In our research, we aimed to address the shortcomings of traditional fruit image classification models, which struggle with inconsistent lighting, complex backgrounds, and high computational demands. To overcome these challenges, we developed a novel multi-label classification method incorporating advanced image preprocessing techniques, such as Contrast Limited Adaptive Histogram Equalization and the Gray World algorithm, which enhance image quality and color balance. Utilizing lightweight encoder–decoder architectures, specifically MobileNet, DenseNet, and EfficientNet, optimized with an Asymmetric Binary Cross-Entropy Loss function, we improved model performance in handling diverse sample difficulties. Furthermore, Multi-Label Knowledge Distillation (MLKD) was implemented to transfer knowledge from large, complex teacher models to smaller, efficient student models, thereby reducing computational complexity without compromising accuracy. Experimental results on the DeepFruit dataset, which includes 21,122 images of 20 fruit categories, demonstrated that our method achieved a peak mean Average Precision (mAP) of 90.2% using EfficientNet-B3, with a computational cost of 7.9 GFLOPs. Ablation studies confirmed that the integration of image preprocessing, optimized loss functions, and knowledge distillation significantly enhances performance compared to the baseline models. This innovative method offers a practical solution for real-time fruit classification on resource-constrained devices, thereby supporting advancements in smart agriculture and the food industry.
Reference38 articles.
1. Improving the accuracy of medical diagnosis with causal machine learning;Richens;Nat. Commun.,2020 2. Ni, J., Chen, Y., Chen, Y., Zhu, J., Ali, D., and Cao, W. (2020). A survey on theories and applications for self-driving cars based on deep learning methods. Appl. Sci., 10. 3. Camgoz, N.C., Koller, O., Hadfield, S., and Bowden, R. (2020, January 13–19). Sign language transformers: Joint end-to-end sign language recognition and translation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA. 4. Zoph, B., Vasudevan, V., Shlens, J., and Le, Q.V. (2018, January 18–23). Learning transferable architectures for scalable image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA. 5. Nandi, C.S., Tudu, B., and Koley, C. (2012, January 18–21). An automated machine vision based system for fruit sorting and grading. Proceedings of the 2012 Sixth International Conference on Sensing Technology (ICST), Kolkata, India.
|
|