Polymorphic Clustering and Approximate Masking Framework for Fine-Grained Insect Image Classification
-
Published:2024-04-27
Issue:9
Volume:13
Page:1691
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Huo Hua1ORCID, Mei Aokun1, Xu Ningya1
Affiliation:
1. Information Engineering College, Henan University of Science and Technology, Luoyang 471000, China
Abstract
Insect diversity monitoring is crucial for biological pest control in agriculture and forestry. Modern monitoring of insect species relies heavily on fine-grained image classification models. Fine-grained image classification faces challenges such as small inter-class differences and large intra-class variances, which are even more pronounced in insect scenes where insect species often exhibit significant morphological differences across multiple life stages. To address these challenges, we introduce segmentation and clustering operations into the image classification task and design a novel network model training framework for fine-grained classification of insect images using multi-modality clustering and approximate mask methods, named PCAM-Frame. In the first stage of the framework, we adopt the Polymorphic Clustering Module, and segmentation and clustering operations are employed to distinguish various morphologies of insects at different life stages, allowing the model to differentiate between samples at different life stages during training. The second stage consists of a feature extraction network, called Basenet, which can be any mainstream network that performs well in fine-grained image classification tasks, aiming to provide pre-classification confidence for the next stage. In the third stage, we apply the Approximate Masking Module to mask the common attention regions of the most likely classes and continuously adjust the convergence direction of the model during training using a Deviation Loss function. We apply PCAM-Frame with multiple classification networks as the Basenet in the second stage and conduct extensive experiments on the Insecta dataset of iNaturalist 2017 and IP102 dataset, achieving improvements of 2.2% and 1.4%, respectively. Generalization experiments on other fine-grained image classification datasets such as CUB200-2011 and Stanford Dogs also demonstrate positive effects. These experiments validate the pertinence and effectiveness of our framework PCAM-Frame in fine-grained image classification tasks under complex conditions, particularly in insect scenes.
Funder
National Natural Science Foundation of China Major Science and Technology Program of Henan Province Central Government Guiding Local Science and Technology Development Fund Program of Henan Province
Reference55 articles.
1. Wah, C., Branson, S., Welinder, P., Perona, P., and Belongie, S. (2023, May 23). The Caltech-UCSD Birds-200-2011 Dataset. Available online: https://authors.library.caltech.edu/records/cvm3y-5hh21. 2. Khosla, A., Jayadevaprakash, N., Yao, B., and Li, F.-F. (2023, May 23). Novel Dataset for Fine-Grained Image Categorization. Available online: https://people.csail.mit.edu/khosla/papers/fgvc2011.pdf. 3. Van Horn, G., Mac Aodha, O., Song, Y., Cui, Y., Sun, C., Shepard, A., Adam, H., Perona, P., and Belongie, S. (2018, January 18–23). The inaturalist species classification and detection dataset. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA. 4. Wu, X., Zhan, C., Lai, Y.K., Cheng, M.M., and Yang, J. (2019, January 15–20). Ip102: A large-scale benchmark dataset for insect pest recognition. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA. 5. Krause, J., Stark, M., Deng, J., and Fei-Fei, L. (2014, January 23–28). 3D Object Representations for Fine-Grained Categorization. Proceedings of the IEEE International Conference on Computer Vision Workshops, Columbus, OH, USA.
|
|