Affiliation:
1. School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
Abstract
Few-shot object detection (FSOD) aims to detect objects belonging to novel classes with few training samples. With the small number of novel class samples, the visual information extracted is insufficient to accurately represent the object itself, presenting significant intra-class variance and confusion between classes of similar samples, resulting in large errors in the detection results of the novel class samples. We propose a few-shot object detection framework to achieve effective classification and detection by embedding semantic information and contrastive learning. Firstly, we introduced a semantic fusion (SF) module, which projects semantic spatial information into visual space for interaction, to compensate for the lack of visual information and further enhance the representation of feature information. To further improve the classification performance, we embed the memory contrastive proposal (MCP) module to adjust the distribution of the feature space by calculating the contrastive loss between the class-centered features of previous samples and the current input features to obtain a more discriminative embedding space for better intra-class aggregation and inter-class separation for subsequent classification and detection. Extensive experiments on the PASCAL VOC and MS-COCO datasets show that the performance of our proposed method is effectively improved. Our proposed method improves nAP50 over the baseline model by 4.5% and 3.5%.
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference42 articles.
1. Kong, T., Yao, A., Chen, Y., and Sun, F. (2016, January 27–30). Hypernet: Towards accurate region proposal generation and joint object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.
2. Vehicle target detection method based on improved SSD model;Yu;J. Artif. Intell.,2020
3. Object detection and tracking with UAV data using deep learning;Micheal;J. Indian Soc. Remote Sens.,2021
4. Learning to match anchors for visual object detection;Zhang;IEEE Trans. Pattern Anal. Mach. Intell.,2021
5. Kang, B., Liu, Z., Wang, X., Yu, F., Feng, J., and Darrell, T. (November, January 27). Few-shot object detection via feature reweighting. Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea.