Few-Shot Object Detection in Remote Sensing Images via Data Clearing and Stationary Meta-Learning
Author:
Yang Zijiu 1, Guan Wenbin 1, Xiao Luyang 1, Chen Honggang 1
Affiliation:
1. College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China
Abstract
Interest in few-shot object detection (FSOD) is driven by the limited availability of remote sensing data. To address the challenges posed by remote sensing images (RSIs) and FSOD, we propose a meta-learning-based Balanced Few-Shot Object Detector (B-FSDet), built upon YOLOv9 (GELAN-C version). First, incompletely annotated objects can break the balance required by the few-shot principle, so we propose a straightforward yet efficient data clearing strategy that ensures balanced input for each category. Second, the feature vectors output from the support set exhibit significant variance fluctuations, which makes them less effective at representing the object information of each class; we therefore propose a stationary feature extraction module and a corresponding stationary and fast prediction method, which together form a stationary meta-learning mode. Finally, because inter-class differences in RSIs are minimal, we propose an inter-class discrimination support loss based on the stationary meta-learning mode, so that the information the support set provides for each class is balanced and easier to distinguish. We evaluate the proposed detector on the DIOR and NWPU VHR-10.v2 datasets, and a comparison with state-of-the-art detectors shows promising performance.
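The abstract does not give the form of the inter-class discrimination support loss, so the following is only a minimal sketch of the general idea, not the B-FSDet implementation. It assumes each class is summarised by the mean of its support feature vectors (a prototype) and that class separability is measured by the mean pairwise cosine similarity between prototypes. The function names, the toy two-dimensional features, and the choice of cosine similarity are all hypothetical.

```python
import math
from itertools import combinations


def class_prototype(vectors):
    """Average one class's support feature vectors into a single prototype."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity of two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def inter_class_penalty(prototypes):
    """Mean pairwise cosine similarity between class prototypes.

    Assumed stand-in for an inter-class discrimination term:
    a lower value means the class prototypes are easier to tell apart.
    """
    pairs = list(combinations(prototypes.values(), 2))
    return sum(cosine(a, b) for a, b in pairs) / len(pairs)


# Toy support set: two classes, two support feature vectors each (made-up values).
support = {
    "airplane": [[1.0, 0.0], [0.8, 0.2]],
    "ship": [[0.0, 1.0], [0.2, 0.8]],
}
prototypes = {name: class_prototype(vecs) for name, vecs in support.items()}
penalty = inter_class_penalty(prototypes)
```

Under these assumptions, minimising such a penalty during training would push the per-class support representations apart. A real detector would compute it on learned feature maps rather than hand-written vectors.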
Funder
Sichuan Science and Technology Program