Few‐shot object detection via class encoding and multi‐target decoding-Reference-Cited by-同舟云学术

Few‐shot object detection via class encoding and multi‐target decoding

Published:2023-04-11 Issue:2 Volume:5 Page:
ISSN:2631-6315
Container-title:IET Cyber-Systems and Robotics
language:en
Short-container-title:IET Cyber-Syst and Robotics

Author:

Guo Xueqiang¹,Yang Hanqing¹,Wei Mohan¹,Ye Xiaotong¹,Zhang Yu¹²^ORCID

Affiliation:

1. State Key Laboratory of Industrial Control Technology College of Control Science and Engineering Zhejiang University Hangzhou China

2. Key Laboratory of Collaborative Sensing and Autonomous Unmanned Systems of Zhejiang Province Hangzhou China

Abstract

AbstractThe task of few‐shot object detection is to classify and locate objects through a few annotated samples. Although many studies have tried to solve this problem, the results are still not satisfactory. Recent studies have found that the class margin significantly impacts the classification and representation of the targets to be detected. Most methods use the loss function to balance the class margin, but the results show that the loss‐based methods only have a tiny improvement on the few‐shot object detection problem. In this study, the authors propose a class encoding method based on the transformer to balance the class margin, which can make the model pay more attention to the essential information of the features, thus increasing the recognition ability of the sample. Besides, the authors propose a multi‐target decoding method to aggregate RoI vectors generated from multi‐target images with multiple support vectors, which can significantly improve the detection ability of the detector for multi‐target images. Experiments on Pascal visual object classes (VOC) and Microsoft Common Objects in Context datasets show that our proposed Few‐Shot Object Detection via Class Encoding and Multi‐Target Decoding significantly improves upon baseline detectors (average accuracy improvement is up to 10.8% on VOC and 2.1% on COCO), achieving competitive performance. In general, we propose a new way to regulate the class margin between support set vectors and a way of feature aggregation for images containing multiple objects and achieve remarkable results. Our method is implemented on mmfewshot, and the code will be available later.

Funder

National Natural Science Foundation of China

State Key Laboratory of Industrial Control Technology

Zhejiang University

Publisher

Institution of Engineering and Technology (IET)

Subject

Artificial Intelligence,Computational Theory and Mathematics,Computer Networks and Communications,Hardware and Architecture,Human-Computer Interaction,Information Systems

Reference60 articles.

1. The face image super-resolution algorithm based on combined representation learning

2. Improved anti-occlusion object tracking algorithm using Unscented Rauch-Tung-Striebel smoother and kernel correlation filter

3. Research on image inpainting algorithm of improved total variation minimization method;Chen Y.;J. Ambient Intell. Hum. Comput.,2021

4. FFTI: Image inpainting algorithm via features fusion and two-steps inpainting

5. Köhler M. Eisenbach M. Gross H.‐M.:Few‐Shot Object Detection: A Survey(2021). arXiv preprint arXiv:2112.11699

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Few-shot object detection: Research advances and challenges;Information Fusion;2024-07