Semi-Supervised Object Detection with Multi-Scale Regularization and Bounding Box Re-Prediction-Reference-Cited by-同舟云学术

Semi-Supervised Object Detection with Multi-Scale Regularization and Bounding Box Re-Prediction

Published:2024-01-03 Issue:1 Volume:13 Page:221
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Shao Yeqin¹^ORCID,Lv Chang¹,Zhang Ruowei²,Yin He³,Che Meiqin¹^ORCID,Yang Guoqing⁴,Jiang Quan¹

Affiliation:

1. School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China

2. College of Electrical Engineering, Nantong University, Nantong 226004, China

3. School of Information Science and Technology, Nantong University, Nantong 226019, China

4. Suzhou Research Institute of Industrial Technology, Zhejiang University, Hangzhou 310058, China

Abstract

Semi-supervised object detection has become a hot topic in recent years, but there are still some challenges regarding false detection, duplicate detection, and inaccurate localization. This paper presents a semi-supervised object detection method with multi-scale regularization and bounding box re-prediction. Specifically, to improve the generalization of the two-stage object detector and to make consistent predictions related to the image and its down-sampled counterpart, a novel multi-scale regularization loss is proposed for the region proposal network and the region-of-interest head. Then, in addition to using the classification probabilities of the pseudo-labels to exploit the unlabeled data, this paper proposes a novel bounding box re-prediction strategy to re-predict the bounding boxes of the pseudo-labels in the unlabeled images and select the pseudo-labels with reliable bounding boxes (location coordinates) to improve the model’s localization accuracy based on its unsupervised localization loss. Experiments on the public MS COCO and Pascal VOC show that our proposed method achieves a competitive detection performance compared to other state-of-the-art methods. Furthermore, our method offers a multi-scale regularization strategy and a reliably located pseudo-label screening strategy, both of which facilitate the development of semi-supervised object detection techniques and boost the object detection performance in autonomous driving, industrial inspection, and agriculture automation.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/1/221/pdf

Reference40 articles.

1. Bochkovskiy, A., Wang, C., and Liao, H.M. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv.

2. Ge, Z., Liu, S., Wang, F., Li, Z., and Sun, J. (2021). YOLOX: Exceeding YOLO Series in 2021. arXiv.

3. Tian, Z., Shen, C., Chen, H., and He, T. (November, January 27). FCOS: Fully Convolutional One-Stage Object Detection. Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Repbulic of Korea.

4. Tan, M., Pang, R., and Le, Q.V. (2020, January 13–19). EfficientDet: Scalable and Efficient Object Detection. Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA.

5. Wang, C., Bochkovskiy, A., and Liao, H.M. (2023, January 17–24). YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Binary-SegNet: Efficient Convolutional Architecture for Semantic Segmentation Based on Monocular Camera;Lecture Notes in Networks and Systems;2024