Proposal-Free Fully Convolutional Network: Object Detection Based on a Box Map-Reference-Cited by-同舟云学术

Proposal-Free Fully Convolutional Network: Object Detection Based on a Box Map

Published:2024-05-30 Issue:11 Volume:24 Page:3529
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Su Zhihao¹^ORCID,Adam Afzan¹^ORCID,Nasrudin Mohammad Faidzul¹^ORCID,Prabuwono Anton Satria²^ORCID

Affiliation:

1. Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia

2. Faculty of Computing and Information Technology in Rabigh, King Abdulaziz University, Rabigh 21911, Saudi Arabia

Abstract

Region proposal-based detectors, such as Region-Convolutional Neural Networks (R-CNNs), Fast R-CNNs, Faster R-CNNs, and Region-Based Fully Convolutional Networks (R-FCNs), employ a two-stage process involving region proposal generation followed by classification. This approach is effective but computationally intensive and typically slower than proposal-free methods. Therefore, region proposal-free detectors are becoming popular to balance accuracy and speed. This paper proposes a proposal-free, fully convolutional network (PF-FCN) that outperforms other state-of-the-art, proposal-free methods. Unlike traditional region proposal-free methods, PF-FCN can generate a “box map” based on regression training techniques. This box map comprises a set of vectors, each designed to produce bounding boxes corresponding to the positions of objects in the input image. The channel and spatial contextualized sub-network are further designed to learn a “box map”. In comparison to renowned proposal-free detectors such as CornerNet, CenterNet, and You Look Only Once (YOLO), PF-FCN utilizes a fully convolutional, single-pass method. By reducing the need for fully connected layers and filtering center points, the method considerably reduces the number of trained parameters and optimizes the scalability across varying input sizes. Evaluations of benchmark datasets suggest the effectiveness of PF-FCN: the proposed model achieved an mAP of 89.6% on PASCAL VOC 2012 and 71.7% on MS COCO, which are higher than those of the baseline Fully Convolutional One-Stage Detector (FCOS) and other classical proposal-free detectors. The results prove the significance of proposal-free detectors in both practical applications and future research.

Funder

MIGHT-TUBITAK research

proofreading

Publisher

MDPI AG

Link

https://www.mdpi.com/1424-8220/24/11/3529/pdf

Reference53 articles.

1. Object detection with deep learning: A review;Zhao;IEEE Trans. Neural Netw. Learn. Syst.,2019

2. Deep learning-based object detection in augmented reality: A systematic review;Ghasemi;Comput. Ind.,2022

3. (2020). Deep Learning in Computer Vision: Principles and Applications, CRC Press.

4. Review of local binary pattern operators in image feature extraction;Khaleefah;Indones. J. Electr. Eng. Comput. Sci.,2020

5. Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23–28). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.