StarCAN-PFD: An Efficient and Simplified Multi-Scale Feature Detection Network for Small Objects in Complex Scenarios-Reference-Cited by-同舟云学术

StarCAN-PFD: An Efficient and Simplified Multi-Scale Feature Detection Network for Small Objects in Complex Scenarios

Published:2024-08-03 Issue:15 Volume:13 Page:3076
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Chai Zongxuan¹,Zheng Tingting²^ORCID,Lu Feixiang³

Affiliation:

1. School of Electrical and Control Engineering, North China University of Technology, Beijing 100144, China

2. School of Economics and Management, North China University of Technology, Beijing 100144, China

3. SUS-Baidu PaddlePaddle Intelligent Sports Technology Innovation Center, Beijing 100085, China

Abstract

Small object detection in traffic sign applications often faces challenges like complex backgrounds, blurry samples, and multi-scale variations. Existing solutions tend to complicate the algorithms. In this study, we designed an efficient and simple algorithm network called StarCAN-PFD, based on the single-stage YOLOv8 framework, to accurately recognize small objects in complex scenarios. We proposed the StarCAN feature extraction network, which was enhanced with the Context Anchor Attention (CAA). We designed the Pyramid Focus and Diffusion Network (PFDNet) to address multi-scale information loss and developed the Detail-Enhanced Conv Shared Detect (DESDetect) module to improve the recognition of complex samples while keeping the network lightweight. Experiments on the CCTSDB dataset validated the effectiveness of each module. Compared to YOLOv8, our algorithm improved mAP@0.5 by 4%, reduced the model size to less than half, and demonstrated better performance on different traffic sign datasets. It excels at detecting small traffic sign targets in complex scenes, including challenging samples such as blurry, low-light night, occluded, and overexposed conditions, showcasing strong generalization ability.

Funder

Yuxiu Innovation Project of NCUT

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/15/3076/pdf

Reference44 articles.

1. Abuadbba, A., Rhodes, N., Moore, K., Sabir, B., Wang, S., and Gao, Y. (2024). DeepiSign-G: Generic Watermark to Stamp Hidden DNN Parameters for Self-contained Tracking. arXiv.

2. Improved deep learning performance for real-time traffic sign detection and recognition applicable to intelligent transportation systems;Barodi;Int. J. Adv. Comput. Sci. Appl.,2022

3. A universal traffic sign detection system using a novel self-training neural network modeling approach;Trappey;Adv. Eng. Inform.,2024

4. Bao, D., and Gao, R. (2024). YED-YOLO: An object detection algorithm for automatic driving. Signal Image Video Process., 1–9.

5. Agrawal, S., and Chaurasiya, R.K. (2017, January 24–27). Ensemble of SVM for accurate traffic sign detection and recognition. Proceedings of the 1st International Conference on Graphics and Signal Processing, Singapore.