Fast and Accurate Detection of Dim and Small Targets for Smart Micro-Light Sight
-
Published:2024-08-20
Issue:16
Volume:13
Page:3301
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Wei Jia1ORCID, Che Kai1ORCID, Gong Jiayuan2ORCID, Zhou Yun1, Lv Jian1, Que Longcheng1, Liu Hu3, Len Yuanbin4
Affiliation:
1. College of Optoelectronic Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China 2. Institute of Automotive Engineers, Hubei University of Automotive Technology No. 167, Shiyan 442000, China 3. Xi’an Institute of Applied Optics, Xi’an 710065, China 4. Chengdu Dingyi Information Technology Co., Chengdu 611731, China
Abstract
To deal with low recognition accuracy and large time-consumption for dim, small targets in a smart micro-light sight, we propose a lightweight model DS_YOLO (dim and small target detection). We introduce the adaptive channel convolution module (ACConv) to reduce computational redundancy while maximizing the utilization of channel features. To address the misalignment problem in multi-task learning, we also design a lightweight dynamic task alignment detection head (LTD_Head), which utilizes GroupNorm to improve the performance of detection head localization and classification, and shares convolutions to make the model lightweight. Additionally, to improve the network’s capacity to detect small-scale targets while maintaining its generalization to multi-scale target detection, we extract high-resolution feature map information to establish a new detection head. Ultimately, the incorporation of the attention pyramid pooling layer (SPPFLska) enhances the model’s regression accuracy. We conduct an evaluation of the proposed algorithm DS_YOLO on four distinct datasets: CityPersons, WiderPerson, DOTA, and TinyPerson, achieving a 66.6% mAP on the CityPersons dataset, a 4.3% improvement over the original model. Meanwhile, our model reduces the parameter count by 33.3% compared to the baseline model.
Funder
Natural Science Foundation of Hubei Province of China Key Project of Science and Technology Research Plan of Hubei Provincial Department of Education
Reference43 articles.
1. Deep learning for occluded and multi-scale pedestrian detection: A review;Xiao;IET Image Process.,2021 2. Vehicle And Pedestrian Detection Algorithm Based on Improved YOLOv5;Sun;IAENG Int. J. Comput. Sci.,2023 3. Computing systems for autonomous driving: State of the art and challenges;Liu;IEEE Internet Things J.,2020 4. Levinson, J., Askeland, J., Becker, J., Dolson, J., Held, D., Kammel, S., Kolter, J.Z., Langer, D., Pink, O., and Pratt, V. (2011, January 5–9). Towards fully autonomous driving: Systems and algorithms. Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), Baden, Germany. 5. Gawande, U., Hajari, K., and Golhar, Y. (2020). Pedestrian detection and tracking in video surveillance system: Issues, comprehensive review, and challenges. Recent Trends in Computational Intelligence, Intech Open Publ.
|
|