Affiliation:
1. College of Information Science and Engineering, Henan University of Technology, Zhengzhou 450001, China
2. Beijing Institute of Technology, Beijing 100081, China
Abstract
Intelligent video surveillance plays a pivotal role in the infrastructure of smart cities. Multi-angle cameras, acting as perceptive sensors, improve pedestrian detection and strengthen security. However, current pedestrian detection methods suffer from slow detection speeds and increased costs. To address these challenges, we introduce YOLOv5-MS, a YOLOv5-based target detection model. First, we optimize the multi-threaded acquisition of video streams within YOLOv5 to ensure image stability and real-time performance. Second, using reparameterization, we replace the original backbone convolution with RepvggBlock and reduce the number of channels in the convolutional layers, which streamlines the model and increases inference speed. Third, we incorporate a bioinspired "squeeze and excitation" module into the convolutional neural network; it sharpens the focus on targets and diminishes the influence of irrelevant elements, which improves detection accuracy. Fourth, we apply the K-means algorithm and bioinspired Retinex image augmentation during training to further improve detection performance. Finally, the loss is computed with the Focal-EIOU approach. On our internally developed smart city dataset, YOLOv5-MS achieves 96.5% mAP, a 2.0% improvement over YOLOv5s, and the average inference speed increases by 21.3%. These results show that the model can perform pedestrian detection effectively within an intranet of more than 50 video surveillance cameras, meeting our requirements.
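To make the final step of the abstract concrete, the following is a minimal sketch of a Focal-EIOU loss for a single pair of axis-aligned boxes. It follows the commonly published formulation (EIOU adds center-distance, width, and height penalties to 1 - IoU, and the focal variant weights the result by IoU raised to a power gamma). The function name `focal_eiou_loss`, the `(x1, y1, x2, y2)` box layout, and the default `gamma=0.5` are assumptions for illustration, not details taken from the YOLOv5-MS implementation.

```python
def focal_eiou_loss(pred, target, gamma=0.5, eps=1e-9):
    """Illustrative Focal-EIOU loss for two boxes given as (x1, y1, x2, y2).

    gamma and eps are assumed defaults, not values reported by the paper.
    """
    px1, py1, px2, py2 = pred
    tx1, ty1, tx2, ty2 = target

    # Widths and heights of the predicted and target boxes.
    pw, ph = px2 - px1, py2 - py1
    tw, th = tx2 - tx1, ty2 - ty1

    # Intersection over union.
    iw = max(0.0, min(px2, tx2) - max(px1, tx1))
    ih = max(0.0, min(py2, ty2) - max(py1, ty1))
    inter = iw * ih
    union = pw * ph + tw * th - inter + eps
    iou = inter / union

    # Width and height of the smallest box enclosing both boxes.
    cw = max(px2, tx2) - min(px1, tx1)
    ch = max(py2, ty2) - min(py1, ty1)

    # Squared distance between the two box centers.
    rho2 = ((px1 + px2) - (tx1 + tx2)) ** 2 / 4 + ((py1 + py2) - (ty1 + ty2)) ** 2 / 4

    # EIOU: IoU term plus center-distance, width, and height penalties.
    eiou = (
        1.0 - iou
        + rho2 / (cw ** 2 + ch ** 2 + eps)
        + (pw - tw) ** 2 / (cw ** 2 + eps)
        + (ph - th) ** 2 / (ch ** 2 + eps)
    )

    # Focal weighting: scale the EIOU loss by IoU ** gamma.
    return (iou ** gamma) * eiou


if __name__ == "__main__":
    print(focal_eiou_loss((0.0, 0.0, 2.0, 2.0), (1.0, 1.0, 3.0, 3.0)))
```

In this formulation the weight IoU ** gamma is zero for boxes that do not overlap, so such pairs contribute no loss; a training pipeline would typically apply the same arithmetic to batched tensors rather than to Python tuples.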
Subject
Molecular Medicine,Biomedical Engineering,Biochemistry,Biomaterials,Bioengineering,Biotechnology
Cited by
9 articles.