S-YOLOv5: A Lightweight Model for Detecting Objects Thrown from Tall Buildings in Communities
Published: 2024-03-29
Issue: 4
Volume: 15
Page: 188
ISSN: 2078-2489
Container-title: Information
Language: en
Short-container-title: Information
Author:
Shi Yuntao 1,2, Luo Qi 1,2, Zhou Meng 1,2, Guo Wei 1,2, Li Jie 1,2, Li Shuqin 1,2, Ding Yu 1,2
Affiliation:
1. School of Electrical and Control Engineering, North China University of Technology, Beijing 100144, China
2. Key Lab of Field Bus and Automation of Beijing, North China University of Technology, Beijing 100144, China
Abstract
Objects thrown from tall buildings in communities are characterized by their small size, inconspicuous features, and high speed. Existing algorithms for detecting such objects suffer from excessive parameter counts, overly complex models that are difficult to deploy, and insufficient detection accuracy. To address these issues, this study proposes a lightweight detection model for objects thrown from tall buildings in communities, named S-YOLOv5. The model is based on the YOLOv5 algorithm and uses a lightweight convolutional neural network, Enhanced ShuffleNet (ESNet), as the backbone for image feature extraction. On this basis, the initial stage of the backbone is enhanced and the simple attention module (SimAM) is added, exploiting the rich position and contour information in shallow feature maps to improve small-target detection. For feature fusion, a sparsely connected Path Aggregation Network (SCPANet) module is designed, replacing the regular convolutions of the Path Aggregation Network (PANet) with sparsely connected convolution (SCConv) to fuse features efficiently. In addition, the model adopts the normalized Wasserstein distance (NWD) loss function to reduce sensitivity to positional deviation, further improving accuracy. Test results on a self-built dataset of objects thrown from tall buildings show that S-YOLOv5 detects such objects quickly and accurately, with an accuracy of 90.2% and a detection speed of 34.1 FPS. Compared with the original YOLOv5 model, the number of parameters is reduced by 87.3%, while accuracy and detection speed improve by 0.8% and 63%, respectively.
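The two components the abstract credits for the small-object gains, SimAM attention and the NWD loss, follow published formulations and can be sketched compactly. The snippet below is a minimal PyTorch sketch based on the original SimAM and NWD papers, not the authors' released S-YOLOv5 code; the function names (simam, nwd_loss), the energy regularizer e_lambda, and the normalizing constant c are illustrative assumptions (in the NWD formulation, C is a dataset-dependent constant).

```python
import torch


def simam(x, e_lambda=1e-4):
    """Parameter-free SimAM attention: scale each activation by its inverse energy.
    x: feature map of shape (B, C, H, W)."""
    n = x.shape[2] * x.shape[3] - 1
    d = (x - x.mean(dim=(2, 3), keepdim=True)).pow(2)       # (t - mu)^2 per position
    v = d.sum(dim=(2, 3), keepdim=True) / n                  # per-channel variance estimate
    e_inv = d / (4 * (v + e_lambda)) + 0.5                   # inverse energy (importance)
    return x * torch.sigmoid(e_inv)


def nwd_loss(pred, target, c=12.8, eps=1e-7):
    """Normalized Wasserstein distance loss for boxes in (cx, cy, w, h) format.
    Boxes are modeled as 2-D Gaussians; c is a dataset-dependent normalizer (assumed value)."""
    ga = torch.cat([pred[:, :2], pred[:, 2:] / 2], dim=1)    # (cx, cy, w/2, h/2)
    gb = torch.cat([target[:, :2], target[:, 2:] / 2], dim=1)
    w2 = torch.sqrt(((ga - gb) ** 2).sum(dim=1) + eps)       # 2nd-order Wasserstein distance
    return (1.0 - torch.exp(-w2 / c)).mean()                 # NWD similarity -> loss
```

Unlike IoU-based losses, the exponential mapping keeps the gradient informative even when a tiny predicted box has no overlap with its target, which is why an NWD-style loss reduces sensitivity to positional deviation for small thrown objects.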
Funder
National Key R&D Program of China; National Natural Science Foundation of China