Violence-YOLO: Enhanced GELAN Algorithm for Violence Detection
-
Published:2024-08-01
Issue:15
Volume:14
Page:6712
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Xu Wenbin1, Zhu Dingju12, Deng Renfeng1, Yung KaiLeung3, Ip Andrew W. H.4ORCID
Affiliation:
1. School of Software, South China Normal University, Foshan 528000, China 2. School of Computer Science, South China Normal University, Guangzhou 510000, China 3. Department of Industrial and Systems Engineering, Hong Kong Polytechnic University, Hong Kong 999077, China 4. Department of Mechanical Engineering, University of Saskatchewan, Saskatoon, SK S7N 5A2, Canada
Abstract
Violence is a serious threat to societal health; preventing violence in airports, airplanes, and spacecraft is crucial. This study proposes the Violence-YOLO model to detect violence accurately in real time in complex environments, enhancing public safety. The model is based on YOLOv9’s Generalized Efficient Layer Aggregation Network (GELAN-C). A multilayer SimAM is incorporated into GELAN’s neck to identify attention regions in the scene. YOLOv9 modules are combined with RepGhostNet and GhostNet. Two modules, RepNCSPELAN4_GB and RepNCSPELAN4_RGB, are innovatively proposed and introduced. The shallow convolution in the backbone is replaced with GhostConv, reducing computational complexity. Additionally, an ultra-lightweight upsampler, Dysample, is introduced to enhance performance and reduce overhead. Finally, Focaler-IoU addresses the neglect of simple and difficult samples, improving training accuracy. The datasets are derived from RWF-2000 and Hockey. Experimental results show that Violence-YOLO outperforms GELAN-C. mAP@0.5 increases by 0.9%, computational load decreases by 12.3%, and model size is reduced by 12.4%, which is significant for embedded hardware such as the Raspberry Pi. Violence-YOLO can be deployed to monitor public places such as airports, effectively handling complex backgrounds and ensuring accurate and fast detection of violent behavior. In addition, we achieved 84.4% mAP on the Pascal VOC dataset, which is a significant reduction in model parameters compared to the previously refined detector. This study offers insights for real-time detection of violent behaviors in public environments.
Reference37 articles.
1. A survey of video violence detection;Yao;Cyber-Phys. Syst.,2023 2. Kumar, P., Shih, G.L., Guo, B.L., Nagi, S.K., Manie, Y.C., Yao, C.K., Arockiyadoss, M.A., and Peng, P.C. (2024). Enhancing Smart City Safety and Utilizing AI Expert Systems for Violence Detection. Future Internet, 16. 3. Wang, Z., Lei, L., and Shi, P. (2023). Smoking behavior detection algorithm based on YOLOv8-MNC. Front. Comput. Neurosci., 17. 4. Moshayedi, A.J., Uddin, N.M.I., Khan, A.S., Zhu, J., and Emadi Andani, M. (2023). Designing and Developing a Vision-Based System to Investigate the Emotional Effects of News on Short Sleep at Noon: An Experimental Case Study. Sensors, 23. 5. Singh, A., Anand, T., Sharma, S., and Singh, P. (2021, January 8–10). IoT based weapons detection system for surveillance and security using YOLOV4. Proceedings of the 2021 6th International Conference on Communication and Electronics Systems (ICCES), Coimbatre, India.
|
|