Affiliation:
1. Department of Information and Communication Engineering, School of Electrical and Computer Engineering, Chungbuk National University, Cheongju-si 28644, Republic of Korea
2. Information Technology Department, Faculty of Computers and Information, Assiut University, Assiut 71515, Egypt
Abstract
Detecting violent behavior in videos to ensure public safety and security is a significant challenge. Real-world closed-circuit television (CCTV) footage varies widely in camera specifications and locations, so precisely identifying and categorizing instances of violence requires comprehensive understanding and processing of the sequential information embedded in these videos. This study introduces a model that captures the spatiotemporal context of videos across diverse settings and specifications of violent scenarios. We propose a method that uses optical flow and RGB data to accurately capture the spatiotemporal features linked to violent behavior. The approach uses a Conv3D-based ResNet-3D model as the backbone network, which is capable of handling high-dimensional video data. An attention mechanism improves the efficiency and accuracy of violence detection by assigning greater weight to the most crucial frames within the RGB and optical-flow sequences during instances of violence. We evaluated the model on the UBI-Fight, Hockey, Crowd, and Movie-Fights datasets; the proposed method outperformed existing state-of-the-art techniques, achieving area under the curve (AUC) scores of 95.4, 98.1, 94.5, and 100.0 on the respective datasets. This research has the potential to be applied in real-time surveillance systems and may also contribute to a broader spectrum of research in video analysis and understanding.
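The frame-weighting idea described in the abstract can be illustrated with a minimal sketch of attention pooling over per-frame features. This is not the authors' architecture: the ResNet-3D backbone is replaced here by random feature arrays, and the names `attention_pool`, `w_rgb`, and `w_flow`, the single scoring vector per stream, and the concatenation of the two streams are all assumptions made for illustration only.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def attention_pool(frame_features, w):
    """Weight frames by a scalar relevance score and average them.

    frame_features: array of shape (T, D), one D-dim feature per frame.
    w: array of shape (D,), a hypothetical learned scoring vector.
    Returns the pooled (D,) vector and the (T,) attention weights.
    """
    scores = frame_features @ w        # (T,) one score per frame
    weights = softmax(scores)          # (T,) non-negative, sums to 1
    pooled = weights @ frame_features  # (D,) weighted average of frames
    return pooled, weights


# Stand-ins for backbone outputs (assumption: random features, not real video).
rng = np.random.default_rng(0)
T, D = 16, 8
rgb_feats = rng.normal(size=(T, D))
flow_feats = rng.normal(size=(T, D))
w_rgb = rng.normal(size=D)
w_flow = rng.normal(size=D)

rgb_vec, rgb_w = attention_pool(rgb_feats, w_rgb)
flow_vec, flow_w = attention_pool(flow_feats, w_flow)

# Assumed fusion: concatenate the two stream summaries into one clip descriptor.
clip_vec = np.concatenate([rgb_vec, flow_vec])
```

In this sketch, frames with higher scores contribute more to the pooled vector, which is the general sense in which an attention mechanism "assigns greater weight" to crucial frames; a trained model would learn the scoring parameters rather than sample them randomly.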
Funder
Technology development Program of MSS
MSIT (Ministry of Science and ICT), Korea