
Violence Detection by Pretrained Modules with Different Deep Learning Approaches

Published:2019-10-25 Issue:01 Volume:07 Page:19-40
ISSN:2196-8888
Container-title:Vietnam Journal of Computer Science
language:en
Short-container-title:Vietnam J. Comp. Sci.

Author:

Sumon Shakil Ahmed¹,Goni Raihan¹,Hashem Niyaz Bin¹,Shahria Tanzil¹,Rahman Rashedur M.¹

Affiliation:

1. Department of Electrical and Computer Engineering, North South University, Dhaka 1229, Bangladesh

Abstract

In this paper, we explore different strategies for assessing the saliency of features from different pretrained models in detecting violence in videos. We created a dataset consisting of violent and non-violent videos in different settings. Three models pretrained on ImageNet (VGG16, VGG19 and ResNet50) are used to extract features from the video frames. In one experiment, the extracted features are fed into a fully connected network that detects violence at the frame level. In another experiment, we feed the extracted features of 30 frames at a time to a long short-term memory (LSTM) network. Furthermore, we apply attention to the features extracted from the frames through a spatial transformer network, which also enables transformations such as rotation, translation and scaling. Along with these models, we designed a custom convolutional neural network (CNN) as a feature extractor, as well as a pretrained model that is initially trained on a movie violence dataset. In the end, the features extracted from the ResNet50 pretrained model proved to be more salient for detecting violence than those from the other models. These ResNet50 features, in combination with an LSTM, provide an accuracy of 97.06%, which is better than the other models we experimented with.
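
The abstract says that features from 30 frames at a time are passed to the LSTM. As a rough illustration only, the sketch below groups per-frame feature vectors into 30-frame clips. The function name `window_features`, the use of non-overlapping windows, the dropping of leftover frames, and the toy 4-value features are all assumptions made for this example; the abstract does not specify how the windows are formed or how large each feature vector is.

```python
from typing import List, Sequence

FrameFeature = Sequence[float]


def window_features(
    features: Sequence[FrameFeature], window: int = 30
) -> List[List[FrameFeature]]:
    """Split per-frame features into consecutive, non-overlapping clips.

    Assumption: trailing frames that do not fill a whole window are dropped.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    usable = len(features) - len(features) % window
    return [list(features[i:i + window]) for i in range(0, usable, window)]


# Toy stand-in for extracted features: 95 frames, 4 values per frame.
# (Hypothetical data; real pretrained-model features would be much larger.)
frames = [[float(i)] * 4 for i in range(95)]

clips = window_features(frames)
print(len(clips), len(clips[0]), len(clips[0][0]))  # prints "3 30 4"
```

Each resulting clip has the shape (30 frames, feature size), which is the kind of sequence input a recurrent model such as an LSTM consumes one clip at a time.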

Publisher

World Scientific Pub Co Pte Lt

Link

https://www.worldscientific.com/doi/pdf/10.1142/S2196888820500013

References: 24 articles.

1. Violent Crowd Flow Detection Using Deep Learning

2. Violence Detection in Video by Using 3D Convolutional Neural Networks

Cited by 52 articles.

1. Crowd dynamics analysis and behavior recognition in surveillance videos based on deep learning;Multimedia Tools and Applications;2024-09-12

2. Revisiting vision-based violence detection in videos: A critical analysis;Neurocomputing;2024-09

3. A shallow 3D convolutional neural network for violence detection in videos;Egyptian Informatics Journal;2024-06

4. An ensemble based approach for violence detection in videos using deep transfer learning;Multimedia Tools and Applications;2024-05-20

5. Research trend analysis of abnormal behavior detection based on knowledge networks;International Conference on Remote Sensing Technology and Survey Mapping (RSTSM 2024);2024-05-16