Abstract
In this paper, we apply four CNN-based models, VGG-19, VGG-16, InceptionV3, and MobileNetV3, in improved versions of earlier models for violence detection and recognition in videos. Each proposed model uses a pre-trained network as the base model for feature extraction, with the base layers frozen. Classification is performed by a head model consisting of an AveragePooling2D layer with a (5, 5) pool size, a flatten layer, a single dense layer of 512 nodes with ReLU activation, a dropout layer with a rate of 0.5, and an output layer with 2 classes and softmax activation. The same fully connected head is used in all of the proposed models. The models are trained and evaluated on the Hockey Fight dataset and the Real Life Violence Situations dataset. In our experiments, the proposed models achieve higher accuracy and better scores on the other performance metrics than previous models, while using fewer parameters and less computation time.
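To give a sense of how small the classification head is, the following minimal sketch traces tensor shapes through the head and counts its trainable parameters. It is an illustration under stated assumptions, not the authors' code: the 7 x 7 x 512 feature map assumes a VGG-16 base fed 224 x 224 inputs (the abstract does not state the input size), the pooling stride is assumed equal to the pool size with no padding, and the helper names `pooled_size`, `dense_params`, and `head_summary` are hypothetical.

```python
def pooled_size(size, pool):
    """Output length of a pooling layer, assuming no padding and stride == pool size."""
    return (size - pool) // pool + 1


def dense_params(n_in, n_out):
    """Weights plus biases of a fully connected layer."""
    return n_in * n_out + n_out


def head_summary(feature_map, pool=(5, 5), hidden=512, classes=2):
    """Trace shapes through: average pooling -> flatten -> dense(512) -> dropout -> dense(2).

    Pooling, flatten, and dropout layers have no trainable parameters,
    so only the two dense layers contribute to the count.
    """
    height, width, channels = feature_map
    pooled = (pooled_size(height, pool[0]), pooled_size(width, pool[1]), channels)
    flattened = pooled[0] * pooled[1] * pooled[2]
    params = dense_params(flattened, hidden) + dense_params(hidden, classes)
    return {"pooled": pooled, "flattened": flattened, "trainable_params": params}


# Assumption: a VGG-16 base without its top layers, 224 x 224 input -> 7 x 7 x 512 features.
vgg16_head = head_summary((7, 7, 512))
print(vgg16_head)
```

Under these assumptions the (5, 5) pooling collapses the feature map to a single spatial position, so the dense layers see only 512 inputs and the head stays in the range of a few hundred thousand trainable parameters. Because the base is frozen, these are the only weights updated during training.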
Publisher
Research Square Platform LLC