CNN-ViT Supported Weakly-Supervised Video Segment Level Anomaly Detection-Reference-Cited by-同舟云学术

CNN-ViT Supported Weakly-Supervised Video Segment Level Anomaly Detection

Published:2023-09-07 Issue:18 Volume:23 Page:7734
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Sharif Md. Haidar¹^ORCID,Jiao Lei¹^ORCID,Omlin Christian W.¹

Affiliation:

1. Department of ICT, University of Agder, 4630 Kristiansand, Norway

Abstract

Video anomaly event detection (VAED) is one of the key technologies in computer vision for smart surveillance systems. With the advent of deep learning, contemporary advances in VAED have achieved substantial success. Recently, weakly supervised VAED (WVAED) has become a popular VAED technical route of research. WVAED methods do not depend on a supplementary self-supervised substitute task, yet they can assess anomaly scores straightway. However, the performance of WVAED methods depends on pretrained feature extractors. In this paper, we first address taking advantage of two pretrained feature extractors for CNN (e.g., C3D and I3D) and ViT (e.g., CLIP), for effectively extracting discerning representations. We then consider long-range and short-range temporal dependencies and put forward video snippets of interest by leveraging our proposed temporal self-attention network (TSAN). We design a multiple instance learning (MIL)-based generalized architecture named CNN-ViT-TSAN, by using CNN- and/or ViT-extracted features and TSAN to specify a series of models for the WVAED problem. Experimental results on publicly available popular crowd datasets demonstrated the effectiveness of our CNN-ViT-TSAN.

Funder

Research Council of Norway

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/18/7734/pdf

Reference73 articles.

1. Liu, K., and Ma, H. (2019, January 21–25). Exploring Background-bias for Anomaly Detection in Surveillance Videos. Proceedings of the International Conference on Multimedia (MM), Nice, France.

2. Gong, D., Liu, L., Le, V., Saha, B., Mansour, M.R., Venkatesh, S., and van den Hengel, A. (November, January 27). Memorizing Normality to Detect Anomaly: Memory-Augmented Deep Autoencoder for Unsupervised Anomaly Detection. Proceedings of the International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.

3. Zaheer, M.Z., Mahmood, A., Khan, M.H., Segu, M., Yu, F., and Lee, S.I. (2022, January 18–24). Generative Cooperative Learning for Unsupervised Video Anomaly Detection. Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.

4. Deep Crowd Anomaly Detection by Fusing Reconstruction and Prediction Networks;Sharif;Electronics,2023

5. Anomaly detection: A survey;Chandola;ACM Comput. Surv.,2009

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multimodal knowledge graph construction for risk identification in water diversion projects;Journal of Hydrology;2024-05

2. Anomaly Detection in Weakly Supervised Videos Using Multistage Graphs and General Deep Learning Based Spatial-Temporal Feature Enhancement;IEEE Access;2024

3. Attention Relational Network for Skeleton-Based Group Activity Recognition;IEEE Access;2023