Progressive prediction: Video anomaly detection via multi‐grained prediction-Reference-Cited by-同舟云学术

Progressive prediction: Video anomaly detection via multi‐grained prediction

Published:2024-06-03 Issue:10 Volume:18 Page:2568-2583
ISSN:1751-9659
Container-title:IET Image Processing
language:en
Short-container-title:IET Image Processing

Author:

Zeng Xianlin¹^ORCID,Jiang Yalong²^ORCID,Wang Yufeng²,Fu Qiang²,Ding Wenrui²

Affiliation:

1. School of Electrical and Information Engineering Beihang University Beijing China

2. Unmanned System Research Institute Beihang University Beijing China

Abstract

AbstractVideo Anomaly Detection (VAD) has been an active research field for several decades. However, most existing approaches merely extract a single type of feature from videos and define a single paradigm to indicate the extent of abnormalities. A coarse‐to‐fine three‐level prediction is built by integrating different levels of spatio‐temporal representations, better highlighting the difference between normal and abnormal behaviors. First, an object‐level trajectory prediction is proposed to model human historical position using a graph transformer network. Subsequently, skeleton‐level prediction is achieved by incorporating the positional information from the trajectory prediction. More importantly, based on the predicted skeleton, a skeleton‐guided pixel‐level region prediction is performed. A novel Skeleton Conditioned Generative Adversarial Network (SCGAN) is designed to explore the correlation between skeleton‐level and pixel‐level motion prediction. Benefiting from SCGAN, the prediction of human regions is contributed by both coarse‐grained and fine‐grained motion features. This three‐level prediction, namely Progressive Prediction Video Anomaly Detection (P3VAD), enlarges the prediction error on irregular motion patterns. Besides, a pixel‐level analysis method is proposed to achieve Background‐bias Elimination (BE) and denoise the predicted region. Experimental results validate the effectiveness of P3VAD on the four benchmark datasets (ShanghaiTech, CUHK Avenue, IITB‐Corridor, and ADOC).

Funder

Natural Science Foundation of Beijing Municipality

Publisher

Institution of Engineering and Technology (IET)

Reference78 articles.

1. Ionescu R.T. Khan F.S. Georgescu M.‐I. Shao L.:Object‐centric auto‐encoders and dummy anomalies for abnormal event detection in video. In:Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition pp. 7842–7851.IEEE Piscataway(2019)

2. Luo W. Liu W. Gao S.:Remembering history with convolutional lstm for anomaly detection. In:2017 IEEE International Conference on Multimedia and Expo (ICME) pp. 439–444.IEEE Piscataway(2017)

3. Park H. Noh J. Ham B.:Learning memory‐guided normality for anomaly detection. In:Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition pp. 14 372–14 381.IEEE Piscataway(2020)

4. Influence-Aware Attention Networks for Anomaly Detection in Surveillance Videos

5. Pourreza M. Salehi M. Sabokrou M.:Ano‐graph: Learning normal scene contextual graphs to detect video anomalies. arXiv preprint arXiv:2103.10502 (2021)

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Intelligent design and optimization of exercise equipment based on fusion algorithm of YOLOv5-ResNet 50;Alexandria Engineering Journal;2024-10