Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video-Reference-Cited by-同舟云学术

Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video

Published:2021-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Wu Jie¹²,Zhang Wei³,Li Guanbin¹,Wu Wenhao³,Tan Xiao³,Li Yingying³,Ding Errui³,Lin Liang¹

Affiliation:

1. Sun Yat-sen University

2. ByteDance Inc.

3. Baidu Inc.

Abstract

In this paper, we introduce a novel task, referred to as Weakly-Supervised Spatio-Temporal Anomaly Detection (WSSTAD) in surveillance video. Specifically, given an untrimmed video, WSSTAD aims to localize a spatio-temporal tube (i.e., a sequence of bounding boxes at consecutive times) that encloses the abnormal event, with only coarse video-level annotations as supervision during training. To address this challenging task, we propose a dual-branch network which takes as input the proposals with multi-granularities in both spatial-temporal domains. Each branch employs a relationship reasoning module to capture the correlation between tubes/videolets, which can provide rich contextual information and complex entity relationships for the concept learning of abnormal behaviors. Mutually-guided Progressive Refinement framework is set up to employ dual-path mutual guidance in a recurrent manner, iteratively sharing auxiliary supervision information across branches. It impels the learned concepts of each branch to serve as a guide for its counterpart, which progressively refines the corresponding branch and the whole framework. Furthermore, we contribute two datasets, i.e., ST-UCF-Crime and STRA, consisting of videos containing spatio-temporal abnormal annotations to serve as the benchmarks for WSSTAD. We conduct extensive qualitative and quantitative evaluations to demonstrate the effectiveness of the proposed approach and analyze the key factors that contribute more to handle this task.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Semantic-driven dual consistency learning for weakly supervised video anomaly detection;Pattern Recognition;2025-01

2. Anomalies cannot materialize or vanish out of thin air: A hierarchical multiple instance learning with position-scale awareness for video anomaly detection;Expert Systems with Applications;2024-11

3. VPE-WSVAD: Visual prompt exemplars for weakly-supervised video anomaly detection;Knowledge-Based Systems;2024-09

4. Anomaly detection in surveillance videos using Transformer with margin learning;Multimedia Systems;2024-08-16

5. Weakly-Supervised Video Anomaly Detection With Snippet Anomalous Attention;IEEE Transactions on Circuits and Systems for Video Technology;2024-07