Author:
Liu Gang,Shu Lisheng,Yang Yuhui,Jin Chen
Abstract
In this paper, an innovative approach to detecting anomalous occurrences in video data without supervision is introduced, leveraging contextual data derived from visual characteristics and effectively addressing the semantic discrepancy that exists between visual information and the interpretation of atypical incidents. Our work incorporates Unmanned Aerial Vehicles (UAVs) to capture video data from a different perspective and to provide a unique set of visual features. Specifically, we put forward a technique for discerning context through scene comprehension, which entails the construction of a spatio-temporal contextual graph to represent various aspects of visual information. These aspects encompass the manifestation of objects, their interrelations within the spatio-temporal domain, and the categorization of the scenes captured by UAVs. To encode context information, we utilize Transformer with message passing for updating the graph's nodes and edges. Furthermore, we have designed a graph-oriented deep Variational Autoencoder (VAE) approach for unsupervised categorization of scenes, enabling the extraction of the spatio-temporal context graph across diverse settings. In conclusion, by utilizing contextual data, we ascertain anomaly scores at the frame-level to identify atypical occurrences. We assessed the efficacy of the suggested approach by employing it on a trio of intricate data collections, specifically, the UCF-Crime, Avenue, and ShanghaiTech datasets, which provided substantial evidence of the method's successful performance.
Subject
Public Administration,Urban Studies,Renewable Energy, Sustainability and the Environment
Reference59 articles.
1. “Video parsing for abnormality detection.”;Antić,2011
2. Visual objects in context;Bar;Nat. Rev. Neurosci,2004
3. “Video anomaly detection and localization using hierarchical feature representation and gaussian process regression,”;Cheng;Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2015
4. Context models and out-of-context objects;Choi;Pattern Recogn. Lett,2012
5. “Abnormal event detection in videos using spatiotemporal autoencoder,”;Chong;Advances in Neural Networks-ISNN 2017: 14th International Symposium, ISNN 2017, Sapporo, Hakodate, and Muroran, Hokkaido, Japan, June 21-26, 2017, Proceedings, Part II 14,2017
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献