Affiliation:
1. School of Mathematics and Statistics, Beijing Institute of Technology, Beijing 100081, China
2. Inner Mongolia Ecological Environment Big Data Co., Ltd., Hohhot 010070, China
Abstract
Video surveillance is widely used in monitoring environmental pollution, particularly harmful dust. Currently, manual video monitoring remains the predominant method for analyzing potential pollution, which is inefficient and prone to errors. In this paper, we introduce a new unsupervised method based on latent diffusion models. Specifically, we propose a spatio-temporal network structure, which better integrates the spatial and temporal features of videos. Our conditional guidance mechanism samples frames of input videos to guide high-quality generation and obtains frame-level anomaly scores, comparing generated videos with original ones. We also propose an efficient compression strategy to reduce computational costs, allowing the model to perform in a latent space. The superiority of our method was demonstrated by numerical experiments in three public benchmarks and practical application analysis in coal mining over previous SOTA methods with better AUC, of at most over 3%. Our method accurately detects abnormal patterns in multiple challenging environmental monitoring scenarios, illustrating the potential application possibilities in the environmental protection domain and beyond.
Funder
National Natural Science Foundation of China
Reference45 articles.
1. Baradaran, M., and Bergevin, R. (2021). A critical study on the recent deep learning based semi-supervised video anomaly detection methods. arXiv.
2. Feng, J.C., Hong, F.T., and Zheng, W.S. (2021, January 20–25). Mist: Multiple instance self-training framework for video anomaly detection. Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), Nashville, TN, USA.
3. Liu, Y., Liu, J., Ni, W., and Song, L. (2022, January 18–23). Abnormal event detection with self-guiding multi-instance ranking framework. Proceedings of the International Joint Conference on Neural Networks (IJCNN), Padua, Italy.
4. Wan, B., Fang, Y., Xia, X., and Mei, J. (2020, January 6–10). Weakly supervised video anomaly detection via center-guided discriminative learning. Proceedings of the International Congress on Mathematical Education (ICME), London, UK.
5. Sultani, W., Chen, C., and Shah, M. (2018, January 18–23). Real-world anomaly detection in surveillance videos. Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), Salt Lake City, UT, USA.