Affiliation:
1. Department of Computer Science, Chungbuk National University, 1 Chungdae-ro, Seowon-gu, Cheongju, Chungbuk 28644, Republic of Korea
2. Department of Computer Engineering, Kumoh National Institute of Technology, 61 Daehak-ro, Gumi, Gyeongbuk 39177, Republic of Korea
Abstract
This paper presents a novel approach to risk assessment by incorporating image captioning as a fundamental component to enhance the effectiveness of surveillance systems. The proposed surveillance system utilizes image captioning to generate descriptive captions that portray the relationship between objects, actions, and space elements within the observed scene. Subsequently, it evaluates the risk level based on the content of these captions. After defining the risk levels to be detected in the surveillance system, we constructed a dataset consisting of [Image-Caption-Danger Score]. Our dataset offers caption data presented in a unique sentence format, departing from conventional caption styles. This unique format enables a comprehensive interpretation of surveillance scenes by considering various elements, such as objects, actions, and spatial context. We fine-tuned the BLIP-2 model using our dataset to generate captions, and captions were then interpreted with BERT to evaluate the risk level of each scene, categorizing them into stages ranging from 1 to 7. Multiple experiments provided empirical support for the effectiveness of the proposed system, demonstrating significant accuracy rates of 92.3%, 89.8%, and 94.3% for three distinct risk levels: safety, hazard, and danger, respectively.
Funder
academic research program of Chungbuk National University
Reference44 articles.
1. The Business Research Company (2023, November 09). Surveillance Technology Global Market Report. Available online: https://www.thebusinessresearchcompany.com/report/surveillance-technology-global-market-report.
2. A Hybrid CNN and LSTM-Based Deep Learning Model for Abnormal Behavior Detection;Chang;Multimed. Tools Appl.,2022
3. Alairaji, R.M., Aljazaery, I.A., and Alrikabi, H.T.S. (2022). Advanced Computational Paradigms and Hybrid Intelligent Computing: Proceedings of ICACCP 2021, Springer.
4. Video Crowd Detection and Abnormal Behavior Model Detection Based on Machine Learning Method;Xie;Neural Comput. Appl.,2019
5. Skeleton-Based Abnormal Behavior Detection Using Secure Partitioned Convolutional Neural Network Model;Qiu;IEEE J. Biomed. Health Inform.,2021