Active Visual Perception Enhancement Method Based on Deep Reinforcement Learning
-
Published:2024-04-25
Issue:9
Volume:13
Page:1654
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Yang Zhonglin12, Fang Hao1, Liu Huanyu1, Li Junbao1, Jiang Yutong2, Zhu Mengqi2
Affiliation:
1. School of Cyberspace Security, Harbin Institute of Technology, Harbin 150000, China 2. Information and Control Technology Department, China North Vehicle Research Institute, Beijing 100072, China
Abstract
Traditional object detection methods using static cameras are constrained by their limited perspectives, hampering the effective detection of low-confidence targets. To address this challenge, this study introduces a deep reinforcement learning-based visual perception enhancement technique. This approach leverages pan–tilt–zoom (PTZ) cameras to achieve active vision, enabling them to autonomously make decisions and actions tailored to the current scene and object detection outcomes. This optimization enhances both the object detection process and information acquisition, significantly boosting the intelligent perception capabilities of PTZ cameras. Experimental findings demonstrate the robust generalization capabilities of this method across various object detection algorithms, resulting in an average confidence level improvement of 23.80%.
Funder
National Natural Science Foundation of China Interdisciplinary Research Foundation of HIT
Reference23 articles.
1. Video Surveillance Systems-Current Status and Future Trends;Tsakanikas;Comput. Electr. Eng.,2018 2. A Small Target Detection Method Based on Deep Learning with Considerate Feature and Effectively Expanded Sample Size;Zhang;IEEE Access,2021 3. Wang, S., Tian, Y., and Xu, Y. (2015, January 19–22). Automatic Control of PTZ Camera Based on Object Detection and Scene Partition. Proceedings of the 2015 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), Ningbo, China. 4. Wang, X., Van De Weem, J., and Jonker, P. (2013, January 25–29). An Advanced Active Vision System Imitating Human Eye Movements. Proceedings of the 2013 16th International Conference on Advanced Robotics (ICAR), Montevideo, Uruguay. 5. Brockman, G., Cheung, V., Pettersson, L., Schneider, J., Schulman, J., Tang, J., and Zaremba, W. (2016). OpenAI Gym 2016. arXiv.
|
|