A Parallel Open-World Object Detection Framework with Uncertainty Mitigation for Campus Monitoring
-
Published:2023-11-29
Issue:23
Volume:13
Page:12806
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Dong Jian12, Zhang Zhange1, He Siqi3, Liang Yu4ORCID, Ma Yuqing15, Yu Jiaqi6, Zhang Ruiyan6, Li Binbin2
Affiliation:
1. State Key Lab of Software Development Environment, Beihang University, Beijing 100191, China 2. China Electronics Standardization Institute, Beijing 100007, China 3. School of Computer Science, Peking University, Beijing 100871, China 4. Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China 5. Institute of Artificial Intelligence, Beihang University, Beijing 100191, China 6. Beijing Institute of Control and Electronic Technology, Beijing 100038, China
Abstract
The recent advancements in artificial intelligence have brought about significant changes in education. In the context of intelligent campus development, target detection technology plays a pivotal role in applications such as campus environment monitoring and the facilitation of classroom behavior surveillance. However, traditional object detection methods face challenges in open and dynamic campus scenarios where unexpected objects and behaviors arise. Open-World Object Detection (OWOD) addresses this issue by enabling detectors to gradually learn and recognize unknown objects. Nevertheless, existing OWOD methods introduce two major uncertainties that limit the detection performance: the unknown discovery uncertainty from the manual generation of pseudo-labels for unknown objects and the known discrimination uncertainty from perturbations that unknown training introduces to the known class features. In this paper, we introduce a Parallel OWOD Framework with Uncertainty Mitigation to alleviate the unknown discovery uncertainty and the known discrimination uncertainty within the OWOD task. To address the unknown discovery uncertainty, we propose an objectness-driven discovery module to focus on capturing the generalized objectness shared among various known classes, driving the framework to discover more potential objects that are distinct from the background, including unknown objects. To mitigate the discrimination uncertainty, we decouple the learning processes for known and unknown classes through a parallel structure to reduce the mutual influence at the feature level and design a collaborative open-world classifier to achieve high-performance collaborative detection of both known and unknown classes. Our framework provides educators with a powerful tool for effective campus monitoring and classroom management. Experimental results on standard benchmarks demonstrate the framework’s superior performance compared to state-of-the-art methods, showcasing its transformative potential in intelligent educational environments.
Funder
National Natural Science Foundation of China National Key R&D Program of China
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference57 articles.
1. He, K., Zhang, X., Ren, S., and Sun, J. (July, January 26). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA. 2. Arcucci, R., Zhu, J., Hu, S., and Guo, Y.K. (2021). Deep data assimilation: Integrating deep learning with data assimilation. Appl. Sci., 11. 3. Li, F., He, F., Wang, F., Zhang, D., Xia, Y., and Li, X. (2020). A novel simplified convolutional neural network classification algorithm of motor imagery EEG signals based on deep learning. Appl. Sci., 10. 4. Shieh, C.S., Lin, W.W., Nguyen, T.T., Chen, C.H., Horng, M.F., and Miu, D. (2021). Detection of unknown ddos attacks with deep learning and gaussian mixture model. Appl. Sci., 11. 5. Chiu, M.T., Xu, X., Wei, Y., Huang, Z., Schwing, A.G., Brunner, R., Khachatrian, H., Karapetyan, H., Dozier, I., and Rose, G. (2020, January 13–19). Agriculture-vision: A large aerial image database for agricultural pattern analysis. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.
|
|