A Parallel Open-World Object Detection Framework with Uncertainty Mitigation for Campus Monitoring

Published:2023-11-29 Issue:23 Volume:13 Page:12806
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Dong Jian¹², Zhang Zhange¹, He Siqi³, Liang Yu⁴, Ma Yuqing¹⁵, Yu Jiaqi⁶, Zhang Ruiyan⁶, Li Binbin²

Affiliation:

1. State Key Lab of Software Development Environment, Beihang University, Beijing 100191, China

2. China Electronics Standardization Institute, Beijing 100007, China

3. School of Computer Science, Peking University, Beijing 100871, China

4. Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China

5. Institute of Artificial Intelligence, Beihang University, Beijing 100191, China

6. Beijing Institute of Control and Electronic Technology, Beijing 100038, China

Abstract

Recent advances in artificial intelligence have brought significant changes to education. In intelligent campus development, object detection plays a pivotal role in applications such as campus environment monitoring and classroom behavior surveillance. However, traditional object detection methods struggle in open and dynamic campus scenarios where unexpected objects and behaviors arise. Open-World Object Detection (OWOD) addresses this issue by enabling detectors to gradually learn and recognize unknown objects. Nevertheless, existing OWOD methods introduce two major uncertainties that limit detection performance: the unknown discovery uncertainty, which stems from the manual generation of pseudo-labels for unknown objects, and the known discrimination uncertainty, which stems from perturbations that training on unknown objects introduces to the known-class features. In this paper, we introduce a Parallel OWOD Framework with Uncertainty Mitigation to alleviate both uncertainties within the OWOD task. To address the unknown discovery uncertainty, we propose an objectness-driven discovery module that captures the generalized objectness shared among the known classes, driving the framework to discover more potential objects that are distinct from the background, including unknown objects. To mitigate the known discrimination uncertainty, we decouple the learning processes for known and unknown classes through a parallel structure, which reduces their mutual influence at the feature level, and design a collaborative open-world classifier for joint detection of both known and unknown classes. Our framework provides educators with a tool for campus monitoring and classroom management. Experimental results on standard benchmarks demonstrate the framework's superior performance compared to state-of-the-art methods, indicating its potential for intelligent educational environments.
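
The abstract does not say how the collaborative open-world classifier combines the two parallel branches. As a purely illustrative sketch, assume each candidate box comes with per-class scores from the known-class branch and a class-agnostic objectness score in [0, 1] from the discovery branch. The function name `collaborative_classify`, the thresholds, and the fusion rule are all hypothetical and are not taken from the paper.

```python
from typing import Sequence, Tuple

UNKNOWN = "unknown"
BACKGROUND = "background"


def collaborative_classify(
    known_scores: Sequence[float],
    class_names: Sequence[str],
    objectness: float,
    known_thresh: float = 0.5,   # hypothetical threshold, not from the paper
    object_thresh: float = 0.5,  # hypothetical threshold, not from the paper
) -> Tuple[str, float]:
    """Label one candidate box using two independently produced scores.

    known_scores: per-class confidences from the (assumed) known-class branch.
    objectness:   class-agnostic score from the (assumed) discovery branch.
    """
    if not known_scores or len(known_scores) != len(class_names):
        raise ValueError("known_scores and class_names must be non-empty and equal in length")

    # Index of the most confident known class.
    best = max(range(len(known_scores)), key=lambda i: known_scores[i])

    # 1) A confident known-class prediction wins.
    if known_scores[best] >= known_thresh:
        return class_names[best], known_scores[best]
    # 2) Otherwise, an object-like box with no confident known class is "unknown".
    if objectness >= object_thresh:
        return UNKNOWN, objectness
    # 3) Everything else is treated as background.
    return BACKGROUND, 1.0 - objectness


classes = ["person", "backpack"]
print(collaborative_classify([0.9, 0.05], classes, objectness=0.95))  # ('person', 0.9)
print(collaborative_classify([0.2, 0.1], classes, objectness=0.8))    # ('unknown', 0.8)
print(collaborative_classify([0.1, 0.1], classes, objectness=0.25))   # ('background', 0.75)
```

Because the two scores are computed separately and only combined at decision time, this toy rule mirrors the decoupling idea described above: the unknown decision never alters the known-class scores themselves.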

Funder

National Natural Science Foundation of China

National Key R&D Program of China

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/13/23/12806/pdf
