Abstract
AbstractIn the field of deep learning based computer vision, the development of deep object detection has led to unique paradigms (e.g., two-stage or set-based) and architectures (e.g., Faster-RCNN or DETR) which enable outstanding performance on challenging benchmark datasets. Despite this, the trained object detectors typically do not reliably assess uncertainty regarding their own knowledge, and the quality of their probabilistic predictions is usually poor. As these are often used to make subsequent decisions, such inaccurate probabilistic predictions must be avoided. In this work, we investigate the uncertainty calibration properties of different pretrained object detection architectures in a multi-class setting. We propose a framework to ensure a fair, unbiased, and repeatable evaluation and conduct detailed analyses assessing the calibration under distributional changes (e.g., distributional shift and application to out-of-distribution data). Furthermore, by investigating the influence of different detector paradigms, post-processing steps, and suitable choices of metrics, we deliver novel insights into why poor detector calibration emerges. Based on these insights, we are able to improve the calibration of a detector by simply finetuning its last layer.
Publisher
Springer Science and Business Media LLC
Reference30 articles.
1. Bieshaar, M., Zernetsch, S., Hubert, A., Sick, B., & Doll, K. (2018). Cooperative starting movement detection of cyclists using convolutional neural networks and a boosted stacking ensemble. IEEE Transactions on Intelligent Vehicles, 3(4), 534–44.
2. Bishop, C. M. (2006). Pattern recognition and machine learning. Springer.
3. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., & Zagoruyko, S. (2020). End-to-end object detection with transformers. European conference on computer vision (pp. 213–229).
4. Du, X., Gozum, G., Ming, Y., & Li, Y. (2022). Siren: Shaping representations for detecting out-of-distribution objects. Advances in Neural Information Processing Systems, 35, 20434–20449.
5. Du, X., Wang, X., Gozum, G., & Li, Y. (2022). Unknown-aware object detection: Learning what you don’t know from videos in the wild. Proceedings of the ieee/cvf conference on computer vision and pattern recognition (pp. 13678–13688).