Local performance evaluation of AI-algorithms with the generalized spatial recall index
Author:
Müller Patrick1, Braun Alexander2ORCID
Affiliation:
1. Hochschule Düsseldorf, Fachbereich Elektro- und Informationstechnik , Düsseldorf , Deutschland 2. Electrical Engineering and Information Technology , University of Applied Sciences Dusseldorf , Münsterstraße 156, 40476 , Düsseldorf , NRW , Germany
Abstract
Abstract
We have developed a novel metric to gauge the performance of artificial intelligence (AI) or machine learning (ML) algorithms, called the Spatial Recall Index (SRI). The novelty is the spatial resolution of a standard performance indicator, as a Recall value is assigned to each individual pixel. This generates a distribution of the performance of a given AI-algorithm with the resolution of the images in the dataset. While the mathematical basis has already been presented before, here we demonstrate the usage on more datasets and delve into in-depth application examples. We examine both the MS COCO and the Berkeley Deep Drive datasets, using a state-of-the-art object detection algorithm. The dataset is degraded using a physical-realistic lens-model, where the optical performance varies over the field of view, as a real camera would. This study highlights the usefulness of the SRI, as every image has been taken by realistic optics. A generalization, the GSRI is introduced, from which we derive SRIA, weighting with object area and SRIrisk intended for autonomous driving. Finally, these metrics are compared.
Publisher
Walter de Gruyter GmbH
Subject
Electrical and Electronic Engineering,Instrumentation
Reference39 articles.
1. I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning, MIT Press, 2016. Available at: http://www.deeplearningbook.org. 2. G. Litjens, T Kooi, B. E Bejnordi, et al.., “A survey on Deep learning in medical image analysis,” Med. Image Anal., vol. 42, pp. 60–88, 2017, arXiv: 1702.05747[cs]. http://arxiv.org/abs/1702.05747. 3. F. Yu,H. Chen,X. Wang, et al.., “BDD100K: a diverse driving dataset for heterogeneous multitask learning,” in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 2633–2642. 4. C. Michaelis, B. Mitzkus,R. Geirhos, et al.., “Benchmarking robustness in object detection: autonomous driving when winter is coming,” arXiv:1907.07484 [cs, stat], 2020. http://arxiv.org/abs/1907.07484. 5. H. Caesar, V. Bankiti,A. Lang, et al.., “nuScenes: a multimodal dataset for autonomous driving,” in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, IEEE, 2020, pp. 11618–11628. Available at: https://ieeexplore.ieee.org/document/9156412/ [accessed: Jun. 29, 2022].
|
|