Corner cases in machine learning processes-Reference-Cited by-同舟云学术

Corner cases in machine learning processes

Published:2024-01-02 Issue:1 Volume:6 Page:
ISSN:2948-2143
Container-title:AI Perspectives & Advances
language:en
Short-container-title:AI Perspect. Adv.

Author:

Heidecker Florian^ORCID,Bieshaar Maarten^ORCID,Sick Bernhard^ORCID

Abstract

AbstractApplications using machine learning (ML), such as highly autonomous driving, depend highly on the performance of the ML model. The data amount and quality used for model training and validation are crucial. If the model cannot detect and interpret a new, rare, or perhaps dangerous situation, often referred to as a corner case, we will likely blame the data for not being good enough or too small in number. However, the implemented ML model and its associated architecture also influence the behavior. Therefore, the occurrence of prediction errors resulting from the ML model itself is not surprising. This work addresses a corner case definition from an ML model’s perspective to determine which aspects must be considered. To achieve this goal, we present an overview of properties for corner cases that are beneficial for the description, explanation, reproduction, or synthetic generation of corner cases. To define ML corner cases, we review different considerations in the literature and summarize them in a general description and mathematical formulation, whereby the expected relevance-weighted loss is the key to distinguishing corner cases from common data. Moreover, we show how to operationalize the corner case characteristics to determine the value of a corner case. To conclude, we present the extended taxonomy for ML corner cases by adding the input, model, and deployment levels, considering the influence of the corner case properties.

Funder

Bundesministerium für Wirtschaft und Klimaschutz

Universität Kassel

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1186/s42467-023-00015-y.pdf

Reference90 articles.

1. Laplante P, Milojicic D, Serebryakov S, Bennett D (2020) Artificial Intelligence and Critical Systems: From Hype to Reality. Computer 53(11):45–52

2. He K, Gkioxari G, Dollár P, Girshick R (2017) Mask R-CNN. In: Proc. of the International Conference on Computer Vision, Venice, pp 2980–2988

3. Wu Y, Kirillov A, Massa F, Lo WY, Girshick R (2019) Detectron2. https://github.com/facebookresearch/detectron2. Accessed 15 July 2022.

4. Baevski A, Zhou Y, Mohamed A, Auli M (2020) wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. In: Proc. of the Advances in Neural Information Processing Systems, Vancouver, pp 12449–12460

5. Zhao WX, Zhou K, Li J, Tang T, Wang X, Hou Y, Min Y, Zhang B, Zhang J, Dong Z, Du Y, Yang C, Chen Y, Chen Z, Jiang J, Ren R, Li Y, Tang X, Liu Z, Liu P, Nie JY, Wen JR (2023) A Survey of Large Language Models. arXiv preprint arXiv:2303.18223

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Analysis of Cloud Computing Technology Network Software Educational Affairs Human Resources Development Process and its Applications;2024 International Conference on Science Technology Engineering and Management (ICSTEM);2024-04-26