Uncertainty quantification for deep neural networks: An empirical comparison and usage guidelines

Author:

Michael Weiss¹, Paolo Tonella¹

Affiliation:

1. Software Institute, Università della Svizzera italiana, Lugano, Switzerland

Abstract

Deep neural networks (DNNs) are increasingly used as components of larger software systems that need to process complex data, such as images, written text, and audio/video signals. DNN predictions cannot be assumed to be always correct, for several reasons: the huge input space to be dealt with, the ambiguity of some input data, and the intrinsic properties of learning algorithms, which can provide only statistical guarantees. Hence, developers have to cope with some residual error probability. An architectural pattern commonly adopted to manage failure-prone components is the supervisor: an additional component that estimates the reliability of the predictions made by untrusted (e.g., DNN) components and activates an automated healing procedure when they are likely to fail, ensuring that the deep learning-based system (DLS) does not cause damage even though its main functionality is suspended.

In this paper, we consider DLSs that implement a supervisor by means of uncertainty estimation. After overviewing the main approaches to uncertainty estimation and discussing their pros and cons, we motivate the need for a specific empirical assessment method that can deal with the experimental setting in which supervisors are used, where the accuracy of the DNN matters only as long as the supervisor lets the DLS continue to operate. We then present a large empirical study conducted to compare the alternative approaches to uncertainty estimation, and we distill a set of guidelines that help developers incorporate a supervisor based on uncertainty monitoring into a DLS.
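To make the supervisor pattern concrete, the sketch below shows one minimal way such a component could be wired around a classifier. It is an illustration under stated assumptions, not the authors' implementation: it uses softmax entropy, one of the simplest uncertainty estimators in the family the paper compares, and the function names (`softmax_entropy`, `supervised_predict`, `supervised_metrics`) and the threshold value are hypothetical. It also shows the evaluation setting described in the abstract, where accuracy is counted only on the inputs the supervisor accepts.

```python
import numpy as np

def softmax_entropy(logits: np.ndarray) -> float:
    """Predictive entropy of the softmax distribution: a cheap, plain-network
    uncertainty score (higher entropy = more uncertain)."""
    z = logits - logits.max()                 # shift for numerical stability
    p = np.exp(z) / np.exp(z).sum()
    return float(-(p * np.log(p + 1e-12)).sum())

def supervised_predict(logits: np.ndarray, threshold: float):
    """Supervisor: accept the DNN prediction only when uncertainty is low.

    `threshold` is a hypothetical, task-specific value that would have to be
    calibrated empirically (e.g., on a validation set). Returning None signals
    that the healing procedure should take over instead of the DNN."""
    if softmax_entropy(logits) <= threshold:
        return int(np.argmax(logits))         # trust the DNN prediction
    return None                               # hand control to the fallback

def supervised_metrics(all_logits, labels, threshold: float):
    """Evaluation in the supervised setting: coverage is the fraction of
    inputs the supervisor accepts; accuracy is computed only on those."""
    accepted, correct = 0, 0
    for logits, y in zip(all_logits, labels):
        pred = supervised_predict(logits, threshold)
        if pred is not None:
            accepted += 1
            correct += pred == y
    coverage = accepted / len(labels)
    acc_on_accepted = correct / accepted if accepted else float("nan")
    return coverage, acc_on_accepted

# Example with mock logits: a confident input is accepted, an ambiguous one
# (uniform logits, entropy = ln 3 ≈ 1.10) is deferred to the healing procedure.
print(supervised_metrics(
    [np.array([9.0, 0.0, 0.0]), np.array([1.0, 1.0, 1.0])],
    labels=[0, 2], threshold=0.5))            # -> (0.5, 1.0)
```

In a deployed DLS, a `None` result would trigger the healing procedure, for example handing control to a human operator or switching to a safe default behavior, so that the system remains safe while its main functionality is suspended.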

Publisher

Wiley

Subject

Safety, Risk, Reliability and Quality; Software


Cited by

3 articles