Affiliation:
1. SnT Centre, University of Luxembourg, Luxembourg, Luxembourg
2. Lero Centre, University of Limerick, Limerick, Ireland and School of EECS, University of Ottawa, Ottawa, Canada
Abstract
The adoption of deep neural networks (DNNs) in safety-critical contexts is often prevented by the lack of effective means to explain their results, especially when they are erroneous. In our previous work, we proposed a white-box approach (HUDD) and a black-box approach (SAFE) to automatically characterize DNN failures. They both identify clusters of similar images from a potentially large set of images leading to DNN failures. However, the analysis pipelines for HUDD and SAFE were instantiated in specific ways according to common practices, deferring the analysis of other pipelines to future work.
In this article, we report on an empirical evaluation of 99 different pipelines for root cause analysis of DNN failures. They combine transfer learning, autoencoders, heatmaps of neuron relevance, dimensionality reduction techniques, and different clustering algorithms. Our results show that the best pipeline combines transfer learning, DBSCAN, and UMAP. It leads to clusters almost exclusively capturing images of the same failure scenario, thus facilitating root cause analysis. Further, it generates distinct clusters for each root cause of failure, thus enabling engineers to detect all the unsafe scenarios. Interestingly, these results hold even for failure scenarios that are only observed in a small percentage of the failing images.
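The clustering and evaluation idea described above can be illustrated with a minimal, self-contained sketch. It is not the authors' implementation: the 2-D points are synthetic stand-ins for image features that would, in the reported best pipeline, come from transfer learning followed by UMAP; the scenario names ("A", "B", "C"), the `eps` and `min_samples` values, and the helper names `dbscan` and `cluster_purity` are all assumptions made for illustration. The sketch runs a small pure-Python DBSCAN and then checks how pure each cluster is with respect to the known failure scenario, including a scenario that appears in only a few images.

```python
import math
from collections import Counter


def dbscan(points, eps, min_samples):
    """Minimal DBSCAN; returns one label per point, -1 meaning noise."""
    n = len(points)
    # Precompute eps-neighborhoods (each point counts as its own neighbor).
    neighbors = [
        [j for j in range(n) if math.dist(points[i], points[j]) <= eps]
        for i in range(n)
    ]
    labels = [None] * n  # None = not yet visited
    cluster = -1
    for i in range(n):
        if labels[i] is not None:
            continue
        if len(neighbors[i]) < min_samples:
            labels[i] = -1  # provisional noise; may become a border point later
            continue
        cluster += 1
        labels[i] = cluster
        queue = list(neighbors[i])
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reachable from a core point is a border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            if len(neighbors[j]) >= min_samples:
                queue.extend(neighbors[j])  # j is a core point: keep expanding
    return labels


def cluster_purity(labels, scenarios):
    """Map each cluster id to (dominant scenario, fraction of members in it)."""
    members = {}
    for label, scenario in zip(labels, scenarios):
        if label != -1:
            members.setdefault(label, []).append(scenario)
    result = {}
    for label, names in members.items():
        name, count = Counter(names).most_common(1)[0]
        result[label] = (name, count / len(names))
    return result


# Synthetic 2-D "reduced features" (hypothetical data, not from the study).
offsets = [(dx * 0.2, dy * 0.2) for dx in range(3) for dy in range(3)]
points, scenarios = [], []
for (cx, cy), name, count in [((0, 0), "A", 9), ((5, 5), "B", 9), ((10, 0), "C", 4)]:
    for ox, oy in offsets[:count]:
        points.append((cx + ox, cy + oy))
        scenarios.append(name)
# One isolated failing image that should be reported as noise.
points.append((20.0, 20.0))
scenarios.append("A")

labels = dbscan(points, eps=0.5, min_samples=3)
purity = cluster_purity(labels, scenarios)
for label, (name, fraction) in sorted(purity.items()):
    print(f"cluster {label}: dominant scenario {name}, purity {fraction:.2f}")
```

In this toy setting the rare scenario "C" still receives its own cluster, which mirrors the kind of property the abstract reports. A practical pipeline would more likely rely on library implementations (for example scikit-learn's `DBSCAN` and the `umap-learn` package) than on a hand-written clustering loop.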
Funder
IEE Luxembourg, Luxembourg’s National Research Fund
NSERC of Canada, Discovery and CRC programs
Publisher
Association for Computing Machinery (ACM)