Improved Test Input Prioritization Using Verification Monitors with False Prediction Cluster Centroids
-
Published:2023-12-19
Issue:1
Volume:13
Page:21
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Hwang Hyekyoung1ORCID, Chun Il Yong1ORCID, Shin Jitae1ORCID
Affiliation:
1. Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon 16419, Republic of Korea
Abstract
Deep learning (DL) systems have been remarkably successful in various applications, but they could have critical misbehaviors. To identify the weakness of a trained model and overcome it with new data collection(s), one needs to figure out the corner cases of a trained model. Constructing new datasets to retrain a DL model requires extra budget and time. Test input prioritization (TIP) techniques have been proposed to identify corner cases more effectively. The state-of-the-art TIP approach adopts a monitoring method to TIP and prioritizes based on Gini impurity; one estimates the similarity between a DL prediction probability and uniform distribution. This letter proposes a new TIP method that uses a distance between false prediction cluster (FPC) centroids in a training set and a test instance in the last-layer feature space to prioritize error-inducing instances among an unlabeled test set. We refer to the proposed method as DeepFPC. Our numerical experiments show that the proposed DeepFPC method achieves significantly improved TIP performance in several image classification and active learning tasks.
Funder
National Research Foundation of Korea Institute for Information and Communications Technology Promotion
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference30 articles.
1. Yang, W., Hongwei, G., Yueqiu, J., and Xin, Z. (2022). A Novel Approach to Maritime Image Dehazing Based on a Large Kernel Encoder–Decoder Network with Multihead Pyramids. Electronics, 11. 2. Deep learning based computer vision approaches for smart agricultural applications;Dhanya;Artif. Intell. Agric.,2022 3. Ahmed, W., Morerio, P., and Murino, V. (2022, January 4–8). Cleaning noisy labels by negative ensemble learning for source-free unsupervised domain adaptation. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2022), Waikoloa, HI, USA. 4. On the implicit bias in deep-learning algorithms;Vardi;Commun. ACM,2023 5. Ma, L., Felix, J., Fuyuan, Z., Jiyuan, S., Minhui, X., Bo, L., Chunyang, C., Ting, S., Li, L., and Yang, L. (2018, January 3–7). Deepgauge: Multi-Granularity Testing Criteria for Deep Learning Systems. Proceedings of the 33rd ACM/IEEE international conference on Automated Software Engineering (ASE 2018), Montpellier, France.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|