Validity Matters: Uncertainty‐Guided Testing of Deep Neural Networks

Author:

Jiang Zhouxian (1), Li Honghui (1), Wang Rui (1, 2, 3), Tian Xuetao (4), Liang Ci (5), Yan Fei (6), Zhang Junwen (1), Liu Zhen (1)

Affiliation:

1. School of Computer Science and Technology, Beijing Jiaotong University, Beijing, China

2. Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China

3. Collaborative Innovation Center of Railway Traffic Safety, Beijing Jiaotong University, Beijing, China

4. Faculty of Psychology, Beijing Normal University, Beijing, China

5. School of Transportation Science and Engineering, Harbin Institute of Technology, Harbin, China

6. School of Automation and Intelligence, Beijing Jiaotong University, Beijing, China

Abstract

Despite the many applications of deep learning to critical tasks across domains, advanced deep neural networks (DNNs) face persistent safety and security challenges, such as overconfidence when predicting out‐of‐distribution samples and susceptibility to adversarial examples. Thorough testing that explores the input space is a key strategy for ensuring the robustness and trustworthiness of these networks. However, existing testing methods focus on exposing more erroneous model behaviours and overlook the validity of the generated test inputs. To mitigate this issue, we investigate how to devise a valid test input generation method for DNNs from a predictive uncertainty perspective. Through a large‐scale empirical study of 11 predictive uncertainty metrics for DNNs, we explore the correlation between the validity and the uncertainty of test inputs. Our findings reveal that predictive entropy‐based and ensemble‐based uncertainty metrics effectively characterize input validity. Building on these insights, we introduce UCTest, an uncertainty‐guided deep learning testing approach that efficiently generates valid and authentic test inputs. We formulate a joint optimization objective: uncover the model's misbehaviours by maximizing the loss function while concurrently generating valid test inputs by minimizing uncertainty. Extensive experiments demonstrate that our approach outperforms current testing methods in generating valid test inputs. Furthermore, incorporating natural variation into UCTest through data augmentation techniques effectively boosts the diversity of the generated test inputs.
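The joint objective described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the exact uncertainty metric, the loss, or how the two terms are weighted. The sketch assumes cross-entropy as the loss, predictive entropy as the uncertainty term, and a hypothetical weight `lam`. All function names are illustrative. `ensemble_entropy` shows one common ensemble-based measure (entropy of the averaged prediction), which may differ from the metrics studied in the paper.

```python
import math

def softmax(logits):
    """Convert raw scores to probabilities (shifted for numerical stability)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predictive_entropy(probs, eps=1e-12):
    """Shannon entropy of one predictive distribution; higher means more uncertain."""
    return -sum(p * math.log(p + eps) for p in probs)

def ensemble_entropy(member_probs):
    """Entropy of the mean prediction over ensemble members (assumed metric)."""
    n = len(member_probs)
    mean = [sum(col) / n for col in zip(*member_probs)]
    return predictive_entropy(mean)

def cross_entropy(probs, label, eps=1e-12):
    """Loss of the prediction with respect to the true label."""
    return -math.log(probs[label] + eps)

def joint_objective(logits, label, lam=1.0):
    """Score to maximize: high loss (misbehaviour) and low uncertainty (validity).

    `lam` is a hypothetical trade-off weight, not a value from the paper.
    """
    probs = softmax(logits)
    return cross_entropy(probs, label) - lam * predictive_entropy(probs)

# Toy candidates for a 3-class model whose true label is class 0.
candidates = {
    "confident_correct": [10.0, 0.0, 0.0],
    "uncertain": [0.0, 0.0, 0.0],
    "confident_wrong": [0.0, 10.0, 0.0],
}
scores = {name: joint_objective(z, label=0) for name, z in candidates.items()}
best = max(scores, key=scores.get)
print(best, round(scores[best], 3))
```

Under these assumptions, the candidate that is misclassified with low entropy scores highest, which matches the stated goal of finding inputs that expose misbehaviour while remaining low in uncertainty. In a real test generator the score would be optimized over input perturbations, typically with gradients from a deep learning framework, rather than compared across fixed logits.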

Publisher

Wiley
