Hierarchical Distribution-aware Testing of Deep Learning-Reference-Cited by-同舟云学术

Hierarchical Distribution-aware Testing of Deep Learning

Published:2023-12-21 Issue:2 Volume:33 Page:1-35
ISSN:1049-331X
Container-title:ACM Transactions on Software Engineering and Methodology
language:en
Short-container-title:ACM Trans. Softw. Eng. Methodol.

Author:

Huang Wei¹^ORCID,Zhao Xingyu²^ORCID,Banks Alec³^ORCID,Cox Victoria³^ORCID,Huang Xiaowei⁴^ORCID

Affiliation:

1. Purple Mountain Laboratories, China and University of Liverpool, U.K.

2. WMG, University of Warwick, U.K.

3. Defence Science and Technology Laboratory, U.K.

4. University of Liverpool, U.K.

Abstract

With its growing use in safety/security-critical applications, Deep Learning (DL) has raised increasing concerns regarding its dependability. In particular, DL has a notorious problem of lacking robustness. Input added with adversarial perturbations, i.e., Adversarial Examples (AEs) , are easily mispredicted by the DL model. Despite recent efforts made in detecting AEs via state-of-the-art attack and testing methods, they are normally input distribution–agnostic and/or disregard the perceptual quality of adversarial perturbations. Consequently, the detected AEs are irrelevant inputs in the application context or noticeably unrealistic to humans. This may lead to a limited effect on improving the DL model’s dependability, as the testing budget is likely to be wasted on detecting AEs that are encountered very rarely in its real-life operations. In this article, we propose a new robustness testing approach for detecting AEs that considers both the feature-level distribution and the pixel-level distribution, capturing the perceptual quality of adversarial perturbations. The two considerations are encoded by a novel hierarchical mechanism. First, we select test seeds based on the density of feature-level distribution and the vulnerability of adversarial robustness. The vulnerability of test seeds is indicated by the auxiliary information, which are highly correlated with local robustness. Given a test seed, we then develop a novel genetic algorithm–based local test case generation method, in which two fitness functions work alternatively to control the perceptual quality of detected AEs. Finally, extensive experiments confirm that our holistic approach considering hierarchical distributions is superior to the state-of-the-arts that either disregard any input distribution or only consider a single (non-hierarchical) distribution, in terms of not only detecting imperceptible AEs but also improving the overall robustness of the DL model under testing.

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Link

https://dl.acm.org/doi/pdf/10.1145/3625290

Reference66 articles.

1. Haldun Akoglu. 2018. User’s guide to correlation coefficients. Turk. J. Emergen. Med. 18 3 (2018) 91–93.

2. GenAttack

3. Mohammed Attaoui Hazem Fahmy Fabrizio Pastore and Lionel Briand. 2023. Black-box safety analysis and retraining of DNNs based on feature extraction and clustering. ACM Trans. Softw. Eng. Methodol. 32 3 Article 79 (2023) 40 pages. 79

4. David Berend. 2021. Distribution awareness for AI system testing. In Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering. IEEE, 96–98.

5. Cats are not fish

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Shielding Object Detection: Enhancing Adversarial Defense through Ensemble Methods;2024 5th Information Communication Technologies Conference (ICTC);2024-05-10

2. What, Indeed, is an Achievable Provable Guarantee for Learning-Enabled Safety-Critical Systems;Bridging the Gap Between AI and Reality;2023-12-14

3. Continuous Engineering for Trustworthy Learning-Enabled Autonomous Systems;Bridging the Gap Between AI and Reality;2023-12-14

4. Transcend Adversarial Examples: Diversified Adversarial Attacks to Test Deep Learning Model;2023 IEEE 41st International Conference on Computer Design (ICCD);2023-11-06