Finding Deviated Behaviors of the Compressed DNN Models for Image Classifications

Published: 2023-07-24, Issue: 5, Volume: 32, Pages: 1-32
ISSN: 1049-331X
Container title: ACM Transactions on Software Engineering and Methodology
Language: en
Short container title: ACM Trans. Softw. Eng. Methodol.

Author:

Tian Yongqiang¹, Zhang Wuqi², Wen Ming³, Cheung Shing-Chi², Sun Chengnian⁴, Ma Shiqing⁵, Jiang Yu⁶

Affiliation:

1. University of Waterloo, Canada and The Hong Kong University of Science and Technology, China

2. The Hong Kong University of Science and Technology, China

3. Huazhong University of Science and Technology, China

4. University of Waterloo, Canada

5. Rutgers University, USA

6. Tsinghua University, China

Abstract

Model compression can significantly reduce the size of deep neural network (DNN) models and thus facilitate the dissemination of sophisticated, sizable DNN models, especially for deployment on mobile or embedded devices. However, the predictions of a compressed model may deviate from those of its original model. To help developers thoroughly understand the impact of model compression, it is essential to test compressed models and find such deviated behaviors before dissemination. This is a non-trivial task, because the architectures and gradients of compressed models are usually not available. To this end, we propose Dflare, a novel, search-based, black-box testing technique that automatically finds triggering inputs that result in deviated behaviors in image classification tasks. Dflare iteratively applies a series of mutation operations to a given seed image until a triggering input is found. For better efficacy and efficiency, Dflare models the search problem as Markov Chains and leverages the Metropolis-Hastings algorithm to guide the selection of mutation operators in each iteration. Further, Dflare utilizes a novel fitness function to prioritize mutated inputs that either cause large differences between the two models' outputs or trigger previously unobserved probability vectors from the models. We evaluated Dflare on 21 compressed models for image classification tasks with three datasets. The results show that Dflare not only consistently outperforms the baseline in terms of efficacy but also significantly improves efficiency: Dflare is 17.84×–446.06× as fast as the baseline in terms of time, and the number of queries Dflare requires to find one triggering input is only 0.186–1.937% of the number issued by the baseline. We also demonstrated that the triggering inputs found by Dflare can be used to repair up to 48.48% of the deviated behaviors in image classification tasks and further decrease the effectiveness of Dflare on the repaired models.
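The search loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the two toy models, the three mutation operators, the score-ratio Metropolis-Hastings acceptance rule, and the fitness definition (L1 gap between the two probability vectors plus a small bonus for an unseen output pair) are all assumptions chosen to keep the example self-contained.

```python
import random

def label(probs):
    """Predicted class: index of the largest probability (first index on ties)."""
    return max(range(len(probs)), key=probs.__getitem__)

def fitness(p_orig, p_comp, seen):
    """Assumed fitness: L1 gap between outputs, plus a bonus for an unseen output pair."""
    gap = sum(abs(a - b) for a, b in zip(p_orig, p_comp))
    key = (tuple(round(a, 2) for a in p_orig), tuple(round(b, 2) for b in p_comp))
    bonus = 0.0 if key in seen else 0.1
    seen.add(key)
    return gap + bonus

def pick_operator(current, scores, rng):
    """One Metropolis-Hastings step over operator indices with a uniform proposal."""
    proposal = rng.randrange(len(scores))
    accept = min(1.0, scores[proposal] / scores[current])
    return proposal if rng.random() < accept else current

def search(seed, original, compressed, operators, budget=1000, rng=None):
    """Mutate the seed until the two models disagree; return that input or None."""
    rng = rng or random.Random(0)
    wins = [1] * len(operators)   # smoothed count of accepted mutations per operator
    tries = [2] * len(operators)  # smoothed count of attempts per operator
    seen = set()
    image = seed
    best = fitness(original(image), compressed(image), seen)
    op = 0
    for _ in range(budget):
        scores = [w / t for w, t in zip(wins, tries)]
        op = pick_operator(op, scores, rng)
        candidate = operators[op](image, rng)
        p_o, p_c = original(candidate), compressed(candidate)
        if label(p_o) != label(p_c):
            return candidate      # triggering input: only black-box queries were used
        tries[op] += 1
        f = fitness(p_o, p_c, seen)
        if f >= best:             # keep mutants that look more promising
            image, best = candidate, f
            wins[op] += 1
    return None

# Hypothetical stand-ins for a model pair: a two-class "classifier" driven by mean
# brightness, and a "compressed" copy whose output is slightly shifted.
def original(image):
    m = sum(image) / len(image)
    return (1.0 - m, m)

def compressed(image):
    m = 0.9 * (sum(image) / len(image)) + 0.02
    return (1.0 - m, m)

def clamp(x):
    return min(1.0, max(0.0, x))

# Hypothetical mutation operators; each takes (image, rng) and returns a new image.
operators = [
    lambda img, rng: tuple(clamp(p + 0.02) for p in img),                 # brighten
    lambda img, rng: tuple(clamp(p - 0.02) for p in img),                 # darken
    lambda img, rng: tuple(clamp(p + rng.gauss(0.0, 0.01)) for p in img), # noise
]

found = search((0.3, 0.3, 0.3, 0.3), original, compressed, operators)
if found is not None:
    print("labels:", label(original(found)), label(compressed(found)))
```

Only the models' output probabilities are queried, which mirrors the black-box setting. Operators whose mutants are accepted more often gain a higher score, so the Metropolis-Hastings step selects them more frequently in later iterations.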

Funder

National Natural Science Foundation of China

Hong Kong RGC/GRF

Hong Kong RGC/RIF

Hong Kong ITF

Hong Kong PhD Fellowship Scheme, HKUST RedBird Academic Excellence Award, and the MSRA Collaborative Research Grant

Cisco Research Gift, Natural Sciences and Engineering Research Council of Canada (NSERC) through the Discovery Grant, and CFI-JELF Project

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Link

https://dl.acm.org/doi/pdf/10.1145/3583564

Cited by 1 article.

1. CoopHance: Cooperative Enhancement for Robustness of Deep Learning Systems;Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis;2023-07-12