Affiliation:
1. Xi’an Jiaotong University, China
2. Singapore Management University, Singapore
Abstract
Deep learning (DL) has been applied in many applications. Meanwhile, the quality of DL systems is becoming a big concern. To evaluate the quality of DL systems, a number of DL testing techniques have been proposed. To generate test cases, a set of initial seed inputs are required. Existing testing techniques usually construct seed corpus by randomly selecting inputs from training or test dataset. Till now, there is no study on how initial seed inputs affect the performance of DL testing and how to construct an optimal one. To fill this gap, we conduct the first systematic study to evaluate the impact of seed selection strategies on DL testing. Specifically, considering three popular goals of DL testing (i.e., coverage, failure detection, and robustness), we develop five seed selection strategies, including three based on single-objective optimization (SOO) and two based on multi-objective optimization (MOO). We evaluate these strategies on seven testing tools. Our results demonstrate that the selection of initial seed inputs greatly affects the testing performance. SOO-based selection can construct the best seed corpus that can boost DL testing with respect to the specific testing goal. MOO-based selection strategies can construct seed corpus that achieve balanced improvement on multiple objectives.
Funder
National Key Research and Development Program of China
National Natural Science Foundation of China
Shaanxi Province Key Industry Innovation Program
National Research Foundation, Singapore, and the Cyber Security Agency under its National Cybersecurity R&D Programme
Ministry of Education, Singapore under its Academic Research Tier 3
Publisher
Association for Computing Machinery (ACM)
Reference96 articles.
1. Yuhan Zhi. 2022. Seed Selection. Retrieved from https://sites.google.com/view/seedselection
2. Humberto Abdelnur Radu State Obes Jorge Lucangeli and Olivier Festor. 2010. Spectral fuzzing: Evaluation & feedback. phdthesis. INRIA.
3. Testing autonomous cars for feature interaction failures using many-objective search
4. Mike Aizatsky Kostya Serebryany Oliver Chang Abhishek Arya and Meredith Whittaker. 2016. Announcing OSS-Fuzz: Continuous fuzzing for open source software. Google Testing Blog (2016). Retrieved from https://opensource.googleblog.com/2016/12/announcing-oss-fuzz-continuous-fuzzing.html
5. Automated web application testing using search based software engineering
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献