Detecting Compiler Bugs Via a Deep Learning-Based Framework

Published:2022-05 Issue:05 Volume:32 Page:661-691
ISSN:0218-1940
Container-title:International Journal of Software Engineering and Knowledge Engineering
language:en
Short-container-title:Int. J. Soft. Eng. Knowl. Eng.

Author:

Tang Yixuan¹, Ren Zhilei¹², Jiang He¹³, Qiao Lei⁴, Liu Dong¹, Zhou Zhide¹, Kong Weiqiang¹

Affiliation:

1. School of Software, Dalian University of Technology, No. 2, Linggong Road, Ganjingzi District, Dalian City, Liaoning Province, P. R. China

2. Key Laboratory of Safety-Critical Software, Ministry of Industry and Information Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, P. R. China

3. DUT Artificial Intelligence Institute, Dalian City, Liaoning Province, P. R. China

4. Beijing Institute of Control Engineering, No. 104, Youyi Rd. Haidian District, Beijing, P. R. China

Abstract

Compiler testing is the most widely used way to assure compiler quality. However, since compilers require a large number of sophisticated test programs as inputs, existing compiler testing approaches still have limited capability to generate test programs that are both syntactically valid and diverse. In this paper, we propose DeepGen, a deep learning-based approach that supports compiler testing by inferring a generative model for compiler inputs. First, DeepGen trains a Transformer-XL model on a large corpus of seed programs and uses the trained model to generate syntactically valid programs. Then, DeepGen adopts a sampling strategy in the inference phase to generate diverse test programs. Finally, DeepGen applies differential testing to the generated programs to discover compiler bugs. We have evaluated DeepGen on two popular C++ compilers, GCC and LLVM, and the results confirm the effectiveness of our approach. DeepGen detects 35.29%, 53.33%, and 187.50% more bugs than three existing approaches, namely DeepSmith, DeepFuzz, and Csmith, respectively. In addition, 30.43% of the bugs detected by DeepGen are not detected by the other approaches. Furthermore, DeepGen has successfully detected 38 bugs in the latest development versions of GCC and LLVM; 21 of them have been confirmed or fixed by the developers.
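The differential testing step mentioned in the abstract can be illustrated with a minimal sketch. This is not DeepGen's implementation: the abstract does not say how results are compared, so the majority-vote rule, the function name `differential_test`, and the idea of modelling each compiler as a plain callable are all assumptions made for illustration.

```python
from collections import Counter
from typing import Callable, Dict, List

# Hypothetical type: a "compiler" is any callable that takes program source
# text and returns an observable result (e.g. exit status plus stdout).
Compiler = Callable[[str], str]


def differential_test(program: str, compilers: Dict[str, Compiler]) -> List[str]:
    """Return the names of compilers whose result deviates from the majority.

    An empty list means no discrepancy was observed.
    Majority voting is an assumption of this sketch, not taken from the paper.
    """
    if not compilers:
        return []
    # Run the same program through every compiler and record what each produced.
    results = {name: run(program) for name, run in compilers.items()}
    counts = Counter(results.values())
    majority_result, majority_count = counts.most_common(1)[0]
    # Without a strict majority there is no reference behaviour to trust,
    # so every compiler is reported for manual inspection.
    if majority_count * 2 <= len(results):
        return sorted(results)
    return sorted(name for name, res in results.items() if res != majority_result)


# Stand-in compilers for demonstration only; a real harness would invoke
# actual compiler binaries and run the produced executables.
stand_ins = {
    "compiler_a": lambda src: "ok:42",
    "compiler_b": lambda src: "ok:42",
    "compiler_c": lambda src: "ok:41",  # simulated miscompilation
}
suspects = differential_test("int main() { return 42; }", stand_ins)
```

In a setting with only two compilers, a disagreement yields no majority, so the sketch reports both and leaves the decision about which one is at fault to a human.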

Funder

National Natural Science Foundation of China

Publisher

World Scientific Pub Co Pte Ltd

Subject

Artificial Intelligence,Computer Graphics and Computer-Aided Design,Computer Networks and Communications,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218194022500206

Cited by 1 article.

1. Design of Intelligent Political Test Paper Generation Method Based on Improved Intelligent Optimization Algorithm; ICST Transactions on Scalable Information Systems; 2024-05-02