Affiliation:
1. University of California, Davis
Abstract
We introduce
equivalence modulo inputs (EMI)
, a simple, widely applicable methodology for validating optimizing compilers. Our key insight is to exploit the close interplay between (1) dynamically executing a program on some test inputs and (2) statically compiling the program to work on all possible inputs. Indeed, the test inputs induce a natural collection of the original program's EMI variants, which can help differentially test any compiler and specifically target the difficult-to-find miscompilations.
To create a practical implementation of EMI for validating C compilers, we profile a program's test executions and stochastically prune its unexecuted code. Our extensive testing in eleven months has led to 147 confirmed, unique bug reports for GCC and LLVM alone. The majority of those bugs are miscompilations, and more than 100 have already been fixed.
Beyond testing compilers, EMI can be adapted to validate program transformation and analysis systems in general. This work opens up this exciting, new direction.
Funder
National Science Foundation
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Graphics and Computer-Aided Design,Software
Cited by
109 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Fuzzing MLIR Compiler Infrastructure via Operation Dependency Analysis;Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis;2024-09-11
2. Metamorphic Testing of Secure Multi-party Computation (MPC) Compilers;Proceedings of the ACM on Software Engineering;2024-07-12
3. DTD: Comprehensive and Scalable Testing for Debuggers;Proceedings of the ACM on Software Engineering;2024-07-12
4. Understanding and Detecting Annotation-Induced Faults of Static Analyzers;Proceedings of the ACM on Software Engineering;2024-07-12
5. Enumerating Valid Non-Alpha-Equivalent Programs for Interpreter Testing;ACM Transactions on Software Engineering and Methodology;2024-06-04