Affiliation:
1. Columbia University, New York, NY, USA
Abstract
Systems code must obey many rules, such as "opened files must be closed." One approach to verifying rules is static analysis, but this technique cannot infer precise runtime effects of code, often emitting many false positives. An alternative is symbolic execution, a technique that verifies program paths over all inputs up to a bounded size. However, when applied to verify rules, existing symbolic execution systems often blindly explore many redundant program paths while missing relevant ones that may contain bugs.
Our key insight is that only a small portion of paths are relevant to rules, and the rest (majority) of paths are irrelevant and do not need to be verified. Based on this insight, we create WOODPECKER, a new symbolic execution system for effectively checking rules on systems programs. It provides a set of builtin checkers for common rules, and an interface for users to easily check new rules. It directs symbolic execution toward the program paths relevant to a checked rule, and soundly prunes redundant paths, exponentially speeding up symbolic execution. It is designed to be heuristic-agnostic, enabling users to leverage existing powerful search heuristics.
Evaluation on 136 systems programs totaling 545K lines of code, including some of the most widely used programs, shows that, with a time limit of typically just one hour for each verification run, WOODPECKER effectively verifies 28.7% of the program and rule combinations over bounded input, whereas an existing symbolic execution system KLEE verifies only 8.5%. For the remaining combinations, WOODPECKER verifies 4.6 times as many relevant paths as KLEE. With a longer time limit, WOODPECKER verifies much more paths than KLEE,
e.g.,
17 times as many with a fourhour limit. WOODPECKER detects 113 rule violations, including 10 serious data loss errors with 2 most serious ones already confirmed by the corresponding developers.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Graphics and Computer-Aided Design,Software
Reference67 articles.
1. The LLVM compiler framework. http://llvm.org. The LLVM compiler framework. http://llvm.org.
2. STP Constraint Solver. https://sites.google.com/site/stpfastprover/. STP Constraint Solver. https://sites.google.com/site/stpfastprover/.
3. Dynamic program slicing
4. ODR
Cited by
32 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. SICode: Embedding-Based Subgraph Isomorphism Identification for Bug Detection;Proceedings of the 32nd IEEE/ACM International Conference on Program Comprehension;2024-04-15
2. BSFuzz: Branch-State Guided Hybrid Fuzzing;Electronics;2023-09-25
3. Eunomia: Enabling User-Specified Fine-Grained Search in Symbolically Executing WebAssembly Binaries;Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis;2023-07-12
4. Synergizing Symbolic Execution and Fuzzing By Function-level Selective Symbolization;2022 29th Asia-Pacific Software Engineering Conference (APSEC);2022-12
5. Feedback-Driven Incremental Symbolic Execution;2022 IEEE 33rd International Symposium on Software Reliability Engineering (ISSRE);2022-10