L2S: A Framework for Synthesizing the Most Probable Program under a Specification-Reference-Cited by-同舟云学术

L2S: A Framework for Synthesizing the Most Probable Program under a Specification

Published:2022-03-07 Issue:3 Volume:31 Page:1-45
ISSN:1049-331X
Container-title:ACM Transactions on Software Engineering and Methodology
language:en
Short-container-title:ACM Trans. Softw. Eng. Methodol.

Author:

Xiong Yingfei¹,Wang Bo¹

Affiliation:

1. Peking University, Haidian, Beijing

Abstract

In many scenarios, we need to find the most likely program that meets a specification under a local context, where the local context can be an incomplete program, a partial specification, natural language description, and so on. We call such a problem program estimation . In this article, we propose a framework, LingLong Synthesis Framework (L2S) , to address this problem. Compared with existing work, our work is novel in the following aspects. (1) We propose a theory of expansion rules to describe how to decompose a program into choices. (2) We propose an approach based on abstract interpretation to efficiently prune off the program sub-space that does not satisfy the specification. (3) We prove that the probability of a program is the product of the probabilities of choosing expansion rules, regardless of the choosing order. (4) We reduce the program estimation problem to a pathfinding problem, enabling existing pathfinding algorithms to solve this problem. L2S has been applied to program generation and program repair. In this article, we report our instantiation of this framework for synthesizing conditional expressions (L2S-Cond) and repairing conditional statements (L2S-Hanabi). The experiments on L2S-Cond show that each option enabled by L2S, including the expansion rules, the pruning technique, and the use of different pathfinding algorithms, plays a major role in the performance of the approach. The default configuration of L2S-Cond correctly predicts nearly 60% of the conditional expressions in the top 5 candidates. Moreover, we evaluate L2S-Hanabi on 272 bugs from two real-world Java defects benchmarks, namely Defects4J and Bugs.jar. L2S-Hanabi correctly fixes 32 bugs with a high precision of 84%. In terms of repairing conditional statement bugs, L2S-Hanabi significantly outperforms all existing approaches in both precision and recall.

Funder

National Key Research and Development Program

National Natural Science Foundation of China

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Link

https://dl.acm.org/doi/pdf/10.1145/3487570

Reference73 articles.

1. An Evaluation of Similarity Coefficients for Software Fault Localization

2. Alfred V. Aho, Monica S. Lam, Ravi Sethi, and Jeffrey D. Ullman. 2006. Compilers: Principles, Techniques, and Tools (2nd ed.).

3. Syntax-guided synthesis

4. Getafix: Learning to fix bugs automatically;Bader Johannes;Proceedings of the ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA’19),2019

5. Pavol Bielik, Veselin Raychev, and Martin T. Vechev. 2016. PHOG: Probabilistic model for code. In Proceedings of the International Conference on Machine Learning (ICML’16). 2933–2942.

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Does Going Beyond Branch Coverage Make Program Repair Tools More Reliable?;2024 IEEE Conference on Software Testing, Verification and Validation (ICST);2024-05-27

2. Evaluating Fault Localization and Program Repair Capabilities of Existing Closed-Source General-Purpose LLMs;Proceedings of the 1st International Workshop on Large Language Models for Code;2024-04-20

3. GrammarT5: Grammar-Integrated Pretrained Encoder-Decoder Neural Model for Code;Proceedings of the IEEE/ACM 46th International Conference on Software Engineering;2024-04-12

4. Accelerating Patch Validation for Program Repair With Interception-Based Execution Scheduling;IEEE Transactions on Software Engineering;2024-03

5. Variable-based Fault Localization via Enhanced Decision Tree;ACM Transactions on Software Engineering and Methodology;2023-12-21