Multi-modal program inference: a marriage of pre-trained language models and component-based synthesis-Reference-Cited by-同舟云学术

Multi-modal program inference: a marriage of pre-trained language models and component-based synthesis

Published:2021-10-20 Issue:OOPSLA Volume:5 Page:1-29
ISSN:2475-1421
Container-title:Proceedings of the ACM on Programming Languages
language:en
Short-container-title:Proc. ACM Program. Lang.

Author:

Rahmani Kia¹,Raza Mohammad²,Gulwani Sumit²,Le Vu²,Morris Daniel²,Radhakrishna Arjun²,Soares Gustavo²,Tiwari Ashish²

Affiliation:

1. Purdue University, USA

2. Microsoft, USA

Abstract

Multi-modal program synthesis refers to the task of synthesizing programs (code) from their specification given in different forms, such as a combination of natural language and examples. Examples provide a precise but incomplete specification, and natural language provides an ambiguous but more "complete" task description. Machine-learned pre-trained models (PTMs) are adept at handling ambiguous natural language, but struggle with generating syntactically and semantically precise code. Program synthesis techniques can generate correct code, often even from incomplete but precise specifications, such as examples, but they are unable to work with the ambiguity of natural languages. We present an approach that combines PTMs with component-based synthesis (CBS): PTMs are used to generate candidates programs from the natural language description of the task, which are then used to guide the CBS procedure to find the program that matches the precise examples-based specification. We use our combination approach to instantiate multi-modal synthesis systems for two programming domains: the domain of regular expressions and the domain of CSS selectors. Our evaluation demonstrates the effectiveness of our domain-agnostic approach in comparison to a state-of-the-art specialized system, and the generality of our approach in providing multi-modal program synthesis from natural language and examples in different programming domains.

Publisher

Association for Computing Machinery (ACM)

Subject

Safety, Risk, Reliability and Quality,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3485535

Reference45 articles.

1. R. Alur R. Bodik G. Juniwal M. M. K. Martin M. Raghothaman S. A. Seshia R. Singh A. Solar-Lezama E. Torlak and A. Udupa. 2013. Syntax-guided synthesis. In 2013 Formal Methods in Computer-Aided Design. 1–8. https://doi.org/10.1109/FMCAD.2013.6679385 10.1109/FMCAD.2013.6679385 R. Alur R. Bodik G. Juniwal M. M. K. Martin M. Raghothaman S. A. Seshia R. Singh A. Solar-Lezama E. Torlak and A. Udupa. 2013. Syntax-guided synthesis. In 2013 Formal Methods in Computer-Aided Design. 1–8. https://doi.org/10.1109/FMCAD.2013.6679385 10.1109/FMCAD.2013.6679385

2. Rajeev Alur Pavol Cerny and Arjun Radhakrishna. 2015. Synthesis Through Unification. In Computer Aided Verification (CAV). https://www.microsoft.com/en-us/research/publication/synthesis-through-unification/ Rajeev Alur Pavol Cerny and Arjun Radhakrishna. 2015. Synthesis Through Unification. In Computer Aided Verification (CAV). https://www.microsoft.com/en-us/research/publication/synthesis-through-unification/

3. On the complexity of minimum inference of regular sets

Cited by 18 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Structure and design of multimodal dataset for automatic regex synthesis methods in Roman Urdu;International Journal of Data Science and Analytics;2024-07-23

2. PyDex: Repairing Bugs in Introductory Python Assignments using LLMs;Proceedings of the ACM on Programming Languages;2024-04-29

3. Programming-by-Demonstration for Long-Horizon Robot Tasks;Proceedings of the ACM on Programming Languages;2024-01-05

4. Survey of intelligent program synthesis techniques;International Conference on Algorithms, High Performance Computing, and Artificial Intelligence (AHPCAI 2023);2023-12-07

5. FormaT5: Abstention and Examples for Conditional Table Formatting with Natural Language;Proceedings of the VLDB Endowment;2023-11