Learning to Detect and Localize Multilingual Bugs-Reference-Cited by-同舟云学术

Learning to Detect and Localize Multilingual Bugs

Published:2024-07-12 Issue:FSE Volume:1 Page:2190-2213
ISSN:2994-970X
Container-title:Proceedings of the ACM on Software Engineering
language:en
Short-container-title:Proc. ACM Softw. Eng.

Author:

Yang Haoran¹^ORCID,Nong Yu¹^ORCID,Zhang Tao²^ORCID,Luo Xiapu³^ORCID,Cai Haipeng¹^ORCID

Affiliation:

1. Washington State University, Pullman, USA

2. Macau University of Science and Technology, Macau, China

3. Hong Kong Polytechnic University, Hong Kong, China

Abstract

Increasing studies have shown bugs in multi-language software as a critical loophole in modern software quality assurance, especially those induced by language interactions (i.e., multilingual bugs). Yet existing tool support for bug detection/localization remains largely limited to single-language software, despite the long-standing prevalence of multi-language systems in various real-world software domains. Extant static/dynamic analysis and deep learning (DL) based approaches all face major challenges in addressing multilingual bugs. In this paper, we present xLoc, a DL-based technique/tool for detecting and localizing multilingual bugs. Motivated by results of our bug-characteristics study on top locations of multilingual bugs, xLoc first learns the general knowledge relevant to differentiating various multilingual control-flow structures. This is achieved by pre-training a Transformer model with customized position encoding against novel objectives. Then, xLoc learns task-specific knowledge for the task of multilingual bug detection/localization, through another new position encoding scheme (based on cross-language API vicinity) that allows for the model to attend particularly to control-flow constructs that bear most multilingual bugs during fine-tuning. We have implemented xLoc for Python-C software and curated a dataset of 3,770 buggy and 15,884 non-buggy Python-C samples, which enabled our extensive evaluation of xLoc against two state-of-the-art baselines: fine-tuned CodeT5 and zero-shot ChatGPT. Our results show that xLoc achieved 94.98% F1 and 87.24%@Top-1 accuracy, which are significantly (up to 162.88% and 511.75%) higher than the baselines. Ablation studies further confirmed significant contributions of each of the novel design elements in xLoc. With respective bug-location characteristics and labeled bug datasets for fine-tuning, our design may be applied to other language combinations beyond Python-C.

Funder

NSF

Office of Naval Research

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3660804

Reference79 articles.

1. Mouna Abidi, Manel Grichi, and Foutse Khomh. 2019. Behind the scenes: developers’ perception of multi-language practices. In Annual International Conference on Computer Science and Software Engineering. 72–81.

2. Are Multi-Language Design Smells Fault-Prone? An Empirical Study

3. Mayank Agarwal Yikang Shen Bailin Wang Yoon Kim and Jie Chen. 2024. Structured code representations enable data-efficient adaptation of code language models. arXiv preprint arXiv:2401.10716 1–18. https://doi.org/10.48550/arXiv.2401.10716 10.48550/arXiv.2401.10716

4. Towards Understanding and Reasoning About Android Interoperations

5. BridgeTaint: A Bi-Directional Dynamic Taint Tracking Method for JavaScript Bridges in Android Hybrid Applications