GA-SCS: Graph-Augmented Source Code Summarization

Published: 2023-02-21 Issue: 2 Volume: 22 Page: 1-19
ISSN: 2375-4699
Container-title: ACM Transactions on Asian and Low-Resource Language Information Processing
Language: en
Short-container-title: ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Zhang Mengli¹, Zhou Gang¹, Yu Wanting¹, Huang Ningbo¹, Liu Wenfen²

Affiliation:

1. State Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou, China

2. Guilin University of Electronic Technology, Guilin, China

Abstract

Automatic source code summarization aims to generate a valuable natural language description of a program, which can facilitate software development and maintenance, code categorization, and retrieval. However, previous sequence-based research did not simultaneously consider the long-distance dependencies and the highly structured nature of source code. In this article, we present a Transformer-based Graph-Augmented Source Code Summarization (GA-SCS) model, which effectively incorporates the inherent structural and textual features of source code to generate an effective code description. Specifically, we develop a graph-based structural feature extraction scheme that leverages abstract syntax trees and graph attention networks to mine global syntactic information. Then, to take full advantage of the lexical and syntactic information of code snippets, we extend the original attention to a syntax-informed self-attention mechanism in our encoder. In the training process, we also adopt a reinforcement learning strategy to enhance the readability and informativeness of the generated code summaries. We evaluate the performance of different models on a Java dataset and a Python dataset. Experimental results demonstrate that our GA-SCS model outperforms all competitive methods on BLEU, METEOR, ROUGE, and human evaluations.
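
To make the graph-based structural extraction step concrete, the following is a minimal sketch of how a code snippet's abstract syntax tree can be turned into a node list and an edge list, which is the kind of input a graph attention network consumes. It is an illustration only and not the authors' implementation: it assumes Python's standard `ast` module as the parser, and the function name `ast_to_graph` is hypothetical.

```python
import ast


def ast_to_graph(source):
    """Parse Python source and return (labels, edges) describing its AST.

    labels[i] is the node-type name of node i (pre-order numbering);
    edges is a list of (parent_index, child_index) pairs.
    Hypothetical helper for illustration, not taken from the paper.
    """
    tree = ast.parse(source)
    labels = []
    edges = []

    def visit(node, parent_index):
        index = len(labels)
        labels.append(type(node).__name__)
        if parent_index is not None:
            edges.append((parent_index, index))
        for child in ast.iter_child_nodes(node):
            visit(child, index)

    visit(tree, None)
    return labels, edges


labels, edges = ast_to_graph("def add(a, b):\n    return a + b\n")
for parent, child in edges:
    print(f"{labels[parent]} -> {labels[child]}")
```

Each node is numbered per occurrence during a pre-order walk, so the result is always a tree with one fewer edge than nodes. A graph neural network would typically treat these edges as undirected and attach an embedding to each node label.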

Funder

National Natural Science Foundation of China

Guangxi Science and Technology Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3554820

References: 50 articles.

Cited by 3 articles.

1. Semantic similarity loss for neural source code summarization;Journal of Software: Evolution and Process;2024-07-07

2. CodeSense: Code Summarizer for JavaScript;2024 International Conference on Signal Processing, Computation, Electronics, Power and Telecommunication (IConSCEPT);2024-07-04

3. Medical Question Summarization with Entity-driven Contrastive Learning;ACM Transactions on Asian and Low-Resource Language Information Processing;2024-04-15