Why My Code Summarization Model Does Not Work-Reference-Cited by-同舟云学术

Why My Code Summarization Model Does Not Work

Published:2021-03 Issue:2 Volume:30 Page:1-29
ISSN:1049-331X
Container-title:ACM Transactions on Software Engineering and Methodology
language:en
Short-container-title:ACM Trans. Softw. Eng. Methodol.

Author:

Chen Qiuyuan¹,Xia Xin²,Hu Han²,Lo David³,Li Shanping¹

Affiliation:

1. College of Computer Science and Technology, Zhejiang University, Hangzhou, Zhejiang, China

2. Faculty of Information Technology, Monash University, Victoria, Australia

3. School of Information Systems, Singapore Management University, Singapore

Abstract

Code summarization aims at generating a code comment given a block of source code and it is normally performed by training machine learning algorithms on existing code block-comment pairs. Code comments in practice have different intentions. For example, some code comments might explain how the methods work, while others explain why some methods are written. Previous works have shown that a relationship exists between a code block and the category of a comment associated with it. In this article, we aim to investigate to which extent we can exploit this relationship to improve code summarization performance. We first classify comments into six intention categories and manually label 20,000 code-comment pairs. These categories include “what,” “why,” “how-to-use,” “how-it-is-done,” “property,” and “others.” Based on this dataset, we conduct an experiment to investigate the performance of different state-of-the-art code summarization approaches on the categories. We find that the performance of different code summarization approaches varies substantially across the categories. Moreover, the category for which a code summarization model performs the best is different for the different models. In particular, no models perform the best for “why” and “property” comments among the six categories. We design a composite approach to demonstrate that comment category prediction can boost code summarization to reach better results. The approach leverages classified code-category labeled data to train a classifier to infer categories. Then it selects the most suitable models for inferred categories and outputs the composite results. Our composite approach outperforms other approaches that do not consider comment categories and obtains a relative improvement of 8.57% and 16.34% in terms of ROUGE-L and BLEU-4 score, respectively.

Funder

Australian Research Council's Discovery Early Career Researcher Award

National Key R8D Program of China

NSFC Program

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Link

https://dl.acm.org/doi/pdf/10.1145/3434280

Reference72 articles.

1. A Survey of Machine Learning for Big Code and Naturalness

2. code2vec: learning distributed representations of code

Cited by 52 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing source code classification effectiveness via prompt learning incorporating knowledge features;Scientific Reports;2024-08-30

2. Esale: Enhancing Code-Summary Alignment Learning for Source Code Summarization;IEEE Transactions on Software Engineering;2024-08

3. iiPCS: Intent-Based In-Context Learning for Project-Specific Code Summarization;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

4. Enhancing GUI Exploration Coverage of Android Apps with Deep Link-Integrated Monkey;ACM Transactions on Software Engineering and Methodology;2024-06-27

5. Do Code Summarization Models Process Too Much Information? Function Signature May Be All That Is Needed;ACM Transactions on Software Engineering and Methodology;2024-06-27