Deep is Better? An Empirical Comparison of Information Retrieval and Deep Learning Approaches to Code Summarization-Reference-Cited by-同舟云学术

Deep is Better? An Empirical Comparison of Information Retrieval and Deep Learning Approaches to Code Summarization

Published:2023-11-06 Issue: Volume: Page:
ISSN:1049-331X
Container-title:ACM Transactions on Software Engineering and Methodology
language:en
Short-container-title:ACM Trans. Softw. Eng. Methodol.

Author:

Zhu Tingwei¹,Li Zhong¹,Pan Minxue²,Shi Chaoxuan¹,Zhang Tian¹,Pei Yu³,Li Xuandong¹

Affiliation:

1. State Key Lab for Novel Software Technology and the Department of Computer Science and Technology, Nanjing University, China

2. State Key Lab for Novel Software Technology and the Software Institute, Nanjing University, China

3. Department of Computing, The Hong Kong Polytechnic University, China

Abstract

Code summarization aims to generate short functional descriptions for source code to facilitate code comprehension. While Information Retrieval (IR) approaches that leverage similar code snippets and corresponding summaries have led the early research, Deep Learning (DL) approaches that use neural models to capture statistical properties between code and summaries are now mainstream. Although some preliminary studies suggest that IR approaches are more effective in some cases, it is currently unclear how effective the existing approaches can be in general, where and why IR/DL approaches perform better, and whether the integration of IR and DL can achieve better performance. Consequently, there is an urgent need for a comprehensive study of the IR and DL code summarization approaches to provide guidance for future development in this area. This paper presents the first large-scale empirical study of 18 IR, DL, and hybrid code summarization approaches on five benchmark datasets. We extensively compare different types of approaches using automatic metrics, we conduct quantitative and qualitative analyses of where and why IR and DL approaches perform better, respectively, and we also study hybrid approaches for assessing the effectiveness of integrating IR and DL. The study shows that the performance of IR approaches should not be underestimated, that while DL models perform better in predicting tokens from method signatures and capturing structural similarities in code, simple IR approaches tend to perform better in the presence of code with high similarity or long reference summaries, and that existing hybrid approaches do not perform as well as individual approaches in their respective areas of strength. Based on our findings, we discuss future research directions for better code summarization.

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Link

https://dl.acm.org/doi/pdf/10.1145/3631975

Reference106 articles.

1. A Transformer-based Approach for Source Code Summarization

2. Unified Pre-training for Program Understanding and Generation

3. Loubna Ben Allal , Raymond Li , Denis Kocetkov , Chenghao Mou , Christopher Akiki , Carlos Munoz Ferrandis , Niklas Muennighoff , Mayank Mishra , Alex Gu , Manan Dey , et al . 2023 . SantaCoder : don’t reach for the stars!arXiv preprint arXiv:2301.03988(2023). Loubna Ben Allal, Raymond Li, Denis Kocetkov, Chenghao Mou, Christopher Akiki, Carlos Munoz Ferrandis, Niklas Muennighoff, Mayank Mishra, Alex Gu, Manan Dey, et al. 2023. SantaCoder: don’t reach for the stars!arXiv preprint arXiv:2301.03988(2023).

4. Learning natural coding conventions

5. A Survey of Machine Learning for Big Code and Naturalness

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Comparative Analysis of Large Language Models for Code Documentation Generation;Proceedings of the 1st ACM International Conference on AI-Powered Software;2024-07-10

2. Automatic smart contract comment generation via large language models and in-context learning;Information and Software Technology;2024-04