Affiliation:
1. State Key Laboratory for Novel Software Technology, Nanjing University, China
2. Alibaba Group, USA
3. Arizona State University, USA
Abstract
Commit messages, which summarize the source code changes in natural language, are essential for program comprehension and software evolution understanding. Unfortunately, due to the lack of direct motivation, commit messages are sometimes neglected by developers, making it necessary to automatically generate such messages. State-of-the-art adopts learning based approaches such as neural machine translation models for the commit message generation problem. However, they tend to ignore the code structure information and suffer from the out-of-vocabulary issue. In this paper, we propose CoDiSum to address the above two limitations. In particular, we first extract both code structure and code semantics from the source code changes, and then jointly model these two sources of information so as to better learn the representations of the code changes. Moreover, we augment the model with copying mechanism to further mitigate the out-of-vocabulary issue. Experimental evaluations on real data demonstrate that the proposed approach significantly outperforms the state-of-the-art in terms of accurately generating the commit messages.
Publisher
International Joint Conferences on Artificial Intelligence Organization
Cited by
32 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Evaluating Transfer Learning for Simplifying GitHub READMEs;Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering;2023-11-30
2. Study the correlation between the readme file of GitHub projects and their popularity;Journal of Systems and Software;2023-11
3. Mucha: Multi-channel based Code Change Representation Learning for Commit Message Generation;2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security (QRS);2023-10-22
4. Recommending Code Reviews Leveraging Code Changes with Structured Information Retrieval;2023 IEEE International Conference on Software Maintenance and Evolution (ICSME);2023-10-01
5. BugMiner: Automating Precise Bug Dataset Construction by Code Evolution History Mining;2023 38th IEEE/ACM International Conference on Automated Software Engineering (ASE);2023-09-11