Automatically Recommend Code Updates: Are We There Yet?

Author:

Liu Yue1ORCID,Tantithamthavorn Chakkrit1ORCID,Liu Yonghui1ORCID,Thongtanunam Patanamon2ORCID,Li Li3ORCID

Affiliation:

1. Monash University, Australia

2. The University of Melbourne, Australia

3. Beihang University, China

Abstract

In recent years, large pre-trained Language Models of Code (CodeLMs) have shown promising results on various software engineering tasks. One such task is automatic code update recommendation, which transforms outdated code snippets into their approved and revised counterparts. Although many CodeLM-based approaches have been proposed, claiming high accuracy, their effectiveness and reliability on real-world code update tasks remain questionable. In this paper, we present the first extensive evaluation of state-of-the-art CodeLMs for automatically recommending code updates. We assess their performance on two diverse datasets of paired updated methods, considering factors such as temporal evolution, project specificity, method size, and update complexity. Our results reveal that while CodeLMs exhibit higher performance in settings that ignore temporal information, they struggle in more realistic time-wise scenarios and generalize poorly to new projects. Furthermore, CodeLM performance decreases significantly for larger methods and more complex updates. Furthermore, we observe that many CodeLM-generated “updates” are actually null, especially in time-wise settings, and meaningful edits remain challenging. Our findings highlight the significant gap between the perceived and actual effectiveness of CodeLMs for real-world code update recommendation and emphasize the need for more research on improving their practicality, robustness, and generalizability.

Publisher

Association for Computing Machinery (ACM)

Reference102 articles.

1. (2021) Google bigquery. [Online]. Available: https://console.cloud.google.com/marketplace/details/github/github-repos

2. (2022) Gerrit code review. [Online]. Available: https://www.gerritcodereview.com/

3. Kotlin 2022. [Online]. Available: https://kotlinlang.org/

4. (2024) Android’s gerrit code review repositories. [Online]. Available: https://android-review.googlesource.com/

5. (2024) Google’s gerrit code review repositories. [Online]. Available: https://gerrit-review.googlesource.com/

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3