What really changes when developers intend to improve their source code: a commit-level study of static metric value and static analysis warning changes-Reference-Cited by-同舟云学术

What really changes when developers intend to improve their source code: a commit-level study of static metric value and static analysis warning changes

Published:2023-01-14 Issue:2 Volume:28 Page:
ISSN:1382-3256
Container-title:Empirical Software Engineering
language:en
Short-container-title:Empir Software Eng

Author:

Trautsch Alexander^ORCID,Erbel Johannes,Herbold Steffen^ORCID,Grabowski Jens

Abstract

AbstractMany software metrics are designed to measure aspects that are believed to be related to software quality. Static software metrics, e.g., size, complexity and coupling are used in defect prediction research as well as software quality models to evaluate software quality. Static analysis tools also include boundary values for complexity and size that generate warnings for developers. While this indicates a relationship between quality and software metrics, the extent of it is not well understood. Moreover, recent studies found that complexity metrics may be unreliable indicators for understandability of the source code. To explore this relationship, we leverage the intent of developers about what constitutes a quality improvement in their own code base. We manually classify a randomized sample of 2,533 commits from 54 Java open source projects as quality improving depending on the intent of the developer by inspecting the commit message. We distinguish between perfective and corrective maintenance via predefined guidelines and use this data as ground truth for the fine-tuning of a state-of-the art deep learning model for natural language processing. The benchmark we provide with our ground truth indicates that the deep learning model can be confidently used for commit intent classification. We use the model to increase our data set to 125,482 commits. Based on the resulting data set, we investigate the differences in size and 14 static source code metrics between changes that increase quality, as indicated by the developer, and changes unrelated to quality. In addition, we investigate which files are targets of quality improvements. We find that quality improving commits are smaller than non-quality improving commits. Perfective changes have a positive impact on static source code metrics while corrective changes do tend to add complexity. Furthermore, we find that files which are the target of perfective maintenance already have a lower median complexity than files which are the target of non-pervective changes. Our study results provide empirical evidence for which static source code metrics capture quality improvement from the developers point of view. This has implications for program understanding as well as code smell detection and recommender systems.

Funder

Deutsche Forschungsgemeinschaft

Universität Passau

Publisher

Springer Science and Business Media LLC

Subject

Software

Link

https://link.springer.com/content/pdf/10.1007/s10664-022-10257-9.pdf

Reference71 articles.

1. Abdi H (2007) Bonferroni and sidak corrections for multiple comparisons. In: Encyclopedia of measurement and statistics. Sage, Thousand Oaks, pp 103–107

2. Al Dallal J, Abdin A (2018) Empirical evaluation of the impact of object-oriented code refactoring on quality attributes: a systematic literature review. IEEE Trans Softw Eng 44(1):44–69. https://doi.org/10.1109/TSE.2017.2658573

3. Alali A, Kagdi H, Maletic JI (2008) What’s a typical commit? A characterization of open source software repositories. In: 2008 16th IEEE international conference on program comprehension. https://doi.org/10.1109/ICPC.2008.24, pp 182–191

4. AlOmar EA, Mkaouer MW, Ouni A (2021) Toward the automatic classification of self-affirmed refactoring. J Syst Softw 171:110821. https://doi.org/10.1016/j.jss.2020.110821. http://www.sciencedirect.com/science/article/pii/S016412122030217X

5. Alshayeb M (2009) Empirical investigation of refactoring effect on software quality. Inf Softw Technol 51(9):1319–1326. https://doi.org/10.1016/j.infsof.2009.04.002. http://www.sciencedirect.com/science/article/pii/S095058490900038X

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Meta-Study of Software-Change Intentions;ACM Computing Surveys;2024-04-25

2. Commit-Level Software Change Intent Classification Using a Pre-Trained Transformer-Based Code Model;Mathematics;2024-03-28

3. 7 Dimensions of software change patterns;Scientific Reports;2024-03-13

4. Automatic Refactoring Candidate Identification Leveraging Effective Code Representation;2023 IEEE International Conference on Software Maintenance and Evolution (ICSME);2023-10-01

5. Commit Classification Into Software Maintenance Activities: A Systematic Literature Review;2023 IEEE 47th Annual Computers, Software, and Applications Conference (COMPSAC);2023-06