Two sides of the same coin: A study on developers' perception of defects-Reference-Cited by-同舟云学术

Two sides of the same coin: A study on developers' perception of defects

Published:2024-06-18 Issue: Volume: Page:
ISSN:2047-7473
Container-title:Journal of Software: Evolution and Process
language:en
Short-container-title:J Software Evolu Process

Author:

Santos Geanderson¹^ORCID,Muzetti Igor²,Figueiredo Eduardo¹

Affiliation:

1. Computer Science Department Federal University of Minas Gerais Belo Horizonte Brazil

2. Computer Science Department Federal University of Ouro Preto Ouro Preto Brazil

Abstract

SummarySoftware defect prediction is a subject of study involving the interplay of software engineering and machine learning. The current literature proposed numerous machine learning models to predict software defects from software data, such as commits and code metrics. Further, the most recent literature employs explainability techniques to understand why machine learning models made such predictions (i.e., predicting the likelihood of a defect). As a result, developers are expected to reason on the software features that may relate to defects in the source code. However, little is known about the developers' perception of these machine learning models and their explanations. To explore this issue, we focus on a survey with experienced developers to understand how they evaluate each quality attribute for the defect prediction. We chose the developers based on their contributions at GitHub, where they contributed to at least 10 repositories in the past 2 years. The results show that developers tend to evaluate code complexity as the most important quality attribute to avoid defects compared with the other target attributes such as source code size, coupling, and documentation. At the end, a thematic analysis reveals that developers evaluate testing the code as a relevant aspect not covered by the static software features. We conclude that, qualitatively, there exists a misalignment between developers' perceptions and the outputs of machine learning models. For instance, while machine learning models assign high importance to documentation, developers often overlook documentation and prioritize assessing the complexity of the code instead.

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/smr.2699

Reference77 articles.

1. WangS LiuT TanL.Automatically learning semantic features for defect prediction. In: Proceedings of the 38th International Conference of Software Engineering (ICSE);2016:297‐308.

2. JingX YingS ZhangZ WuS LiuJ.Dictionary learning based software defect prediction. In: Proceedings of the 36th International Conference of Software Engineering (ICSE);2014:414‐423.

3. HassanAE.Predicting faults using the complexity of code changes. In: 2009 IEEE 31st International Conference of Software Engineering (ICSE);2009:78‐88.

4. D'AmbrosM LanzaM RobbesR.An extensive comparison of bug prediction approaches. In: 7th IEEE Working Conference on Mining Software Repositories (MSR);2010:31‐41.

5. The impact of automated parameter optimization on defect prediction models;Tantithamthavorn C;Trans Softw Eng (TSE),2019