Evaluating the Impact of Data Transformation Techniques on the Performance and Interpretability of Software Defect Prediction Models-Reference-Cited by-同舟云学术

Evaluating the Impact of Data Transformation Techniques on the Performance and Interpretability of Software Defect Prediction Models

Published:2023-11-14 Issue: Volume:2023 Page:1-30
ISSN:1751-8814
Container-title:IET Software
language:en
Short-container-title:IET Software

Author:

Zhao Yu¹^ORCID,Huang Zhiqiu¹^ORCID,Gong Lina¹^ORCID,Zhu Yi²^ORCID,Yu Qiao²^ORCID,Gao Yuxiang²^ORCID

Affiliation:

1. School of Computer Science and Technology, Key Laboratory of Safety-Critical Software, Nanjing University of Aeronautics and Astronautics, Nanjing 210000, China

2. School of Computer Science and Technology, Jiangsu Normal University, Xuzhou 221116, China

Abstract

The performance of software defect prediction (SDP) models determines the priority of test resource allocation. Researchers also use interpretability techniques to gain empirical knowledge about software quality from SDP models. However, SDP methods designed in the past research rarely consider the impact of data transformation methods, simple but commonly used preprocessing techniques, on the performance and interpretability of SDP models. Therefore, in this paper, we investigate the impact of three data transformation methods (Log, Minmax, and Z-score) on the performance and interpretability of SDP models. Through empirical research on (i) six classification techniques (random forest, decision tree, logistic regression, Naive Bayes, K-nearest neighbors, and multilayer perceptron), (ii) six performance evaluation indicators (Accuracy, Precision, Recall, F1, MCC, and AUC), (iii) two interpretable methods (permutation and SHAP), (iv) two feature importance measures (Top-k feature rank overlap and difference), and (v) three datasets (Promise, Relink, and AEEEM), our results show that the data transformation methods can significantly improve the performance of the SDP models and greatly affect the variation of the most important features. Specifically, the impact of data transformation methods on the performance and interpretability of SDP models depends on the classification techniques and evaluation indicators. We observe that log transformation improves NB model performance by 7%–61% on the other five indicators with a 5% drop in Precision. Minmax and Z-score transformation improves NB model performance by 2%–9% across all indicators. However, all three transformation methods lead to substantial changes in the Top-5 important feature ranks, with differences exceeding 2 in 40%–80% of cases (detailed results available in the main content). Based on our findings, we recommend that (1) considering the impact of data transformation methods on model performance and interpretability when designing SDP approaches as transformations can improve model accuracy, and potentially obscure important features, which lead to challenges in interpretation, (2) conducting comparative experiments with and without the transformations to validate the effectiveness of proposed methods which are designed to improve the prediction performance, and (3) tracking changes in the most important features before and after applying data transformation methods to ensure precise and traceable interpretability conclusions to gain insights. Our study reminds researchers and practitioners of the need for comprehensive considerations even when using other similar simple data processing methods.

Funder

National Natural Science Foundation of China

Publisher

Institution of Engineering and Technology (IET)

Subject

Computer Graphics and Computer-Aided Design

Link

http://downloads.hindawi.com/journals/ietsfw/2023/6293074.pdf

Reference93 articles.

1. AI4SE and SE4AI: A Research Roadmap

2. The early bird catches the worm: better early life cycle defect predictors;N. C. Shrikanth,2021

3. Software defect prediction: do different classifiers find the same defects?

4. Process metrics for software defect prediction in object‐oriented programs

5. Revisiting the Impact of Dependency Network Metrics on Software Defect Prediction

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Video-based beat-by-beat blood pressure monitoring via transfer deep-learning;Applied Intelligence;2024-03