On the need of preserving order of data when validating within-project defect classifiers-Reference-Cited by-同舟云学术

On the need of preserving order of data when validating within-project defect classifiers

Published:2020-08-31 Issue:6 Volume:25 Page:4805-4830
ISSN:1382-3256
Container-title:Empirical Software Engineering
language:en
Short-container-title:Empir Software Eng

Author:

Falessi Davide^ORCID,Huang Jacky,Narayana Likhita,Thai Jennifer Fong,Turhan Burak

Abstract

AbstractWe are in the shoes of a practitioner who uses previous project releases’ data to predict which classes of the current release are defect-prone. In this scenario, the practitioner would like to use the most accurate classifier among the many available ones. A validation technique, hereinafter “technique”, defines how to measure the prediction accuracy of a classifier. Several previous research efforts analyzed several techniques. However, no previous study compared validation techniques in the within-project across-release class-level context or considered techniques that preserve the order of data. In this paper, we investigate which technique recommends the most accurate classifier. We use the last release of a project as the ground truth to evaluate the classifier’s accuracy and hence the ability of a technique to recommend an accurate classifier. We consider nine classifiers, two industry and 13 open projects, and three validation techniques: namely 10-fold cross-validation (i.e., the most used technique), bootstrap (i.e., the recommended technique), and walk-forward (i.e., a technique preserving the order of data). Our results show that: 1) classifiers differ in accuracy in all datasets regardless of their entity per value, 2) walk-forward outperforms both 10-fold cross-validation and bootstrap statistically in all three accuracy metrics: AUC of the selected classifier, bias and absolute bias, 3) surprisingly, all techniques resulted to be more prone to overestimate than to underestimate the performances of classifiers, and 3) the defect rate resulted in changing between the second and first half in both industry projects and 83% of open-source datasets. This study recommends the use of techniques that preserve the order of data such as walk-forward over 10-fold cross-validation and bootstrap in the within-project across-release class-level context given the above empirical results and that walk-forward is by nature more simple, inexpensive, and stable than the other two techniques.

Funder

Università degli Studi di Roma Tor Vergata

Publisher

Springer Science and Business Media LLC

Subject

Software

Link

https://link.springer.com/content/pdf/10.1007/s10664-020-09868-x.pdf

Reference65 articles.

1. Agrawal A, Menzies T (2018) Is ‘better data’ better than ‘better data miners’? 40th Int Conference Software Eng - ICSE 18:1050–1061

2. Altman NS (1992) An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat 46(3):175–185

3. Austin PC, Steyerberg EW (2017) Events per variable (EPV) and the relative performance of different strategies for estimating the out-of-sample validity of logistic regression models. Stat Methods Med Res 26(2):796–808

4. Bayley S and Falessi D (2018) “Optimizing Prediction Intervals by Tuning Random Forest via Meta-Validation”. arXiv Prepr. arXiv1801.07194

5. Bergmeir C, Benítez JM (2012) On the use of cross-validation for time series predictor evaluation. Inf. Sci. (Ny). 191:192–213

Cited by 30 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A meta-learning method for smart manufacturing: Tool wear prediction using hybrid information under various operating conditions;Robotics and Computer-Integrated Manufacturing;2025-02

2. The untold impact of learning approaches on software fault-proneness predictions: an analysis of temporal aspects;Empirical Software Engineering;2024-06-08

3. VALIDATE: A deep dive into vulnerability prediction datasets;Information and Software Technology;2024-06

4. Just-in-Time crash prediction for mobile apps;Empirical Software Engineering;2024-05

5. Towards More Practical Automation of Vulnerability Assessment;Proceedings of the IEEE/ACM 46th International Conference on Software Engineering;2024-04-12