Affiliation:
1. Department of Computer Science and Engineering, Fukuoka Institute of Technology, 3‐30‐1 Wajiro‐Higashi, Higashi‐ku Fukuoka 811‐0295 Japan
2. Department of Informatics Kindai University, 3‐4‐1 Kowakae, Higashiosaka city Osaka 577‐8502 Japan
Abstract
In software development, improving the efficiency of a testing process is important to ensure reliability during a limited development period. One approach to improving the testing process is identifying modules that are likely to contain faults (fault‐prone modules) and allocate effort toward resolving them. For this purpose, many fault‐prone module detection models have been proposed in previous studies. However, an appropriate fault‐prone module detection model cannot be constructed if outliers, such as modules with a significantly large number of source code lines and branches, but no faults, are included. In this study, we propose a new outlier elimination technique that creates missing values (deleted values) to complete the data artificially and applies a missing value imputation technique using a regression approach. If the imputed value differs significantly from the actual (recorded) value, the proposed technique treats the values as outliers. We name the proposed technique Outlier Elimination Technique using Deletion‐Imputation Iteration (OEdii), and its performance is verified experimentally. In general, our experimental results are more accurate than the previous outlier elimination techniques in the area under the ROC curve. © 2023 Institute of Electrical Engineer of Japan and Wiley Periodicals LLC.
Funder
Japan Society for the Promotion of Science
Subject
Electrical and Electronic Engineering