Software Defect Prediction Using Deep Q-Learning Network-Based Feature Extraction

Published:2024-05-30 Issue: Volume:2024 Page:1-34
ISSN:1751-8814
Container-title:IET Software
language:en
Short-container-title:IET Software

Author:

Zhang Qinhe¹, Zhang Jiachen¹, Feng Tie¹, Xue Jialang¹, Zhu Xinxin¹, Zhu Ningyang¹, Li Zhiheng¹

Affiliation:

1. China

Abstract

Machine learning-based software defect prediction (SDP) approaches have been widely proposed to help deliver high-quality software. Unfortunately, previous research conducted without effective feature reduction suffers from high-dimensional data, which leads to unsatisfactory prediction performance. Moreover, without proper feature reduction, the interpretability and generalization ability of machine learning models in SDP may be compromised, hindering their practical utility in diverse software development environments. In this paper, an SDP approach using deep Q-learning network (DQN)-based feature extraction is proposed to eliminate irrelevant, redundant, and noisy features and to improve classification performance. In the data preprocessing phase, the BalanceCascade undersampling method is applied to divide the original datasets. As the first step of feature extraction, the weight ranking of all metric elements is calculated according to the expected cross-entropy. Then, the relation matrix is constructed by applying random matrix theory. After that, a reward principle is defined for computing the Q value of Q-learning based on the weight ranking, the relation matrix, and the number of errors. Using this Q value, a convolutional neural network model is trained on the datasets until sequences of metric pairs have been generated for all datasets; these sequences act as the revised feature set. Experiments have been conducted on 11 NASA and 11 PROMISE repository datasets. Sensitivity analysis experiments show that SDP approaches whose binary classification algorithms use the DQN-based feature extraction outperform those that do not. We also conducted experiments to compare our approach with four state-of-the-art approaches on common datasets, which show that our approach is superior to these methods in precision, F-measure, area under the receiver operating characteristic curve (AUC), and Matthews correlation coefficient (MCC) values.
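For readers unfamiliar with the evaluation measures named in the abstract, the following is a minimal sketch of how precision, F-measure, and the Matthews correlation coefficient are conventionally computed from a binary confusion matrix. It uses the standard textbook definitions and is not taken from the paper; the function name `binary_metrics` and the example counts are hypothetical and chosen only for illustration.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Standard binary classification measures from confusion-matrix counts.

    tp/fp/tn/fn are counts of true positives, false positives,
    true negatives, and false negatives (defective = positive class).
    Undefined ratios (zero denominators) are reported as 0.0 by convention.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-measure: harmonic mean of precision and recall.
    f_measure = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    # MCC ranges from -1 (total disagreement) to +1 (perfect prediction).
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
        "mcc": mcc,
    }


# Hypothetical counts, not results from the paper.
metrics = binary_metrics(tp=30, fp=10, tn=50, fn=10)
print(metrics)
```

MCC is often preferred on imbalanced defect datasets because it takes all four confusion-matrix cells into account, whereas precision and F-measure ignore true negatives.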

Funder

National Key Research and Development Program of China

Publisher

Institution of Engineering and Technology (IET)

Link

http://downloads.spj.sciencemag.org/ietsfw/2024/3946655.pdf

Reference: 67 articles.

1. Building Defect Prediction Models in Practice

2. Combining data preprocessing methods with imputation techniques for software defect prediction;M. Kakkar;International Journal of Open Source Software and Processes,2018

3. Convolutional neural networks on assembly code for predicting software defects

4. Software Defect Prediction Using Supervised Learning Algorithm and Unsupervised Learning Algorithm

5. Benchmarking Classification Models for Software Defect Prediction: A Proposed Framework and Novel Findings