Author:
Ali Aftab,Khan Naveed,Abu-Tair Mamun,Noppen Joost,McClean Sally,McChesney Ian
Abstract
AbstractCorrelated quality metrics extracted from a source code repository can be utilized to design a model to automatically predict defects in a software system. It is obvious that the extracted metrics will result in a highly unbalanced data, since the number of defects in a good quality software system should be far less than the number of normal instances. It is also a fact that the selection of the best discriminating features significantly improves the robustness and accuracy of a prediction model. Therefore, the contribution of this paper is twofold, first it selects the best discriminating features that help in accurately predicting a defect in a software component. Secondly, a cost-sensitive logistic regression and decision tree ensemble-based prediction models are applied to the best discriminating features for precisely predicting a defect in a software component. The proposed models are compared with the most recent schemes in the literature in terms of accuracy, area under the curve, and recall. The models are evaluated using 11 datasets and it is evident from the results and analysis that the performance of the proposed prediction models outperforms the schemes in the literature.
Publisher
Springer Science and Business Media LLC
Reference34 articles.
1. Ali, A., Abu-Tair, M., Noppen, Joost., McClean, Sally., Lin, Zhiwei., McChesney, Ian.: Contributing features-based schemes for software defect prediction. In International Conference on Innovative Techniques and Applications of Artificial Intelligence, pages 350–361. Springer (2019)
2. Ammann, Paul, Offutt, Jeff: Introduction to software testing. Cambridge University Press, Cambridge (2016)
3. Aparna, UR., Paul, S.: Feature selection and extraction in data mining. In 2016 Online International Conference on Green Engineering and Technologies (IC-GET), pages 1–3. IEEE (2016)
4. Arar, Ömer F., Ayan, K.: Software defect prediction using cost-sensitive neural network. Appl. Soft Comput. 33, 263–277 (2015)
5. Arasteh, B.: Software fault-prediction using combination of neural network and naive bayes algorithm. J. Netw. Technol. 9(3), 95 (2018)
Cited by
17 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献