Software fault prediction using machine learning techniques with metric thresholds

Published:2021-07-26 Issue:2 Volume:25 Page:159-172
ISSN:1327-2314
Container-title:International Journal of Knowledge-based and Intelligent Engineering Systems
Short-container-title:KES

Author:

Shatnawi Raed

Abstract

BACKGROUND: Fault data is vital to predicting fault-proneness in large systems. Predicting faulty classes helps in allocating appropriate testing resources for future releases. However, current fault data faces challenges such as unlabeled instances and data imbalance, which degrade the performance of prediction models. Data imbalance arises because most classes are labeled as not faulty and only a minority are labeled as faulty. AIM: The research proposes to improve fault prediction by using software metrics in combination with threshold values. Statistical techniques are proposed to improve the quality of the datasets and therefore the quality of the fault prediction. METHOD: Threshold values of object-oriented metrics are used to label classes as faulty in order to improve the fault prediction models. The resulting datasets are used to build prediction models using five machine learning techniques. The use of threshold values is validated on ten large object-oriented systems. RESULTS: The models are built for the datasets with and without the use of thresholds. The combination of thresholds with machine learning improved the fault prediction models significantly for all five classifiers. CONCLUSION: Threshold values can be used to label software classes as fault-prone and to improve machine learners in predicting fault-prone classes.
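The labeling step described under METHOD can be illustrated with a minimal sketch. It is not the author's implementation: the metric names (wmc, cbo, rfc), the threshold values, the `min_exceeded` rule, and the helper names `label_by_thresholds` and `relabel` are all assumptions made for illustration, since the abstract does not state which metrics, thresholds, or decision rule were used.

```python
# Hypothetical thresholds for three object-oriented metrics.
# These values are illustrative assumptions, not taken from the paper.
THRESHOLDS = {"wmc": 20, "cbo": 9, "rfc": 40}


def label_by_thresholds(metrics, thresholds=THRESHOLDS, min_exceeded=1):
    """Return 1 (fault-prone) if at least `min_exceeded` metrics exceed
    their thresholds, otherwise 0. Missing metrics count as 0."""
    exceeded = sum(
        1 for name, limit in thresholds.items() if metrics.get(name, 0) > limit
    )
    return 1 if exceeded >= min_exceeded else 0


def relabel(dataset, thresholds=THRESHOLDS, min_exceeded=1):
    """Return a copy of the dataset in which unlabeled instances
    (faulty is None) receive a threshold-based label. Known labels are kept."""
    result = []
    for row in dataset:
        new_row = dict(row)
        if new_row.get("faulty") is None:
            new_row["faulty"] = label_by_thresholds(new_row, thresholds, min_exceeded)
        result.append(new_row)
    return result


if __name__ == "__main__":
    classes = [
        {"name": "A", "wmc": 35, "cbo": 4, "rfc": 12, "faulty": None},
        {"name": "B", "wmc": 5, "cbo": 2, "rfc": 10, "faulty": None},
        {"name": "C", "wmc": 8, "cbo": 3, "rfc": 15, "faulty": 1},
    ]
    for row in relabel(classes):
        print(row["name"], row["faulty"])
```

The relabeled rows could then be passed to any classifier as training data. How the thresholds themselves are derived is left out here because the abstract does not describe it.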

Publisher

IOS Press

Subject

Artificial Intelligence, Control and Systems Engineering, Software

Reference: 54 articles.

1. A. Folleco, T.M. Khoshgoftaar, J. Van Hulse and L. Bullard, Software quality modeling: The impact of class noise on the random forest classifier, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence), Hong Kong, 2008, pp. 3853–3859.

2. B. Ghotra, S. McIntosh and A.E. Hassan, Revisiting the Impact of Classification Techniques on the Performance of Defect Prediction Models, 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering, Florence, 2015, pp. 789–800.

3. Hailpern, Software debugging, testing, and verification, IBM Systems Journal, 2002.

4. Jiang, Techniques for evaluating fault prediction models, Empir Softw Eng, 2008.

5. Matthews, Comparison of the predicted and observed secondary structure of T4 phage lysozyme, Biochimica et Biophysica Acta (BBA) – Protein Structure, 1975.

Cited by 1 article.

1. Fault Risk Prediction and Condition Maintenance of Distribution Network Based on Convolutional Neural Network;Proceedings of the 5th International Conference on Information Technologies and Electrical Engineering;2022-11-04