Abstract
Software defect prediction refers to the automatic identification of defective parts of software through machine learning techniques. Ensemble learning has shown better prediction performance than individual classifiers. However, most previous work applied ensemble models to software defect prediction with their default hyperparameter values, which are generally suboptimal. In this paper, we investigate the applicability of a stacking ensemble built from fine-tuned tree-based ensembles for defect prediction. We used grid search to optimize the hyperparameters of seven tree-based ensembles: random forest, extra trees, AdaBoost, gradient boosting, histogram-based gradient boosting, XGBoost, and CatBoost. A stacking ensemble was then built from the fine-tuned tree-based ensembles. The ensembles were evaluated on 21 publicly available defect datasets. Empirical results showed that hyperparameter optimization had a large impact on the extra trees and random forest ensembles. Moreover, the stacking ensemble outperformed all of the fine-tuned tree-based ensembles.
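
The tuning-then-stacking pipeline described above can be sketched with scikit-learn-compatible estimators. This is a minimal illustration rather than the authors' implementation: the hyperparameter grids, the ROC-AUC scoring, the 5-fold cross-validation, the synthetic data, and the logistic-regression meta-learner are assumptions made for the sketch.

# Minimal sketch of the grid-search-then-stacking pipeline from the abstract.
# Grids, scoring, CV folds, data, and meta-learner are illustrative assumptions;
# the paper's exact settings may differ.
from sklearn.datasets import make_classification
from sklearn.ensemble import (RandomForestClassifier, ExtraTreesClassifier,
                              AdaBoostClassifier, GradientBoostingClassifier,
                              HistGradientBoostingClassifier, StackingClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from xgboost import XGBClassifier
from catboost import CatBoostClassifier

# Placeholder defect-style binary classification data.
X, y = make_classification(n_samples=500, n_features=20, random_state=42)

# One small grid per base ensemble (placeholders, not the paper's grids).
search_spaces = {
    "rf":  (RandomForestClassifier(random_state=42),          {"n_estimators": [100, 300]}),
    "et":  (ExtraTreesClassifier(random_state=42),             {"n_estimators": [100, 300]}),
    "ada": (AdaBoostClassifier(random_state=42),               {"n_estimators": [50, 200]}),
    "gb":  (GradientBoostingClassifier(random_state=42),       {"learning_rate": [0.05, 0.1]}),
    "hgb": (HistGradientBoostingClassifier(random_state=42),   {"max_iter": [100, 300]}),
    "xgb": (XGBClassifier(eval_metric="logloss", random_state=42), {"max_depth": [3, 6]}),
    "cat": (CatBoostClassifier(verbose=0, random_state=42),    {"depth": [4, 6]}),
}

# Step 1: grid-search each tree-based ensemble independently.
tuned = []
for name, (estimator, grid) in search_spaces.items():
    search = GridSearchCV(estimator, grid, scoring="roc_auc", cv=5)
    search.fit(X, y)
    tuned.append((name, search.best_estimator_))

# Step 2: stack the fine-tuned ensembles; a logistic-regression meta-learner
# combines their out-of-fold predictions.
stack = StackingClassifier(estimators=tuned, final_estimator=LogisticRegression(), cv=5)
stack.fit(X, y)
print("Stacking ensemble trained on base learners:", [n for n, _ in tuned])

In practice, the defect datasets would replace the synthetic data and a proper evaluation protocol (e.g., repeated cross-validation per dataset) would be applied; the sketch only shows how tuned base learners feed the stacking ensemble.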
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Cited by
9 articles.