Incorporating statistical and machine learning techniques into the optimization of correction factors for software development effort estimation-Reference-Cited by-同舟云学术

Incorporating statistical and machine learning techniques into the optimization of correction factors for software development effort estimation

Published:2023-08-31 Issue: Volume: Page:
ISSN:2047-7473
Container-title:Journal of Software: Evolution and Process
language:en
Short-container-title:J Software Evolu Process

Author:

Nhung Ho Le Thi Kim¹,Van Hai Vo²^ORCID,Silhavy Petr³,Prokopova Zdenka³,Silhavy Radek³^ORCID

Affiliation:

1. Faculty of Information Technology University of Science–Vietnam National University Ho Chi Minh City Vietnam

2. Faculty of Information Technology Industrial University of Ho Chi Minh City Ho Chi Minh City Vietnam

3. Faculty of Applied Informatics Tomas Bata University in Zlín Zlín Czech Republic

Abstract

AbstractAccurate effort estimation is necessary for efficient management of software development projects, as it relates to human resource management. Ensemble methods, which employ multiple statistical and machine learning techniques, are more robust, reliable, and accurate effort estimation techniques. This study develops a stacking ensemble model based on optimization correction factors by integrating seven statistical and machine learning techniques (K‐nearest neighbor, random forest, support vector regression, multilayer perception, gradient boosting, linear regression, and decision tree). The grid search optimization method is used to obtain valid search ranges and optimal configuration values, allowing more accurate estimation. We conducted experiments to compare the proposed method with related methods, such as use case points‐based single methods, optimization correction factors‐based single methods, and ensemble methods. The estimation accuracies of the methods were evaluated using statistical tests and unbiased performance measures on a total of four datasets, thus demonstrating the effectiveness of the proposed method more clearly. The proposed method successfully maintained its estimation accuracy across the four experimental datasets and gave the best results in terms of the sum of squares errors, mean absolute error, root mean square error, mean balance relative error, mean inverted balance relative error, median of magnitude of relative error, and percentage of prediction (0.25). The p‐value for the t‐test showed that the proposed method is statistically superior to other methods in terms of estimation accuracy. The results show that the proposed method is a comprehensive approach for improving estimation accuracy and minimizing project risks in the early stages of software development.

Funder

Univerzita Tomáše Bati ve Zlíně

Publisher

Wiley

Subject

Software

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/smr.2611

Reference89 articles.

1. A Systematic Review of Software Development Cost Estimation Studies

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Delving into Human Factors through LSTM by Navigating Environmental Complexity Factors within Use Case Points for Digital Enterprises;Journal of Theoretical and Applied Electronic Commerce Research;2024-02-14