Software Estimation in the Design Stage with Statistical Models and Machine Learning: An Empirical Study

Author:

Sánchez-García Ángel J.1ORCID,González-Hernández María Saarayim1ORCID,Cortés-Verdín Karen1ORCID,Pérez-Arriaga Juan Carlos1ORCID

Affiliation:

1. Facultad de Estadística e Informática, Universidad Veracruzana, Xalapa 91020, Veracruz, Mexico

Abstract

Accurate estimation of software effort and time in the software development process is a key activity to achieve the necessary product quality. However, underestimation or overestimation of effort has become a key challenge for software development. One of the main problems is the estimation with metrics from late stages, because the product must already be finished to make estimates. In this paper, the use of statistical models and machine learning approaches for software estimation are used in early stages such as software design, and a data set is presented with metric values of design artifacts with 37 software projects. As results, models for the estimation of development time and effort are proposed and validated through leave-one-out cross-validation. Further, machine learning techniques were employed in order to compare software projects estimations. Through the statistical tests, it was proven that the errors were not statistically different with the regression models for effort estimation. However, with Random Forest the best statistical results were obtained for estimating development time.

Funder

Consolidation Fund for Academic Groups 2023, of the General Office of Academic Development and Educational Innovation of the Universidad Veracruzana

Publisher

MDPI AG

Reference54 articles.

1. (2015). Quality Management Systems—Fundamentals and Vocabulary (Norm 9000) (Standard No. ISO 9000:2015).

2. A systematic review of software development cost estimation studies;Jorgensen;IEEE Trans. Softw. Eng.,2007

3. Systematic literature review of machine learning based software development effort estimation models;Wen;Inf. Softw. Technol.,2012

4. Towards improvement of analogy-based software development effort estimation: A review;Bardsiri;Int. J. Softw. Eng. Knowl. Eng.,2014

5. Analogy-based software development effort estimation: A systematic mapping and review;Idri;Inf. Softw. Technol.,2015

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3