An Improved and Optimized Random Forest Based Approach to Predict the Software Faults

Authors:

Thomas Nikhil Saji, Kaliraj S.

Abstract

Effective software fault prediction is crucial for minimizing errors during software development and preventing subsequent failures. This research introduces an enhanced Random Forest-based approach for predicting software faults, evaluated on the NASA JM1 dataset. The dataset comprises 21 software metrics that indicate the presence or absence of faults in a module. The study describes the NASA dataset in detail, including the cleaning process and the handling of class imbalance with the Synthetic Minority Over-sampling Technique (SMOTE). The core of the approach is the implementation and fine-tuning of a Random Forest classifier, with a specific focus on optimizing hyperparameters to improve predictive accuracy. In comparative evaluations against standard machine learning models, the proposed approach performed best, achieving an accuracy of 82.96% and an F1 score of 89.53%. The paper also emphasizes that software defects can cause failures and crashes during software development, leading to substantial organizational losses. It examines the different aspects of the machine learning model in detail, with examples and illustrative figures to aid understanding of the proposed approach.
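The class-imbalance step mentioned in the abstract can be illustrated with a minimal, standard-library sketch of SMOTE-style oversampling. It is not the authors' implementation: the function names `smote_oversample` and `balance`, the neighbour count `k=5`, and the toy data in the usage note are assumptions made for illustration, and the abstract does not state the paper's actual SMOTE settings. In practice a maintained library implementation would normally be used instead, and oversampling is applied to the training split only.

```python
import random


def _sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def smote_oversample(minority, n_new, k=5, seed=0):
    """Create n_new synthetic rows by interpolating between a randomly
    chosen minority row and one of its k nearest minority neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of the k nearest minority rows, excluding the base row.
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: _sq_dist(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        gap = rng.random()  # position along the segment base -> other
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


def balance(X, y, minority_label=1, k=5, seed=0):
    """Return copies of X and y with synthetic minority rows appended
    until both classes have the same number of rows."""
    minority = [row for row, label in zip(X, y) if label == minority_label]
    n_majority = len(X) - len(minority)
    n_new = max(0, n_majority - len(minority))
    new_rows = smote_oversample(minority, n_new, k=k, seed=seed) if n_new else []
    return list(X) + new_rows, list(y) + [minority_label] * len(new_rows)
```

For example, calling `balance(X, y)` on a toy set with six non-faulty rows (label 0) and three faulty rows (label 1) appends three synthetic faulty rows, each lying on a line segment between two of the original faulty rows. The balanced rows would then be passed to whichever classifier is being tuned.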

Funder

Manipal Academy of Higher Education, Manipal

Publisher

Springer Science and Business Media LLC
