Affiliation:
1. University of Houston – Clear Lake, USA
Abstract
Given a choice, software project managers frequently prefer traditional methods of making decisions over relying on empirical software engineering (empirical/machine-learning-based models). One reason for this choice is the perceived lack of credibility associated with these models. To promote better empirical software engineering, a series of experiments is conducted on various NASA datasets to demonstrate the importance of assessing the ease or difficulty of a modeling situation. Each dataset is divided into three groups: a training set and "nice"/"nasty" neighbor test sets. Using a nearest neighbor approach, "nice neighbors" align closest to training instances of the same class, while "nasty neighbors" align closest to training instances of the opposite class. The "nice" and "nasty" experiments average 94% and 20% accuracy, respectively. Another set of experiments shows that ten-fold cross-validation is not sufficient to characterize a dataset. Finally, a set of metric equations is proposed for improving the credibility assessment of empirical/machine learning models.
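The abstract's "nice"/"nasty" split can be illustrated with a short sketch. The following is a minimal, assumed implementation (not the authors' code): it uses a 1-nearest-neighbor rule against the training set and labels a test instance "nice" when its nearest training neighbor shares its class, and "nasty" otherwise. The function name `split_nice_nasty` and the use of scikit-learn are illustrative assumptions.

```python
# Minimal sketch (assumption, not the paper's implementation): split a labeled
# test pool into "nice" and "nasty" neighbor sets via 1-nearest-neighbor
# matching against the training set, as described in the abstract.
import numpy as np
from sklearn.neighbors import NearestNeighbors

def split_nice_nasty(X_train, y_train, X_test, y_test):
    """Return index arrays of "nice" and "nasty" test instances.

    A test instance is "nice" when its nearest training neighbor has the
    same class label, and "nasty" when that neighbor's label differs.
    """
    nn = NearestNeighbors(n_neighbors=1).fit(X_train)
    _, idx = nn.kneighbors(X_test)              # index of each instance's nearest training neighbor
    neighbor_labels = y_train[idx.ravel()]      # class label of that neighbor
    nice = np.where(neighbor_labels == y_test)[0]
    nasty = np.where(neighbor_labels != y_test)[0]
    return nice, nasty
```

Under this reading, model accuracy would be reported separately on the two subsets, which is how the contrast between the averaged 94% ("nice") and 20% ("nasty") results could be obtained.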
Cited by
7 articles.