A feature selection strategy for improving software maintainability prediction-Reference-Cited by-同舟云学术

A feature selection strategy for improving software maintainability prediction

Published:2022-03-14 Issue:2 Volume:26 Page:311-344
ISSN:1088-467X
Container-title:Intelligent Data Analysis
language:
Short-container-title:IDA

Author:

Gupta Shikha,Chug Anuradha

Abstract

Software maintainability is a significant contributor while choosing particular software. It is helpful in estimation of the efforts required after delivering the software to the customer. However, issues like imbalanced distribution of datasets, and redundant and irrelevant occurrence of various features degrade the performance of maintainability prediction models. Therefore, current study applies ImpS algorithm to handle imbalanced data and extensively investigates several Feature Selection (FS) techniques including Symmetrical Uncertainty (SU), RandomForest filter, and Correlation-based FS using one open-source, three proprietaries and two commercial datasets. Eight different machine learning algorithms are utilized for developing prediction models. The performance of models is evaluated using Accuracy, G-Mean, Balance, & Area under the ROC Curve. Two statistical tests, Friedman Test and Wilcoxon Signed Ranks Test are conducted for assessing different FS techniques. The results substantiate that FS techniques significantly improve the performance of various prediction models with an overall improvement of 18.58%, 129.73%, 80.00%, and 45.76% in the median values of Accuracy, G-Mean, Balance, & AUC, respectively for all the datasets taken together. Friedman test advocates the supremacy of SU FS technique. Wilcoxon Signed Ranks test showcases that SU FS technique is significantly superior to the CFS technique for three out of six datasets.

Publisher

IOS Press

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Theoretical Computer Science

Reference84 articles.

1. Application of artificial neural network for predicting maintainability using object-oriented metrics;Aggarwal;Transactions on Engineering, Computing and Technology,2008

2. Machine learning approaches for predicting software maintainability: a fuzzy-based transparent model;Ahmed;IET software,2013

3. Predicting Software Maintainability in Object-Oriented Systems Using Ensemble Techniques

4. Application of Ensemble Techniques in Predicting Object-Oriented Software Maintainability

5. A systematic and comprehensive investigation of methods to build and evaluate fault prediction models;Arisholm;Journal of Systems and Software,2010

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Ordination-based verification of feature selection in pattern evolution research;Intelligent Data Analysis;2023-10-12

2. Machine Learning Implementation for Refactoring Prediction;2022 IEEE 4th PhD Colloquium on Emerging Domain Innovation and Technology for Society (PhD EDITS);2022-11-04