Dissimilarity Space Based Multi-Source Cross-Project Defect Prediction-Reference-Cited by-同舟云学术

Dissimilarity Space Based Multi-Source Cross-Project Defect Prediction

Published:2019-01-02 Issue:1 Volume:12 Page:13
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Ren Shengbing^ORCID,Zhang Wanying,Munir Hafiz Shahbaz,Xia Lei

Abstract

Software defect prediction is an important means to guarantee software quality. Because there are no sufficient historical data within a project to train the classifier, cross-project defect prediction (CPDP) has been recognized as a fundamental approach. However, traditional defect prediction methods use feature attributes to represent samples, which cannot avoid negative transferring, may result in poor performance model in CPDP. This paper proposes a multi-source cross-project defect prediction method based on dissimilarity space (DM-CPDP). This method not only retains the original information, but also obtains the relationship with other objects. So it can enhances the discriminant ability of the sample attributes to the class label. This method firstly uses the density-based clustering method to construct the prototype set with the cluster center of samples in the target set. Then, the arc-cosine kernel is used to calculate the sample dissimilarities between the prototype set and the source domain or the target set to form the dissimilarity space. In this space, the training set is obtained with the earth mover’s distance (EMD) method. For the unlabeled samples converted from the target set, the k-Nearest Neighbor (KNN) algorithm is used to label those samples. Finally, the model is learned from training data based on TrAdaBoost method and used to predict new potential defects. The experimental results show that this approach has better performance than other traditional CPDP methods.

Funder

the Central South University Graduate Research Innovation Project

Publisher

MDPI AG

Subject

Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/12/1/13/pdf

Reference27 articles.

1. Software fault prediction: A literature review and current trends

2. A systematic study of cross-project defect prediction with meta-learning;Porto;IEEE Trans. Softw. Eng.,2018

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Improving Cross-Project Software Defect Prediction Method Through Transformation and Feature Selection Approach;IEEE Access;2023

2. Tsbagging: A Novel Cross-Project Software Defect Prediction Algorithm Based on Semisupervised Clustering;Scientific Programming;2022-09-28

3. Cross-project software defect prediction based on domain adaptation learning and optimization;Expert Systems with Applications;2021-06

4. complexFuzzy: A novel clustering method for selecting training instances of cross-project defect prediction;Computer Science;2021-02-01