Software defect prediction method based on the heterogeneous integration algorithm

Author:

Zheng Zhangqi12,Liu Yongshan1,Zhang Bing12,Ren Jiadong12,Zong Yongsheng1,Wang Qian12,Yang Xiaolei1,Liu Qian1

Affiliation:

1. School of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei, P.R. China

2. The Key Laboratory of Software Engineering of Hebei Province, Qinhuangdao, China

Abstract

A software defect is a common cyberspace security problem, leading to information theft, system crashes, and other network hazards. Software security is a fundamental challenge for cyberspace security defense. However, when researching software defects, the defective code in the software is small compared with the overall code, leading to data imbalance problems in predicting software vulnerabilities. This study proposes a heterogeneous integration algorithm based on imbalance rate threshold drift for the data imbalance problem and for predicting software defects. First, the Decision Tree-based integration algorithm was designed following sample perturbation. Moreover, the Support Vector Machine (SVM)-based integration algorithm was designed based on attribute perturbation. Following the heterogeneous integration algorithm, the primary classifier was trained by sample diversity and model structure diversity. Second, we combined the integration algorithms of two base classifiers to form a heterogeneous integration model. The imbalance rate was designed to achieve threshold transfer and obtain software defect prediction results. Finally, the NASA-MDP and Juliet datasets were used to verify the heterogeneous integration algorithm’s validity, correctness, and generalization based on the Decision Tree and SVM.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference37 articles.

1. Solving the class imbalance problem using ensemble algorithm: application of screening for aortic dissection;Liu;BMC Medical Informatics and Decision Making,2022

2. Lean, Zhou and Rongtian , A DBN-based resampling SVM ensemble learning paradigm for credit classification with imbalanced data, Applied Soft Computing, 2018.

3. Integrated Learning Methods: A Review of Research;Xu;Journal of Yunnan University,2018

4. Zhou Z.H. , Machine learning, Tsingmua University Press, 2016.

5. Wahono R.S. , A Systematic Literature Review of Software Defect Prediction: Research Trends, Datasets, Methods and Frameworks, Journal of Software Engineering 1(1) (2015).

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3