MIM: A Multiple Integration Model for Intrusion Detection on Imbalanced Samples

Author:

Zhang Zhiqiang1,Wang Le1,Zhu Dong1,Zhu Junyi1,Gu Zhaoquan2,Zhang Yanchun1

Affiliation:

1. Guangzhou University

2. Harbin Institute of Technology (Shenzhen)

Abstract

Abstract The quantity of normal samples is commonly significantly greater than that of malicious samples, resulting in an imbalance in network security data. When dealing with imbalanced samples, the classification model requires careful sampling and attribute selection methods to cope with bias towards majority classes. Simple data sampling methods and incomplete feature selection techniques cannot improve the accuracy of intrusion detection models. In addition, a single intrusion detection model cannot accurately classify all attack types in the face of massive imbalanced security data. Nevertheless, the existing model integration methods based on stacking or voting technologies, suffer from high coupling that undermines their stability and reliability. To address these issues, we propose a Multiple Integration Model (MIM) to implement feature selection and attack classification. First, MIM uses random Oversampling, random Undersampling and Washing Methods (OUWM) to reconstruct the data. Then, a modified simulated annealing algorithm is employed to generate candidate features. Finally, an integrated model based on Light Gradient Boosting Machine (LightGBM), eXtreme Gradient Boosting (XGBoost) and gradient Boosting with Categorical features support (CatBoost) is designed to achieve intrusion detection and attack classification. MIM leverages a Rule-based and Priority-based Ensemble Strategy (RPES) to combine the high accuracy of the former and the high effectiveness of the latter two, improving the stability and reliability of the integration model. We evaluate the effectiveness of our approach on two publicly available intrusion detection datasets, as well as a dataset created by researchers from the University of New Brunswick and another dataset collected by the Australian Center for Cyber Security. In our experiments, MIM significantly outperforms several existing intrusion detection models in terms of accuracy, such as quadratic discriminant analysis, k-nearest neighbor, and back propagation. Specifically, MIM achieves a higher accuracy compared to the two famous models, as well as a model combines deep neural network with deep auto-encoder and another model combines incremental extreme learning machine with an adaptive principal component, with improvements of 5.12% and 5.79%, respectively.

Publisher

Research Square Platform LLC

Reference43 articles.

1. STG2P: A two-stage pipeline model for intrusion detection based on improved LightGBM and K-means;Zhang Z;Simul. Model. Pract. Theory,2022

2. GAN augmentation to deal with imbalance in imaging-based intrusion detection;Giuseppina Andresini A;Future Generation Computer Systems,2021

3. Iman Sharafaldin, A.H., Lashkari: and Ali A. Ghorbani.: Toward generating a new intrusion detection dataset and intrusion traffic characterization. In: Proceedings of the 4th International Conference on Information Systems Security and Privacy, pp. 108–116 (2018)

4. Autoencoder-based deep metric learning for network intrusion detection;Giuseppina Andresini A;Inf. Sci.,2021

5. Building an Intrusion Detection System Using a Filter-Based Feature Selection Algorithm;Mohammed A;IEEE Trans. Comput.,2016

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3