Machine Learning Algorithms for Raw and Unbalanced Intrusion Detection Data in a Multi-Class Classification Problem-Reference-Cited by-同舟云学术

Machine Learning Algorithms for Raw and Unbalanced Intrusion Detection Data in a Multi-Class Classification Problem

Published:2023-06-20 Issue:12 Volume:13 Page:7328
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Bacevicius Mantas¹,Paulauskaite-Taraseviciene Agne¹^ORCID

Affiliation:

1. Faculty of Informatics, Kaunas University of Technology, Studentu 50, 51368 Kaunas, Lithuania

Abstract

Various machine learning algorithms have been applied to network intrusion classification problems, including both binary and multi-class classifications. Despite the existence of numerous studies involving unbalanced network intrusion datasets, such as CIC-IDS2017, a prevalent approach is to address the issue by either merging the classes to optimize their numbers or retaining only the most dominant ones. However, there is no consistent trend showing that accuracy always decreases as the number of classes increases. Furthermore, it is essential for cybersecurity practitioners to recognize the specific type of attack and comprehend the causal factors that contribute to the resulting outcomes. This study focuses on tackling the challenges associated with evaluating the performance of multi-class classification for network intrusions using highly imbalanced raw data that encompasses the CIC-IDS2017 and CSE-CIC-IDS2018 datasets. The research concentrates on investigating diverse machine learning (ML) models, including Logistic Regression, Random Forest, Decision Trees, CNNs, and Artificial Neural Networks. Additionally, it explores the utilization of explainable AI (XAI) methods to interpret the obtained results. The results obtained indicated that decision trees using the CART algorithm performed best on the 28-class classification task, with an average macro F1-score of 0.96878.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/12/7328/pdf

Reference52 articles.

1. A Comprehensive Systematic Literature Review on Intrusion Detection Systems;Samet;IEEE Access,2021

2. A comprehensive review study of cyber-attacks and cyber security; Emerging trends and recent developments;Li;Energy Rep.,2021

3. Jin, S., Chung, J.-G., and Xu, Y. (2021, January 22–28). Signature-Based Intrusion Detection System (IDS) for In-Vehicle CAN Bus Network. Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS), Daegu, Republic of Korea.

4. Erlacher, F., and Dressler, F. (2018, January 23–27). FIXIDS: A high-speed signature-based flow intrusion detection system. Proceedings of the IEEE/IFIP Network Operations and Management Symposium, Taipei, Taiwan.

5. Preuveneers, D., Rimmer, V., Tsingenopoulos, I., Spooren, J., Joosen, W., and Ilie-Zudor, E. (2018). Chained Anomaly Detection Models for Federated Learning: An Intrusion Detection Case Study. Appl. Sci., 8.

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SINNER: A Reward-Sensitive Algorithm for Imbalanced Malware Classification Using Neural Networks with Experience Replay;Information;2024-07-23

2. A Novel IDS with a Dynamic Access Control Algorithm to Detect and Defend Intrusion at IoT Nodes;Sensors;2024-03-29

3. From Bytes to Insights: A Systematic Literature Review on Unraveling IDS Datasets for Enhanced Cybersecurity Understanding;IEEE Access;2024

4. A SRC-RF and WGANs-Based Hybrid Approach for Intrusion Detection;Lecture Notes in Computer Science;2024

5. Classification;Artificial Intelligence for a More Sustainable Oil and Gas Industry and the Energy Transition;2024