How to Effectively Collect and Process Network Data for Intrusion Detection?

Published:2021-11-18 Issue:11 Volume:23 Page:1532
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Komisarek Mikołaj, Pawlicki Marek, Kozik Rafał, Hołubowicz Witold, Choraś Michał

Abstract

The number of security breaches in cyberspace is on the rise. This threat is met with intensive work in the intrusion detection research community. To keep the defensive mechanisms up to date and relevant, realistic network traffic datasets are needed. The use of flow-based data for machine-learning-based network intrusion detection is a promising direction for intrusion detection systems. However, many contemporary benchmark datasets do not contain features that are usable in the wild. The main contribution of this work is to cover the research gap related to identifying and investigating valuable features in the NetFlow schema that allow for effective, machine-learning-based network intrusion detection in the real world. To achieve this goal, several feature selection techniques have been applied to five flow-based network intrusion detection datasets, establishing an informative flow-based feature set. The authors' experience with deploying this kind of system shows that, to close the research-to-market gap and to apply machine-learning-based intrusion detection in the real world, a set of labeled data from the end-user has to be collected. This research aims to establish the appropriate, minimal amount of data that is sufficient to effectively train machine learning algorithms for intrusion detection. The results show that a set of 10 features and a small amount of data is enough for the final model to perform very well.

Funder

European Union's HORIZON 2020

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/23/11/1532/pdf

References: 74 articles.

1. The recent trends in cyber security: A review

2. Guidelines for Stego/Malware Detection Tools: Achieving GDPR Compliance

3. The Proposition and Evaluation of the RoEduNet-SIMARGL2021 Network Intrusion Detection Dataset

4. Real-time stream processing tool for detecting suspicious network patterns using machine learning

Cited by 13 articles.

1. Network intrusion detection: An optimized deep learning approach using big data analytics;Expert Systems with Applications;2024-10

2. Extraction of Minimal Set of Traffic Features Using Ensemble of Classifiers and Rank Aggregation for Network Intrusion Detection Systems;Applied Sciences;2024-08-09

3. Exploring Deep Learning Architectures for Enhanced Cyber Threat Detection: A Survey;2024 International Conference on Science, Engineering and Business for Driving Sustainable Development Goals (SEB4SDG);2024-04-02

4. Ensuring network security with a robust intrusion detection system using ensemble-based machine learning;Array;2023-09

5. Intrusion detection system for large-scale IoT NetFlow networks using machine learning with modified Arithmetic Optimization Algorithm;Internet of Things;2023-07