Infrequent Pattern Detection for Reliable Network Traffic Analysis Using Robust Evolutionary Computation-Reference-Cited by-同舟云学术

Infrequent Pattern Detection for Reliable Network Traffic Analysis Using Robust Evolutionary Computation

Published:2021-04-25 Issue:9 Volume:21 Page:3005
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Rashid A. N. M. Bazlur^ORCID,Ahmed Mohiuddin^ORCID,Pathan Al-Sakib Khan^ORCID

Abstract

While anomaly detection is very important in many domains, such as in cybersecurity, there are many rare anomalies or infrequent patterns in cybersecurity datasets. Detection of infrequent patterns is computationally expensive. Cybersecurity datasets consist of many features, mostly irrelevant, resulting in lower classification performance by machine learning algorithms. Hence, a feature selection (FS) approach, i.e., selecting relevant features only, is an essential preprocessing step in cybersecurity data analysis. Despite many FS approaches proposed in the literature, cooperative co-evolution (CC)-based FS approaches can be more suitable for cybersecurity data preprocessing considering the Big Data scenario. Accordingly, in this paper, we have applied our previously proposed CC-based FS with random feature grouping (CCFSRFG) to a benchmark cybersecurity dataset as the preprocessing step. The dataset with original features and the dataset with a reduced number of features were used for infrequent pattern detection. Experimental analysis was performed and evaluated using 10 unsupervised anomaly detection techniques. Therefore, the proposed infrequent pattern detection is termed Unsupervised Infrequent Pattern Detection (UIPD). Then, we compared the experimental results with and without FS in terms of true positive rate (TPR). Experimental analysis indicates that the highest rate of TPR improvement was by cluster-based local outlier factor (CBLOF) of the backdoor infrequent pattern detection, and it was 385.91% when using FS. Furthermore, the highest overall infrequent pattern detection TPR was improved by 61.47% for all infrequent patterns using clustering-based multivariate Gaussian outlier score (CMGOS) with FS.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/21/9/3005/pdf

Reference26 articles.

1. Access methods for Big Data: current status and future directions

2. Knowledge management overview of feature selection problem in high-dimensional financial data: cooperative co-evolution and MapReduce perspectives

3. Cooperative Co-Evolution and MapReduce

4. A Novel Penalty-Based Wrapper Objective Function for Feature Selection in Big Data Using Cooperative Co-Evolution

5. An Investigation of Performance Analysis of Anomaly Detection Techniques for Big Data in SCADA Systems

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. RETRACTED: Optimizing building material selection: A machine learning approach for efficient concrete compressive strength forecasting;Journal of Intelligent & Fuzzy Systems;2024-04-18

2. Load Forecasting with Machine Learning and Deep Learning Methods;Applied Sciences;2023-07-06

3. Misinformation Detection in Cyber Smart Cities;Advanced Sciences and Technologies for Security Applications;2023

4. Cyber Safe Data Repositories;Advanced Sciences and Technologies for Security Applications;2023

5. EDSUCh: A robust ensemble data summarization method for effective medical diagnosis;Digital Communications and Networks;2022-07