Affiliation:
1. Department of Computer Science, Faculty of Science, Chiang Mai University, Chiang Mai, Thailand
Abstract
A class imbalance problem is a problem in which the number of majority class and minority class varies greatly. In this article, we propose an oversampling method using GA and k-Nearest Neighbors (kNN) to deal with a network intrusion, a class imbalance problem. We use GA as the main algorithm and use a kNN as its fitness function. We compare the proposed method with a very popular oversampling technique which is a SMOTE family. The experimental results show that the proposed method provides better Accuracy, Precision, and F-measure values than a SMOTE family in almost all datasets with almost all classifiers. Moreover, in some datasets with some classifiers, the proposed method also gives a better Recall value than a SMOTE family as well. This is because the proposed method can generate new intruders in a more independent area than a SMOTE family.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference23 articles.
1. An advanced profile hidden Markov modelfor malware detection;Alipour;Intelligence Data Analysis,2020
2. An efficient network behavioranomaly detection using a hybrid DBN-LSTM network;Chen;Computersand Security,2022
3. SMOTE forlearning from imbalanced data: progress and challenges, marking the15-year anniversary;Fernandez;Journal of Artificial IntelligenceResearch,2018
4. An effective fuzzy clustering algorithmwith outlier identification feature;Gosain;Journal of Intelligent andFuzzy Systems,2021
5. Unsw-Nb15 datasetand machine learning based intrusion detection systems;Sonule;International Journal of Engineering and Advanced Technology,2020