A Hadoop Based Framework Integrating Machine Learning Classifiers for Anomaly Detection in the Internet of Things-Reference-Cited by-同舟云学术

A Hadoop Based Framework Integrating Machine Learning Classifiers for Anomaly Detection in the Internet of Things

Published:2021-08-13 Issue:16 Volume:10 Page:1955
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Thaseen Ikram Sumaiya,Mohanraj Vanitha,Ramachandran Sakthivel,Sanapala Kishore^ORCID,Yeo Sang-Soo^ORCID

Abstract

In recent years, different variants of the botnet are targeting government, private organizations and there is a crucial need to develop a robust framework for securing the IoT (Internet of Things) network. In this paper, a Hadoop based framework is proposed to identify the malicious IoT traffic using a modified Tomek-link under-sampling integrated with automated Hyper-parameter tuning of machine learning classifiers. The novelty of this paper is to utilize a big data platform for benchmark IoT datasets to minimize computational time. The IoT benchmark datasets are loaded in the Hadoop Distributed File System (HDFS) environment. Three machine learning approaches namely naive Bayes (NB), K-nearest neighbor (KNN), and support vector machine (SVM) are used for categorizing IoT traffic. Artificial immune network optimization is deployed during cross-validation to obtain the best classifier parameters. Experimental analysis is performed on the Hadoop platform. The average accuracy of 99% and 90% is obtained for BoT_IoT and ToN_IoT datasets. The accuracy difference in ToN-IoT dataset is due to the huge number of data samples captured at the edge layer and fog layer. However, in BoT-IoT dataset only 5% of the training and test samples from the complete dataset are considered for experimental analysis as released by the dataset developers. The overall accuracy is improved by 19% in comparison with state-of-the-art techniques. The computational times for the huge datasets are reduced by 3–4 hours through Map Reduce in HDFS.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/16/1955/pdf

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Network intrusion detection: An optimized deep learning approach using big data analytics;Expert Systems with Applications;2024-10

2. A survey of data mining methodologies in the environment of IoT and its variants;Journal of Network and Computer Applications;2024-08

3. Toward Efficient Neural Networks Through Predictor-Assisted NSGA-III for Anomaly Traffic Detection of IoT;IEEE Transactions on Cognitive Communications and Networking;2024-06

4. RobEns: Robust Ensemble Adversarial Machine Learning Framework for Securing IoT Traffic;Sensors;2024-04-19

5. Machine Learning and Deep Learning Techniques for Internet of Things Network Anomaly Detection—Current Research Trends;Sensors;2024-03-20