Predict Network Intruder Using Machine Learning Model and Classification

Author:

Raja Chithik1,K. Hemachandran2,Devarajan V.1,Begum K. Jarina3

Affiliation:

1. University of Technology and Applied Sciences Salalah, Salalah, Sultanate of Oman

2. Department of Artificial Intelligence, School of Business, Woxsen University, Hyderabad, India

3. Jazan University, Jazan, Kingdom of Saudi Arabia

Abstract

The massive number of sensors deployed in IoT generates humongous volumes of data for a broad range of applications such as smart home, smart healthcare, smart manufacturing, smart transportation, smart grid, smart agriculture etc. Analyzing such data in order to facilitate enhanced decision making and increase productivity and accuracy is a critical process for businesses and life improving paradigm. Machine Learning would play a vital role in creating smarter techniques to predict the intruder from the dataset. It has shown remarkable results in different fields, including Network security, image recognition, information retrieval, speech recognition, natural language processing, indoor localization, physiological and psychological state detection, etc. In this regard, intrusion detection is becoming a research focus in the field of information security. In our experiment, we used the CICIDS2017 data set to predict the Network Intruder. The Canadian Institute of Cyber Security released the data set CICIDS-2017, which consists of eight separate files and includes five days’ worth of normal cum abnormal network packet data. The goal of this research is to examine relevant and significant elements of large network packets in order to increase network packet attack detection accuracy and reduce execution time. We choose important and meaningful features by applying Information Gain, ranking and grouping features based on little weight values on the CICIDS-2017 dataset; and then use Random Forest (RF), Random Tree (RT), Naive Bayes (NB), Bayes Net (BN), and J48 classifier algorithms. The findings of the experiment reveal that the amount of relevant and significant features produced by Information Gain has a substantial impact on improving detection accuracy and execution time. The Random Forest method, for example, has the best accuracy with 0.14% of negative results when using 22 relevant selected features, whereas the Random Tree classifier algorithm has a higher accuracy with 0.13% of negative results when using 52 relevant selected features but takes a longer execution time.

Publisher

BENTHAM SCIENCE PUBLISHERS

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3