Towards Optimization of Malware Detection using Extra-Tree and Random Forest Feature Selections on Ensemble Classifiers

Author:

Gbenga Fadare Oluwaseun1,Olusola Adetunmbi Adebayo2,Elohor Oyinloye Oghenerukevwe1

Affiliation:

1. Department of Computer Science, Ekiti State University, Nigeria.

2. Department of Computer Science, Federal University of Technology, Akure.

Abstract

The proliferation of Malware on computer communication systems posed great security challenges to confidential data stored and other valuable substances across the globe. There have been several attempts in curbing the menace using a signature-based approach and in recent times, machine learning techniques have been extensively explored. This paper proposes a framework combining the exploit of both feature selections based on extra tree and random forest and eight ensemble techniques on five base learners- KNN, Naive Bayes, SVM, Decision Trees, and Logistic Regression. K-Nearest Neighbors returns the highest accuracy of 96.48%, 96.40%, and 87.89% on extra-tree, random forest, and without feature selection (WFS) respectively. Random forest ensemble accuracy on both Feature Selections are the highest with 98.50% and 98.16% on random forest and extra-tree respectively. The Extreme Gradient Boosting Classifier is next on random-forest FS with an accuracy of 98.37% while Voting returns the least detection accuracy of 95.80%. On extra-tree FS, Bagging is next with a detection accuracy of 98.09% while Voting returns the least accuracy of 95.54%. Random Forest has the highest all in seven evaluative measures in both extra tree and random forest feature selection techniques. The study results uncover the tree-based ensemble model is proficient and successful for malware classification.

Publisher

Blue Eyes Intelligence Engineering and Sciences Engineering and Sciences Publication - BEIESP

Subject

Management of Technology and Innovation,General Engineering

Reference28 articles.

1. AV-TEST (2019), The Independent IT-Security Institute, https://www.av-test.org/en/statistics/malware/. Accessed 2 November 2019.

2. Kaspersky Security Bulletin (2016), Overall statistics, https://securelist.com/kaspersky-security-bulletin-2016-e xecutive-summary/76858/. Accessed 12 May 2016.

3. McAfee Labs Threats Report (2017),https://www.mcafee.com/us/resources/reports/rp-quarterl y-threats-jun-2017.pdf. Accessed 2 June 2017.

4. Chandrashekar G. and Sahin F., "A survey on feature selection methods", Computers & Electrical Engineering., vol. 40(1), 2014, pp.16-28.

5. HarshaLatha P. and Mohanasundaram R, "A New Hybrid Strategy for Malware Detection Classification with Multiple Feature Selection Methods and Ensemble Learning Methods", International Journal of Engineering and Advanced Technology (IJEAT) ISSN: 2249-8958., vol. 9(2), 2019, pp. 4013-4019.

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Comparative Study on Malicious URL Using Classifiers and Boosting Algorithms;Signals and Communication Technology;2024

2. Improving Network Intrusion Detection Using Supervised Learning for Feature Selection;2023 IEEE/ACIS 8th International Conference on Big Data, Cloud Computing, and Data Science (BCD);2023-12-14

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3