Towards Optimization of Malware Detection using Extra-Tree and Random Forest Feature Selections on Ensemble Classifiers-Reference-Cited by-同舟云学术

Towards Optimization of Malware Detection using Extra-Tree and Random Forest Feature Selections on Ensemble Classifiers

Published:2021-03-30 Issue:6 Volume:9 Page:223-232
ISSN:2277-3878
Container-title:International Journal of Recent Technology and Engineering
language:en
Short-container-title:IJRTE

Author:

Gbenga Fadare Oluwaseun¹,Olusola Adetunmbi Adebayo²,Elohor Oyinloye Oghenerukevwe¹

Affiliation:

1. Department of Computer Science, Ekiti State University, Nigeria.

2. Department of Computer Science, Federal University of Technology, Akure.

Abstract

The proliferation of Malware on computer communication systems posed great security challenges to confidential data stored and other valuable substances across the globe. There have been several attempts in curbing the menace using a signature-based approach and in recent times, machine learning techniques have been extensively explored. This paper proposes a framework combining the exploit of both feature selections based on extra tree and random forest and eight ensemble techniques on five base learners- KNN, Naive Bayes, SVM, Decision Trees, and Logistic Regression. K-Nearest Neighbors returns the highest accuracy of 96.48%, 96.40%, and 87.89% on extra-tree, random forest, and without feature selection (WFS) respectively. Random forest ensemble accuracy on both Feature Selections are the highest with 98.50% and 98.16% on random forest and extra-tree respectively. The Extreme Gradient Boosting Classifier is next on random-forest FS with an accuracy of 98.37% while Voting returns the least detection accuracy of 95.80%. On extra-tree FS, Bagging is next with a detection accuracy of 98.09% while Voting returns the least accuracy of 95.54%. Random Forest has the highest all in seven evaluative measures in both extra tree and random forest feature selection techniques. The study results uncover the tree-based ensemble model is proficient and successful for malware classification.

Publisher

Blue Eyes Intelligence Engineering and Sciences Engineering and Sciences Publication - BEIESP

Subject

Management of Technology and Innovation,General Engineering

Reference28 articles.

1. AV-TEST (2019), The Independent IT-Security Institute, https://www.av-test.org/en/statistics/malware/. Accessed 2 November 2019.

2. Kaspersky Security Bulletin (2016), Overall statistics, https://securelist.com/kaspersky-security-bulletin-2016-e xecutive-summary/76858/. Accessed 12 May 2016.

3. McAfee Labs Threats Report (2017),https://www.mcafee.com/us/resources/reports/rp-quarterl y-threats-jun-2017.pdf. Accessed 2 June 2017.

4. Chandrashekar G. and Sahin F., "A survey on feature selection methods", Computers & Electrical Engineering., vol. 40(1), 2014, pp.16-28.

5. HarshaLatha P. and Mohanasundaram R, "A New Hybrid Strategy for Malware Detection Classification with Multiple Feature Selection Methods and Ensemble Learning Methods", International Journal of Engineering and Advanced Technology (IJEAT) ISSN: 2249-8958., vol. 9(2), 2019, pp. 4013-4019.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Comparative Study on Malicious URL Using Classifiers and Boosting Algorithms;Signals and Communication Technology;2024

2. Improving Network Intrusion Detection Using Supervised Learning for Feature Selection;2023 IEEE/ACIS 8th International Conference on Big Data, Cloud Computing, and Data Science (BCD);2023-12-14