Affiliation:
1. Department of Computer Science, Ekiti State University, Nigeria.
2. Department of Computer Science, Federal University of Technology, Akure.
Abstract
The proliferation of Malware on computer communication systems posed great security challenges to confidential data stored and other valuable substances across the globe. There have been several attempts in curbing the menace using a signature-based approach and in recent times, machine learning techniques have been extensively explored. This paper proposes a framework combining the exploit of both feature selections based on extra tree and random forest and eight ensemble techniques on five base learners- KNN, Naive Bayes, SVM, Decision Trees, and Logistic Regression. K-Nearest Neighbors returns the highest accuracy of 96.48%, 96.40%, and 87.89% on extra-tree, random forest, and without feature selection (WFS) respectively. Random forest ensemble accuracy on both Feature Selections are the highest with 98.50% and 98.16% on random forest and extra-tree respectively. The Extreme Gradient Boosting Classifier is next on random-forest FS with an accuracy of 98.37% while Voting returns the least detection accuracy of 95.80%. On extra-tree FS, Bagging is next with a detection accuracy of 98.09% while Voting returns the least accuracy of 95.54%. Random Forest has the highest all in seven evaluative measures in both extra tree and random forest feature selection techniques. The study results uncover the tree-based ensemble model is proficient and successful for malware classification.
Publisher
Blue Eyes Intelligence Engineering and Sciences Engineering and Sciences Publication - BEIESP
Subject
Management of Technology and Innovation,General Engineering
Reference28 articles.
1. AV-TEST (2019), The Independent IT-Security Institute, https://www.av-test.org/en/statistics/malware/. Accessed 2 November 2019.
2. Kaspersky Security Bulletin (2016), Overall statistics, https://securelist.com/kaspersky-security-bulletin-2016-e xecutive-summary/76858/. Accessed 12 May 2016.
3. McAfee Labs Threats Report (2017),https://www.mcafee.com/us/resources/reports/rp-quarterl y-threats-jun-2017.pdf. Accessed 2 June 2017.
4. Chandrashekar G. and Sahin F., "A survey on feature selection methods", Computers & Electrical Engineering., vol. 40(1), 2014, pp.16-28.
5. HarshaLatha P. and Mohanasundaram R, "A New Hybrid Strategy for Malware Detection Classification with Multiple Feature Selection Methods and Ensemble Learning Methods", International Journal of Engineering and Advanced Technology (IJEAT) ISSN: 2249-8958., vol. 9(2), 2019, pp. 4013-4019.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献