Detection of Malicious Software by Analyzing Distinct Artifacts Using Machine Learning and Deep Learning Algorithms-Reference-Cited by-同舟云学术

Detection of Malicious Software by Analyzing Distinct Artifacts Using Machine Learning and Deep Learning Algorithms

Published:2021-07-15 Issue:14 Volume:10 Page:1694
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Ashik Mathew,Jyothish A.,Anandaram S.,Vinod P.,Mercaldo Francesco,Martinelli Fabio,Santone Antonella

Abstract

Malware is one of the most significant threats in today’s computing world since the number of websites distributing malware is increasing at a rapid rate. Malware analysis and prevention methods are increasingly becoming necessary for computer systems connected to the Internet. This software exploits the system’s vulnerabilities to steal valuable information without the user’s knowledge, and stealthily send it to remote servers controlled by attackers. Traditionally, anti-malware products use signatures for detecting known malware. However, the signature-based method does not scale in detecting obfuscated and packed malware. Considering that the cause of a problem is often best understood by studying the structural aspects of a program like the mnemonics, instruction opcode, API Call, etc. In this paper, we investigate the relevance of the features of unpacked malicious and benign executables like mnemonics, instruction opcodes, and API to identify a feature that classifies the executable. Prominent features are extracted using Minimum Redundancy and Maximum Relevance (mRMR) and Analysis of Variance (ANOVA). Experiments were conducted on four datasets using machine learning and deep learning approaches such as Support Vector Machine (SVM), Naïve Bayes, J48, Random Forest (RF), and XGBoost. In addition, we also evaluate the performance of the collection of deep neural networks like Deep Dense network, One-Dimensional Convolutional Neural Network (1D-CNN), and CNN-LSTM in classifying unknown samples, and we observed promising results using APIs and system calls. On combining APIs/system calls with static features, a marginal performance improvement was attained comparing models trained only on dynamic features. Moreover, to improve accuracy, we implemented our solution using distinct deep learning methods and demonstrated a fine-tuned deep neural network that resulted in an F1-score of 99.1% and 98.48% on Dataset-2 and Dataset-3, respectively.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/14/1694/pdf

Reference57 articles.

1. PRISEC: Comparison of Symmetric Key Algorithms for IoT Devices

2. A Survey On Automated Dynamic Malware Analysis Evasion and Counter-Evasion

3. Mandiant https://www.fireeye.com/mandiant.html

4. Ether

5. Analysis and classification of context-based malware behavior

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. High Accuracy Detection of Mobile Malware Using Machine Learning;Electronics;2023-03-15

2. Machine Learning Based Model to Find Out Firewall Decisions Towards Improving Cyber Defence;Lecture Notes in Networks and Systems;2023

3. A hierarchical layer of atomic behavior for malicious behaviors prediction;Journal of Computer Virology and Hacking Techniques;2022-04-07

4. Evaluation of Feature Selection Methods on Psychosocial Education Data Using Additive Ratio Assessment;Electronics;2021-12-30

5. Identifying Symmetric-Key Algorithms Using CNN in Intel Processor Trace;Electronics;2021-10-13