FastText-Based Local Feature Visualization Algorithm for Merged Image-Based Malware Classification Framework for Cyber Security and Cyber Defense-Reference-Cited by-同舟云学术

FastText-Based Local Feature Visualization Algorithm for Merged Image-Based Malware Classification Framework for Cyber Security and Cyber Defense

Published:2020-03-24 Issue:3 Volume:8 Page:460
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Jang Sejun,Li Shuyu,Sung Yunsick^ORCID

Abstract

The importance of cybersecurity has recently been increasing. A malware coder writes malware into normal executable files. A computer is more likely to be infected by malware when users have easy access to various executables. Malware is considered as the starting point for cyber-attacks; thus, the timely detection, classification and blocking of malware are important. Malware visualization is a method for detecting or classifying malware. A global image is visualized through binaries extracted from malware. The overall structure and behavior of malware are considered when global images are utilized. However, the visualization of obfuscated malware is tough, owing to the difficulties encountered when extracting local features. This paper proposes a merged image-based malware classification framework that includes local feature visualization, global image-based local feature visualization, and global and local image merging methods. This study introduces a fastText-based local feature visualization method: First, local features such as opcodes and API function names are extracted from the malware; second, important local features in each malware family are selected via the term frequency inverse document frequency algorithm; third, the fastText model embeds the selected local features; finally, the embedded local features are visualized through a normalization process. Malware classification based on the proposed method using the Microsoft Malware Classification Challenge dataset was experimentally verified. The accuracy of the proposed method was approximately 99.65%, which is 2.18% higher than that of another contemporary global image-based approach.

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/8/3/460/pdf

Reference24 articles.

1. Affective social big data generation algorithm for autonomous controls by CRNN-based end-to-end controls

2. Automatic Melody Composition Using Enhanced GAN

3. Decision Tree Generation Algorithm for Image-based Video Conferencing;Sung;J. Intern. Technol.,2019

4. A Holistic Approach for Personalization, Relevance Feedback & Recommendation in Enriched Multimedia Content;Stai;Multimed. Tools Appl.,2018

5. Fab

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep Learning for Image Classification: A Review;Lecture Notes in Electrical Engineering;2024

2. Enhancing Cyber Threat Hunting: A Visual Approach with the Forensic Visualization Toolkit;2023 IEEE International Conference on Big Data (BigData);2023-12-15

3. Traffic Accident Detection Using Background Subtraction and CNN Encoder–Transformer Decoder in Video Frames;Mathematics;2023-06-27

4. Traffic Accident Detection Method Using Trajectory Tracking and Influence Maps;Mathematics;2023-04-05

5. Efficient Windows malware identification and classification scheme for plant protection information systems;Frontiers in Plant Science;2023-02-15