Classifying Malicious Documents on the Basis of Plain-Text Features: Problem, Solution, and Experiences-Reference-Cited by-同舟云学术

Classifying Malicious Documents on the Basis of Plain-Text Features: Problem, Solution, and Experiences

Published:2022-04-18 Issue:8 Volume:12 Page:4088
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Hong Jiwon^ORCID,Jeong Dongho^ORCID,Kim Sang-Wook^ORCID

Abstract

Cyberattacks widely occur by using malicious documents. A malicious document is an electronic document containing malicious codes along with some plain-text data that is human-readable. In this paper, we propose a novel framework that takes advantage of such plaintext data to determine whether a given document is malicious. We extracted plaintext features from the corpus of electronic documents and utilized them to train a classification model for detecting malicious documents. Our extensive experimental results with different combinations of three well-known vectorization strategies and three popular classification methods on five types of electronic documents demonstrate that our framework provides high prediction accuracy in detecting malicious documents.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/8/4088/pdf

Reference53 articles.

1. A Survey on Malware Detection Using Data Mining Techniques

2. Dynamic Malware Analysis in the Modern Era—A State of the Art Survey

3. A Survey on Malware Analysis Techniques: Static, Dynamic, Hybrid and Memory Analysis

4. Malware detection employed by visualization and deep neural network

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. GLDOC: detection of implicitly malicious MS-Office documents using graph convolutional networks;Cybersecurity;2024-07-25

2. Methodology for Collecting Data on the Activity of Malware for Windows OS Based on MITRE ATT&CK;Informatics and Automation;2024-05-28

3. UFADF: A Unified Feature Analysis and Detection Framework for Malicious Office Documents;2023 IEEE 22nd International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom);2023-11-01

4. Offensive Security: Cyber Threat Intelligence Enrichment With Counterintelligence and Counterattack;IEEE Access;2022