Leveraging Adversarial Samples for Enhanced Classification of Malicious and Evasive PDF Files

Author:

Trad Fouad1ORCID,Hussein Ali1,Chehab Ali1ORCID

Affiliation:

1. Electrical and Computer Engineering, American University of Beirut, Beirut 1107-2020, Lebanon

Abstract

The Portable Document Format (PDF) is considered one of the most popular formats due to its flexibility and portability across platforms. Although people have used machine learning techniques to detect malware in PDF files, the problem with these models is their weak resistance against evasion attacks, which constitutes a major security threat. The goal of this study is to introduce three machine learning-based systems that enhance malware detection in the presence of evasion attacks by substantially relying on evasive data to train malware and evasion detection models. To evaluate the robustness of the proposed systems, we used two testing datasets, a real dataset containing around 100,000 PDF samples and an evasive dataset containing 500,000 samples that we generated. We compared the results of the proposed systems to a baseline model that was not adversarially trained. When tested against the evasive dataset, the proposed systems provided an increase of around 80% in the f1-score compared to the baseline. This proves the value of the proposed approaches towards the ability to deal with evasive attacks.

Funder

Maroun Semaan Faculty of Engineering and Architecture (MSFEA) at the American University of Beirut

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Reference49 articles.

1. The recent trends in cyber security: A review;Kaur;J. King Saud Univ. Comput. Inf. Sci.,2022

2. A comprehensive review study of cyber-attacks and cyber security; Emerging trends and recent developments;Li;Energy Rep.,2021

3. A Comprehensive Review on Malware Detection Approaches;Aslan;IEEE Access,2020

4. Blonce, A., Filiol, E., and Frayssignes, L. (2008, January 24–28). Portable Document Format (PDF) Security Analysis and Malware Threats. Proceedings of the Europe BlackHat 2008 Conference, Amsterdam, The Netherlands.

5. Fleury, N., Dubrunquez, T., and Alouani, I. (2021). PDF-Malware: An Overview on Threats, Detection and Evasion Attacks. arXiv.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3