Enhancing Phishing Email Detection through Ensemble Learning and Undersampling

Authors:

Qi Qinglin 1, Wang Zhan 1, Xu Yijia 1, Fang Yong 1, Wang Changhui 2

Affiliations:

1. College of Cybersecurity, Sichuan University, Chengdu 610065, China

2. Department of Fundamental Courses, Chengdu Textile College, Chengdu 611731, China

Abstract

In real-world scenarios, the numbers of phishing and benign emails are usually imbalanced, which biases traditional machine learning and deep learning algorithms towards benign emails and causes them to misclassify phishing emails. Few studies take measures to address this imbalance, and the resulting missed detections significantly threaten people’s financial and information security. To mitigate the impact of imbalance on the model and improve the detection of phishing emails, this paper proposes two new undersampling-based algorithms: the Fisher–Markov-based phishing ensemble detection (FMPED) method and the Fisher–Markov–Markov-based phishing ensemble detection (FMMPED) method. The algorithms first remove benign emails in overlapping areas, then undersample the remaining benign emails, and finally combine the retained benign emails with the phishing emails into a new training set, on which ensemble learning algorithms are trained and used for classification. Experimental results demonstrate that the proposed algorithms outperform other machine learning and deep learning algorithms, achieving an F1-score of 0.9945, an accuracy of 0.9945, an AUC of 0.9828, and a G-mean of 0.9827.
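The three data-preparation steps named in the abstract (remove overlapping benign emails, undersample the rest, merge with the phishing emails) can be illustrated with a minimal sketch. It is not the FMPED or FMMPED method: the abstract does not say how overlapping areas are found or how undersampling is done, so the sketch assumes a k-nearest-neighbour overlap test and plain random undersampling as stand-ins. The function names `knn_labels`, `build_balanced_training_set`, and `g_mean` are hypothetical, and the ensemble training step is omitted. The G-mean helper uses the common definition, the geometric mean of sensitivity and specificity.

```python
import math
import random


def knn_labels(point, data, labels, k, skip_index):
    """Return the labels of the k nearest neighbours of `point`, excluding itself."""
    dists = [
        (math.dist(point, other), labels[j])
        for j, other in enumerate(data)
        if j != skip_index
    ]
    dists.sort(key=lambda pair: pair[0])
    return [label for _, label in dists[:k]]


def build_balanced_training_set(X, y, k=3, seed=0):
    """Build a reduced training set from feature vectors X and labels y.

    Labels: 1 = phishing (minority), 0 = benign (majority).
    Assumed stand-in for the pipeline in the abstract, not FMPED/FMMPED.
    """
    rng = random.Random(seed)
    phishing = [i for i, label in enumerate(y) if label == 1]
    benign = [i for i, label in enumerate(y) if label == 0]

    # Step 1 (assumption): a benign email lies in an "overlapping area"
    # when more than half of its k nearest neighbours are phishing.
    kept = []
    for i in benign:
        neighbours = knn_labels(X[i], X, y, k, skip_index=i)
        if sum(neighbours) * 2 <= len(neighbours):
            kept.append(i)

    # Step 2 (assumption): randomly undersample the remaining benign
    # emails down to the number of phishing emails.
    if len(kept) > len(phishing):
        kept = rng.sample(kept, len(phishing))

    # Step 3: combine retained benign emails with all phishing emails.
    chosen = sorted(kept + phishing)
    return [X[i] for i in chosen], [y[i] for i in chosen]


def g_mean(y_true, y_pred):
    """Geometric mean of sensitivity (phishing recall) and specificity."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(sensitivity * specificity)
```

The returned lists would then be passed to whichever ensemble classifier is being evaluated. G-mean is reported alongside accuracy because, on an imbalanced test set, a model that labels almost everything benign can still score high accuracy while its G-mean falls towards zero.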

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Cited by 6 articles.

1. DeepEPhishNet: a deep learning framework for email phishing detection using word embedding algorithms;Sādhanā;2024-07-11

2. An Investigation of AI-Based Ensemble Methods for the Detection of Phishing Attacks;Engineering, Technology & Applied Science Research;2024-06-01

3. OEC Net: Optimal feature selection-based email classification network using unsupervised learning with deep CNN model;e-Prime - Advances in Electrical Engineering, Electronics and Energy;2024-03

4. Investigation of Phishing Susceptibility with Explainable Artificial Intelligence;Future Internet;2024-01-17

5. Comparative Analysis of Neural Network Models for Spam E-mail Detection;2024 Fourth International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies (ICAECT);2024-01-11
