An Improved Dandelion Optimizer Algorithm for Spam Detection: Next-Generation Email Filtering System

Author:

Tubishat Mohammad1,Al-Obeidat Feras1,Sadiq Ali Safaa2ORCID,Mirjalili Seyedali34ORCID

Affiliation:

1. College of Technological Innovation, Zayed University, Abu Dhabi P.O. Box 144534, United Arab Emirates

2. Department of Computer Science, Nottingham Trent University, Clifton Lane, Nottingham NG11 8NS, UK

3. Centre of Artificial Intelligence Research and Optimisation, Torrens University Australia, Brisbane, QLD 4006, Australia

4. University Research and Innovation Center, Obuda University, 1034 Budapest, Hungary

Abstract

Spam emails have become a pervasive issue in recent years, as internet users receive increasing amounts of unwanted or fake emails. To combat this issue, automatic spam detection methods have been proposed, which aim to classify emails into spam and non-spam categories. Machine learning techniques have been utilized for this task with considerable success. In this paper, we introduce a novel approach to spam email detection by presenting significant advancements to the Dandelion Optimizer (DO) algorithm. The DO is a relatively new nature-inspired optimization algorithm inspired by the flight of dandelion seeds. While the DO shows promise, it faces challenges, especially in high-dimensional problems such as feature selection for spam detection. Our primary contributions focus on enhancing the DO algorithm. Firstly, we introduce a new local search algorithm based on flipping (LSAF), designed to improve the DO’s ability to find the best solutions. Secondly, we propose a reduction equation that streamlines the population size during algorithm execution, reducing computational complexity. To showcase the effectiveness of our modified DO algorithm, which we refer to as the Improved DO (IDO), we conduct a comprehensive evaluation using the Spam base dataset from the UCI repository. However, we emphasize that our primary objective is to advance the DO algorithm, with spam email detection serving as a case study application. Comparative analysis against several popular algorithms, including Particle Swarm Optimization (PSO), the Genetic Algorithm (GA), Generalized Normal Distribution Optimization (GNDO), the Chimp Optimization Algorithm (ChOA), the Grasshopper Optimization Algorithm (GOA), Ant Lion Optimizer (ALO), and the Dragonfly Algorithm (DA), demonstrates the superior performance of our proposed IDO algorithm. It excels in accuracy, fitness, and the number of selected features, among other metrics. Our results clearly indicate that the IDO overcomes the local optima problem commonly associated with the standard DO algorithm, owing to the incorporation of LSAF and the reduction in equation methods. In summary, our paper underscores the significant advancement made in the form of the IDO algorithm, which represents a promising approach for solving high-dimensional optimization problems, with a keen focus on practical applications in real-world systems. While we employ spam email detection as a case study, our primary contribution lies in the improved DO algorithm, which is efficient, accurate, and outperforms several state-of-the-art algorithms in various metrics. This work opens avenues for enhancing optimization techniques and their applications in machine learning.

Funder

Zayed University

Publisher

MDPI AG

Subject

Computer Networks and Communications,Human-Computer Interaction

Reference42 articles.

1. Prevention and mitigation measures against phishing emails: A sequential schema model;Suzuki;Secur. J.,2022

2. APWG (2023, April 01). Phishing Activity Trends Report: 3rd Quarter 2020. Available online: https://docs.apwg.org/reports/apwg_trends_report_q3_2020.pdf.

3. A comprehensive dual-layer architecture for phishing and spam email detection;Doshi;Comput. Secur.,2023

4. Cyber-Situational Crime Prevention and the Breadth of Cybercrimes among Higher Education Institutions;Back;Int. J. Cybersecur. Intell. Cybercrime,2020

5. A semantic-based classification approach for an enhanced spam detection;Saidani;Comput. Secur.,2020

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3