A Novel Bio-Inspired Approach for Multilingual Spam Filtering

Author:

Bouarara Hadj Ahmed1ORCID,Hamou Reda Mohamed1ORCID,Amine Abdelmalek1ORCID

Affiliation:

1. GeCode Laboratory, Department of Computer Science, Tahar Moulay University of Saida, Saida, Algeria

Abstract

In today's digital world the email service has revolutionized the sphere of electronic communication. It has become a veritable social phenomenon in our daily life. Unfortunately, this technology has become incontestably the original source of malicious activities especially the plague called undesirable emails (SPAM) that has grown tremendously in the last few years. The battle against spam emails is extremely fierce. This paper deals with an intelligent spam filtering system called artificial heart-lungs system (AHLS) mimicked from the biological phenomenon of general circulation and oxygenation of blood. It is composed of different steps: Selection to stop automatically emails with undesirable identifier. Multilingual pre-processing to treat the problem of multilingual spam emails and vectoring them. Heart filter and lungs filter to classify unwelcome email in the spam folder and welcome email in the ham folder to present them to the recipient. The method uses an automatic updating of learning basis and black list, and a ranking step to order the spam mails according to their spam relevancy. For the authors' experimentation, they have constructed a new dataset M.SPAM composed of emails pre-classified as spam or ham with different language (English, Spanish, French, and melange) and using the validation measures (recall, precision, f-measure, entropy, accuracy and error, false positive rate and false negative rate, ROC and learning curve). The authors have optimized the sensitive parameters (text representation technique, lungs filters, and the size of initial leaning basis). The results are positive compared to the result of other bio-inspired techniques (artificial social bees, artificial social cockroaches), supervised algorithm (decision tree C4.5) and automatic algorithm (K-means). Finally, a visual result mining tool was developed in order to see the results in graphical form (3d cub and cobweb) with more realism using the functionality of zooming and rotation. The authors' aims are to eliminate a large proportion of unwelcome email, treated the multilingual emails, ensuring an automatic updating of their system and poses a minimal risk of eliminating ham email.

Publisher

IGI Global

Subject

Decision Sciences (miscellaneous),Information Systems

Cited by 21 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A Comparison of Machine Learning Algorithms for Multilingual Phishing Detection;2023 20th Annual International Conference on Privacy, Security and Trust (PST);2023-08-21

2. Advanced Bioinspiration Methods for Optimisation Problems;Advanced Bioinspiration Methods for Healthcare Standards, Policies, and Reform;2022-11-18

3. Depressive Person Detection using Social Asian Elephants' (SAE) Algorithm over Twitter Posts;Research Anthology on Usage, Identity, and Impact of Social Media on Society and Culture;2022-06-10

4. Sentiment Analysis Using Machine Learning Algorithms and Text Mining to Detect Symptoms of Mental Difficulties Over Social Media;Research Anthology on Implementing Sentiment Analysis Across Multiple Disciplines;2022-06-10

5. Research Information;Advanced Deep Learning Applications in Big Data Analytics;2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3