A Novel Fuzzy-Logic-Based Multi-Criteria Metric for Performance Evaluation of Spam Email Detection Algorithms-Reference-Cited by-同舟云学术

A Novel Fuzzy-Logic-Based Multi-Criteria Metric for Performance Evaluation of Spam Email Detection Algorithms

Published:2022-07-12 Issue:14 Volume:12 Page:7043
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Khan Salman A.,Iqbal Kashif,Mohammad Nazeeruddin^ORCID,Akbar Rehan,Ali Syed Saad Azhar^ORCID,Siddiqui Ammar Ahmed

Abstract

The increasing volume of unsolicited bulk emails has become a major threat to global security. While a significant amount of research has been carried out in terms of proposing new and better algorithms for email spam detection, relatively less attention has been given to evaluation metrics. Some widely used metrics include accuracy, recall, precision, and F-score. This paper proposes a new evaluation metric based on the concepts of fuzzy logic. The proposed metric, termed μO, combines accuracy, recall, and precision into a multi-criteria fuzzy function. Several possible evaluation rules are proposed. As proof of concept, a preliminary empirical analysis of the proposed scheme is carried out using two models, namely BERT (Bidirectional Encoder Representations from Transformers) and LSTM (Long short-term memory) from the domain of deep learning, while utilizing three benchmark datasets. Results indicate that for the Enron and PU datasets, LSTM produces better results of μO, with the values in the range of 0.88 to 0.96, whereas BERT generates better values of μO in the range of 0.94 to 0.96 for Lingspam dataset. Furthermore, extrinsic evaluation confirms the effectiveness of the proposed fuzzy logic metric.

Funder

Prince Mohammad bin Fahd University

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/14/7043/pdf

Reference59 articles.

1. A support vector machine based naive Bayes algorithm for spam filtering;Feng;Proceedings of the 2016 IEEE 35th International Performance Computing and Communications Conference (IPCCC),2016

2. Machine learning for email spam filtering: review, approaches and open research problems

3. https://www.statista.com/statistics/456500/daily-number-of-e-mails-worldwide/

4. Measuring, Characterizing, and Avoiding Spam Traffic Costs

5. The effect of spam and privacy concerns on e-mail users’ behavior;Park;J. Inf. Syst. Secur.,2007

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Email spam detection by deep learning models using novel feature selection technique and BERT;Egyptian Informatics Journal;2024-06

2. A distributed relay selection using a fuzzy-BCM based decision making strategy for multi-hop data dissemination in VANETs;Wireless Networks;2024-03-15

3. OEC Net: Optimal feature selection-based email classification network using unsupervised learning with deep CNN model;e-Prime - Advances in Electrical Engineering, Electronics and Energy;2024-03

4. Email Spam Detection by Machine Learning Approaches: A Review;Lecture Notes in Networks and Systems;2024

5. A Systematic Review on Deep-Learning-Based Phishing Email Detection;Electronics;2023-11-05