Abstract
The increasing volume of unsolicited bulk emails has become a major threat to global security. While a significant amount of research has been carried out in terms of proposing new and better algorithms for email spam detection, relatively less attention has been given to evaluation metrics. Some widely used metrics include accuracy, recall, precision, and F-score. This paper proposes a new evaluation metric based on the concepts of fuzzy logic. The proposed metric, termed μO, combines accuracy, recall, and precision into a multi-criteria fuzzy function. Several possible evaluation rules are proposed. As proof of concept, a preliminary empirical analysis of the proposed scheme is carried out using two models, namely BERT (Bidirectional Encoder Representations from Transformers) and LSTM (Long short-term memory) from the domain of deep learning, while utilizing three benchmark datasets. Results indicate that for the Enron and PU datasets, LSTM produces better results of μO, with the values in the range of 0.88 to 0.96, whereas BERT generates better values of μO in the range of 0.94 to 0.96 for Lingspam dataset. Furthermore, extrinsic evaluation confirms the effectiveness of the proposed fuzzy logic metric.
Funder
Prince Mohammad bin Fahd University
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference59 articles.
1. A support vector machine based naive Bayes algorithm for spam filtering;Feng;Proceedings of the 2016 IEEE 35th International Performance Computing and Communications Conference (IPCCC),2016
2. Machine learning for email spam filtering: review, approaches and open research problems
3. https://www.statista.com/statistics/456500/daily-number-of-e-mails-worldwide/
4. Measuring, Characterizing, and Avoiding Spam Traffic Costs
5. The effect of spam and privacy concerns on e-mail users’ behavior;Park;J. Inf. Syst. Secur.,2007
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献