Affiliation:
1. Pontifícia Universidade Católica do Parana (PUCPR)
2. Technology Innovation Institute (TII)
3. Federal University of Amazonas
Abstract
Abstract
Many issues are faced in the email environment due to Spam, such as bottlenecks in the email gateways despite substantial investments in servers' infrastructure, wasted computational resources, and ineffective detection despite the demand for frequent spam model updates. This paper proposes a reliable detection model to deal with the non-stationary behavior of spam messages over time. A high detection rate is provided in a shallow classifier wherein only reliable spam message classification is accepted. Unreliable classifications are rejected and forwarded to a deep learning classifier, providing reliability and a high detection throughput. Experiments performed on a new dataset with 1,898,843 real and valid spam messages stored for over ten years show that they (i) can improve its reliability over time, (ii) detect outdated models without human assistance, and (iii) provide a high classification throughput rate.
Publisher
Research Square Platform LLC
Reference53 articles.
1. Botnet Spam Email Detection Using Deep Recurrent Neural Network;Alauthman M;Int J Emerg Trends Eng Res,2020
2. Towards an Unsupervised Feature Selection Method for Effective Dynamic Features;Almusallam N;IEEE Access
3. Alom Z, Carminati B, Ferrari E (2020) A deep learning model for Twitter spam detection," in Online Social Networks and Media, vol. 18. https://doi.org/10.1016/j.osnem.2020.100079
4. Apache Spam Assassin – The #1 Enterprise Open-Source Spam Filter. (n.d.) https://spamassassin.apache.org/
5. Apruzzese G, Colajanni M, Ferretti L, Guido A, Marchetti M (2018) On the effectiveness of machine and deep learning for cyber security. 2018 10th International Conference on Cyber Conflict (CyCon), Tallinn, Estonia, pp. 371–390. https://doi.org/10.23919/CYCON.2018.8405026