Authors:
Abdulrahman Saifuldeen H, Salim Mohammad
Abstract
Email has become the most economical and fastest form of communication. However, over the past few years, the growth in the number of email users has dramatically increased the volume of spam. Various anti-spam techniques have been developed to minimize, if not eliminate, the spam problem. In this paper, we study how the effectiveness of different decision tree algorithms varies when they are used to classify emails and combat spam. We chose Universiti Utara Malaysia emails as a case study. To achieve the best possible classification accuracy, we compared the performance of all chosen algorithms: Random Forest, LMT, Decision Stump, J48, Random Tree, and REP Tree. The experimental results showed that the Decision Stump algorithm is the most effective for classifying the emails, with F-measure, precision, and recall scores higher than those of the other algorithms compared.
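For readers unfamiliar with the three scores named above, the following is a minimal sketch of how precision, recall, and F-measure are conventionally computed for the spam class from confusion-matrix counts. The function name and the counts in the usage lines are hypothetical illustrations, not the paper's code or results.

```python
def precision_recall_f_measure(tp, fp, fn):
    """Return (precision, recall, F-measure) for the positive (spam) class.

    tp: spam emails correctly flagged as spam
    fp: legitimate emails wrongly flagged as spam
    fn: spam emails that were missed
    """
    # Precision: of everything flagged as spam, how much really was spam.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: of all real spam, how much was caught.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-measure: harmonic mean of precision and recall.
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# Hypothetical counts for illustration only (not taken from the paper).
precision, recall, f_measure = precision_recall_f_measure(tp=80, fp=20, fn=20)
print(precision, recall, f_measure)
```

Comparing classifiers on these three scores, rather than on accuracy alone, matters for spam filtering because the two kinds of error (blocking legitimate mail versus letting spam through) have different costs.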
Cited by
1 article.
1. Naive Bayesian Spam Filtering;Highlights in Science, Engineering and Technology;2023-03-16