Affiliation:
1. Department of CS&IT, Maulana Azad National Urdu University, Hyderabad, India
2. Amity Global Institute, Singapore 238466, Singapore
Abstract
The revolution of Industry 4.0 will leave an impact on the domain of
everyone's lives directly or indirectly. Several new complex applications will be
developed in the days to come that are complicated to predict in the current scenario.
With the help of machine learning approaches and intelligent IoT devices, people will
be relieved from extra overheads of redundant work currently being performed.
Industry 4.0 has become a significant catalyst for innovation and development in
various industrial sectors like production processes and quality improvement with
greater flexibility. This chapter applied different machine learning algorithms for spam
detection and classifying emails into legitimate and spam. Seven classification models:
Decision Trees, Random Forest, Artificial Neural Network, Gradient Boosting
Machines, AdaBoost, Naive Bayes, and Support Vector Machines are applied. Three
benchmark spam datasets are extracted from standard repositories to conduct the
experiments. The chapter also presents a quantitative performance analysis. The results
from rigorous experiments reveal that ensemble methods, Gradient Boosting and
AdaBoost, outperformed other methods with an overall accuracy of 98.70% and
98.18%, respectively. The ensembled models are effective on a large-sized dataset
embedded with more extensive features. The performance of non-ensemble methods,
ANN and Naïve Bayes, was instrumental on large datasets as a viable alternative, with
an overall accuracy of 98.38% and 97.63% on test data.<br>
Publisher
BENTHAM SCIENCE PUBLISHERS
Reference40 articles.
1. Ikonomarkis M.; Kotsiantis S.; Tampakas V.; Text classification using machine learning techniques. WSEAS Trans Comput 2005,8(4),966-974
2. Dada E-G.; Bassi J.S.; Chiroma H.; Abdulhamid S.M.; Adetunmbi A.O.; Ajibuwa O.E.; Machine learning for email spam filtering: review, approaches and open research problems. Heliyon 2019,5(6),e01802
3. Awad M.; Foqaha M.; Email spam classification using hybrid approach of RBF neural network and particle swarm optimization. International Journal of Network Security 2016,8(4),17-28
4. Sebastiani F.; Machine learning in automated text categorization. ACM Comput Surv 2002,34(1),1-47
5. Clement J.; Spam statistics: spam email traffic share 2019 [Accessed: 23-Jun-2021] https://www.statista.com/statistics/ 420391/spam-email-traffic-share/