A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning-Reference-Cited by-同舟云学术

A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning

Published:2023-09-25 Issue:19 Volume:23 Page:8070
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Shaukat Muhammad Waqas¹,Amin Rashid²^ORCID,Muslam Muhana Magboul Ali³^ORCID,Alshehri Asma Hassan⁴,Xie Jiang⁵^ORCID

Affiliation:

1. Department of Computer Science, University of Engineering and Technology, Taxila 47050, Pakistan

2. Department of Computer Science, University of Chakwal, Chakwal 48800, Pakistan

3. Department of Information Technology, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University, Riyadh 11432, Saudi Arabia

4. Durma College of Science and Humanities, Shaqra University, Shaqra 11961, Saudi Arabia

5. Department of Electrical and Computer Engineering, The University of North Carolina at Charlotte, 9201 University City Blvd, Charlotte, NC 28223, USA

Abstract

Phishing attacks are evolving with more sophisticated techniques, posing significant threats. Considering the potential of machine-learning-based approaches, our research presents a similar modern approach for web phishing detection by applying powerful machine learning algorithms. An efficient layered classification model is proposed to detect websites based on their URL structure, text, and image features. Previously, similar studies have used machine learning techniques for URL features with a limited dataset. In our research, we have used a large dataset of 20,000 website URLs, and 22 salient features from each URL are extracted to prepare a comprehensive dataset. Along with this, another dataset containing website text is also prepared for NLP-based text evaluation. It is seen that many phishing websites contain text as images, and to handle this, the text from images is extracted to classify it as spam or legitimate. The experimental evaluation demonstrated efficient and accurate phishing detection. Our layered classification model uses support vector machine (SVM), XGBoost, random forest, multilayer perceptron, linear regression, decision tree, naïve Bayes, and SVC algorithms. The performance evaluation revealed that the XGBoost algorithm outperformed other applied models with maximum accuracy and precision of 94% in the training phase and 91% in the testing phase. Multilayer perceptron also worked well with an accuracy of 91% in the testing phase. The accuracy results for random forest and decision tree were 91% and 90%, respectively. Logistic regression and SVM algorithms were used in the text-based classification, and the accuracy was found to be 87% and 88%, respectively. With these precision values, the models classified phishing and legitimate websites very well, based on URL, text, and image features. This research contributes to early detection of sophisticated phishing attacks, enhancing internet user security.

Funder

Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/19/8070/pdf

Reference36 articles.

1. Cloud computing platform: Performance analysis of prominent cryptographic algorithms;Ajmal;Concurr. Comput. Pract. Exp.,2022

2. Tandale, K.D., and Pawar, S.N. (2020, January 30–31). Different types of phishing attacks and detection techniques: A review. Proceedings of the 2020 International Conference on Smart Innovations in Design, Environment, Management, Planning and Computing (ICSIDEMPC), Aurangabad, India.

3. Phishing activity trends report: 4th quarter 2016;APWG;Anti-Phishing Work. Group. Retrieved Dec.,2017

4. Smart home security: Challenges, issues and solutions at different IoT layers;Touqeer;J. Supercomput.,2021

5. Alabdan, R. (2020). Phishing attacks survey: Types, vectors, and technical approaches. Future Internet, 12.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Time series forecasting and anomaly detection using deep learning;Computers & Chemical Engineering;2024-03