Phishing URLs Detection Using Sequential and Parallel ML Techniques: Comparative Analysis-Reference-Cited by-同舟云学术

Phishing URLs Detection Using Sequential and Parallel ML Techniques: Comparative Analysis

Published:2023-03-26 Issue:7 Volume:23 Page:3467
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Nagy Naya¹,Aljabri Malak²^ORCID,Shaahid Afrah³^ORCID,Ahmed Amnah Albin³,Alnasser Fatima³^ORCID,Almakramy Linda³,Alhadab Manar³^ORCID,Alfaddagh Shahad³

Affiliation:

1. SAUDI ARAMCO Cybersecurity Chair, Department of Networks and Communication, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia

2. Department of Computer Science, College of Computers and Information Systems, Umm Al-Qura University, Makkah 21955, Saudi Arabia

3. SAUDI ARAMCO Cybersecurity Chair, Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia

Abstract

In today’s digitalized era, the world wide web services are a vital aspect of each individual’s daily life and are accessible to the users via uniform resource locators (URLs). Cybercriminals constantly adapt to new security technologies and use URLs to exploit vulnerabilities for illicit benefits such as stealing users’ personal and sensitive data, which can lead to financial loss, discredit, ransomware, or the spread of malicious infections and catastrophic cyber-attacks such as phishing attacks. Phishing attacks are being recognized as the leading source of data breaches and the most prevalent deceitful scam of cyber-attacks. Artificial intelligence (AI)-based techniques such as machine learning (ML) and deep learning (DL) have proven to be infallible in detecting phishing attacks. Nevertheless, sequential ML can be time intensive and not highly efficient in real-time detection. It can also be incapable of handling vast amounts of data. However, utilizing parallel computing techniques in ML can help build precise, robust, and effective models for detecting phishing attacks with less computation time. Therefore, in this proposed study, we utilized various multiprocessing and multithreading techniques in Python to train ML and DL models. The dataset used comprised 54 K records for training and 12 K for testing. Five experiments were carried out, the first one based on sequential execution followed by the next four based on parallel execution techniques (threading using Python parallel backend, threading using Python parallel backend and number of jobs, threading manually, and multiprocessing using Python parallel backend). Four models, namely, random forest (RF), naïve bayes (NB), convolutional neural network (CNN), and long short-term memory (LSTM) were deployed to carry out the experiments. Overall, the experiments yielded excellent results and speedup. Lastly, to consolidate, a comprehensive comparative analysis was performed.

Funder

SAUDI ARAMCO Cybersecurity Chair at Imam Abdulrahman Bin Faisal University

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/7/3467/pdf

Reference34 articles.

1. An effective detection approach for phishing websites using URL and HTML features;Aljofey;Sci. Rep.,2022

2. (2022, December 19). Number of Global Phishing Sites 2021|Statista. Available online: https://www.statista.com/statistics/266155/number-of-phishing-domain-names-worldwide/.

3. Aljabri, M., and Mirza, S. (2022, January 1–3). Phishing Attacks Detection using Machine Learning and Deep Learning Models. Proceedings of the 2022 7th International Conference on Data Science and Machine Learning Applications (CDMA), Riyadh, Saudi Arabia.

4. Detecting Malicious URLs Using Machine Learning Techniques: Review and Research Directions;Aljabri;IEEE Access,2022

5. Machine learning-based social media bot detection: A comprehensive literature review;Aljabri;Soc. Netw. Anal. Min.,2023

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. OEC Net: Optimal feature selection-based email classification network using unsupervised learning with deep CNN model;e-Prime - Advances in Electrical Engineering, Electronics and Energy;2024-03

2. Detection of phishing addresses and pages with a data set balancing approach by generative adversarial network (GAN) and convolutional neural network (CNN) optimized with swarm intelligence;Concurrency and Computation: Practice and Experience;2024-01-29

3. Android Ransomware Detection Using Supervised Machine Learning Techniques Based on Traffic Analysis;Sensors;2023-12-28

4. The impact of artificial intelligence on organisational cyber security: An outcome of a systematic literature review;Data and Information Management;2023-12

5. A Hybrid Transformer Ensemble Approach for Phishing Website Detection;2023 International Conference on Self Sustainable Artificial Intelligence Systems (ICSSAS);2023-10-18