Detecting Phishing URLs With Word Embedding and Deep Learning

Published: 2023-06-30 Page: 296-319
ISSN:2327-3453
Container-title:Advances in Systems Analysis, Software Engineering, and High Performance Computing

Author:

Selamat Ali¹, Quang Do Nguyet², Krejcar Ondrej³

Affiliation:

1. Universiti Teknologi Malaysia, Malaysia & Hradec Kralove University, Czech Republic

2. Universiti Teknologi Malaysia, Malaysia

3. Hradec Kralove University, Czech Republic

Abstract

The past decade has witnessed the rapid development of natural language processing and machine learning in the phishing detection domain. However, word embedding and deep learning remain under-researched for malicious URL classification. To address this gap, this chapter examines the application of word embedding and deep learning to extracting features from website URLs. Several word embedding techniques, such as Keras, Word2Vec, GloVe, and FastText, were used to learn feature representations of webpage URLs. The resulting feature vectors were fed into a deep learning model based on CNN-BiGRU for feature extraction and classification. Numerous experiments were conducted on two different datasets, and various metrics were used to evaluate the phishing detection model's performance. The findings indicate that, when combined with deep learning, Keras outperformed the other text embedding methods and achieved the best results across all evaluation metrics on both datasets.
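
The abstract does not say how URLs are tokenized before embedding, so the following is only a minimal sketch of one plausible preprocessing step: splitting each URL into word-like tokens and encoding them as fixed-length integer sequences, which is the kind of input an embedding layer typically expects. The function names, the splitting rule, the reserved ids, and the example URLs are all assumptions made for illustration and are not taken from the chapter.

```python
import re


def tokenize_url(url):
    """Split a URL into lowercase tokens on any non-alphanumeric character.

    Assumption: word-level splitting; the chapter may tokenize differently.
    """
    return [t for t in re.split(r"[^A-Za-z0-9]+", url.lower()) if t]


def build_vocab(urls):
    """Assign an integer id to each token; 0 is padding, 1 is unknown."""
    vocab = {"<pad>": 0, "<unk>": 1}
    for url in urls:
        for tok in tokenize_url(url):
            vocab.setdefault(tok, len(vocab))
    return vocab


def encode(url, vocab, max_len):
    """Convert a URL to a list of token ids, truncated or zero-padded to max_len."""
    ids = [vocab.get(t, vocab["<unk>"]) for t in tokenize_url(url)][:max_len]
    return ids + [vocab["<pad>"]] * (max_len - len(ids))


# Hypothetical example URLs, not drawn from the chapter's datasets.
urls = [
    "http://example.com/login",
    "http://secure-example.com/update/account",
]
vocab = build_vocab(urls)
encoded = [encode(u, vocab, max_len=8) for u in urls]
```

Sequences of this form could then be passed to a trainable embedding layer or mapped to pretrained vectors such as Word2Vec, GloVe, or FastText before reaching the classifier; that downstream wiring is left out here because the abstract gives no details about it.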

Publisher

IGI Global

Reference: 43 articles.

1. URLdeepDetect: A Deep Learning Approach for Detecting Malicious URLs Using Semantic Vector Models

2. A Deep Learning Technique for Web Phishing Detection Combined URL Features and Visual Similarity

3. Detecting ransomware attacks using intelligent algorithms: recent development and next direction from deep learning and big data perspectives

4. A Phishing-Attack-Detection Model Using Natural Language Processing and Deep Learning

5. Is this URL Safe: Detection of Malicious URLs Using Global Vector for Word Representation