An Effective Phishing Detection Model Based on Character Level Convolutional Neural Network from URL-Reference-Cited by-同舟云学术

An Effective Phishing Detection Model Based on Character Level Convolutional Neural Network from URL

Published:2020-09-15 Issue:9 Volume:9 Page:1514
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Aljofey Ali,Jiang Qingshan,Qu Qiang,Huang Mingqing^ORCID,Niyigena Jean-Pierre^ORCID

Abstract

Phishing is the easiest way to use cybercrime with the aim of enticing people to give accurate information such as account IDs, bank details, and passwords. This type of cyberattack is usually triggered by emails, instant messages, or phone calls. The existing anti-phishing techniques are mainly based on source code features, which require to scrape the content of web pages, and on third-party services which retard the classification process of phishing URLs. Although the machine learning techniques have lately been used to detect phishing, they require essential manual feature engineering and are not an expert at detecting emerging phishing offenses. Due to the recent rapid development of deep learning techniques, many deep learning-based methods have also been introduced to enhance the classification performance. In this paper, a fast deep learning-based solution model, which uses character-level convolutional neural network (CNN) for phishing detection based on the URL of the website, is proposed. The proposed model does not require the retrieval of target website content or the use of any third-party services. It captures information and sequential patterns of URL strings without requiring a prior knowledge about phishing, and then uses the sequential pattern features for fast classification of the actual URL. For evaluations, comparisons are provided between different traditional machine learning models and deep learning models using various feature sets such as hand-crafted, character embedding, character level TF-IDF, and character level count vectors features. According to the experiments, the proposed model achieved an accuracy of 95.02% on our dataset and an accuracy of 98.58%, 95.46%, and 95.22% on benchmark datasets which outperform the existing phishing URL models.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/9/9/1514/pdf

Reference51 articles.

1. https://docs.apwg.org/reports/apwg_trends_report_q1_2019.pdf

2. The state of phishing attacks

3. Symantec, Executive Summaryhttps://docs.broadcom.com/doc/istr-23-2018-executive-summary-en-aa

4. Kaspersky Lab: Spam and Phishing in 2017https://securelist.com/spam-and-phishing-in-2017/83833/

Cited by 87 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An integrated model based on deep learning classifiers and pre-trained transformer for phishing URL detection;Future Generation Computer Systems;2024-12

2. Phishing Webpage Detection via Multi-Modal Integration of HTML DOM Graphs and URL Features Based on Graph Convolutional and Transformer Networks;Electronics;2024-08-22

3. A comprehensive literature review on phishing URL detection using deep learning techniques;Journal of Cyber Security Technology;2024-07-23

4. Comparative evaluation of machine learning algorithms for phishing site detection;PeerJ Computer Science;2024-06-24

5. APFormer: Anti-Phishing Transformer for Website-Phishing Detection Via Joint Feature Learning;2024 International Conference on Engineering & Computing Technologies (ICECT);2024-05-23