Optimized URL Feature Selection Based on Genetic-Algorithm-Embedded Deep Learning for Phishing Website Detection-Reference-Cited by-同舟云学术

Optimized URL Feature Selection Based on Genetic-Algorithm-Embedded Deep Learning for Phishing Website Detection

Published:2022-03-30 Issue:7 Volume:11 Page:1090
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Bu Seok-Jun^ORCID,Kim Hae-Jung

Abstract

Deep learning models for phishing URL classification based on character- and word-level URL features achieve the best performance in terms of accuracy. Various improvements have been proposed through deep learning parameters, including the structure and learning strategy. However, the existing deep learning approach shows a degradation in recall according to the nature of a phishing attack that is immediately discarded after being reported. An additional optimization process that can minimize the false negatives by selecting the core features of phishing URLs is a promising avenue of improvement. To search the optimal URL feature set and to fully exploit it, we propose a combined searching and learning strategy that effectively models the URL classifier for recall. By incorporating the deep-learning-based URL classifier with the genetic algorithm to search the optimal feature set that minimizing the false negatives, an optimized classifier that guarantees the best performance was obtained. Extensive experiments on three real-world datasets consisting of 222,541 URLs showed the highest recall among the deep learning models. We demonstrated the superiority of the method by 10-fold cross-validation and confirmed that the recall improved compared to the latest deep learning method. In particular, the accuracy and recall were improved by 4.13%p and 7.07%p, respectively, compared to the convolutional–recurrent neural network in which the feature selection optimization was omitted.

Funder

National Research Foundation of Korea

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/11/7/1090/pdf

Reference23 articles.

1. PhishStorm: Detecting Phishing With Streaming Analytics

2. Deep Character-Level Anomaly Detection Based on a Convolutional Autoencoder for Zero-Day Phishing URL Detection

3. Accurate and fast URL phishing detector: A convolutional neural network approach

4. URLNet: Learning a URL representation with deep learning for malicious URL detection;Le;arXiv,2018

Cited by 18 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Hybrid optimization enabled squeeze net for phishing attack detection;Computers & Security;2024-09

2. Phishing Webpage Detection via Multi-Modal Integration of HTML DOM Graphs and URL Features Based on Graph Convolutional and Transformer Networks;Electronics;2024-08-22

3. Walkthrough phishing detection techniques;Computers and Electrical Engineering;2024-08

4. Analysis of the Use of Artificial Intelligence in Software-Defined Intelligent Networks: A Survey;Technologies;2024-07-02

5. Detection of phishing URLs with deep learning based on GAN-CNN-LSTM network and swarm intelligence algorithms;Signal, Image and Video Processing;2024-06-17