Learning to detect malicious URLs

Published:2011-04 Issue:3 Volume:2 Page:1-24
ISSN:2157-6904
Container-title:ACM Transactions on Intelligent Systems and Technology
language:en
Short-container-title:ACM Trans. Intell. Syst. Technol.

Author:

Ma Justin¹, Saul Lawrence K.², Savage Stefan², Voelker Geoffrey M.²

Affiliation:

1. University of California, Berkeley

2. University of California, San Diego

Abstract

Malicious Web sites are a cornerstone of Internet criminal activities. The dangers of these sites have created a demand for safeguards that protect end-users from visiting them. This article explores how to detect malicious Web sites from the lexical and host-based features of their URLs. We show that this problem lends itself naturally to modern algorithms for online learning. Online algorithms not only process large numbers of URLs more efficiently than batch algorithms, they also adapt more quickly to new features in the continuously evolving distribution of malicious URLs. We develop a real-time system for gathering URL features and pair it with a real-time feed of labeled URLs from a large Web mail provider. From these features and labels, we are able to train an online classifier that detects malicious Web sites with 99% accuracy over a balanced dataset.
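The abstract's central idea, training a classifier online from URL-derived features as labeled URLs arrive, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not name the learning algorithm, so a plain perceptron stands in for it; the tokenization in `lexical_features`, the toy URLs in `STREAM`, and all function names are hypothetical; host-based features (which the article also uses) are omitted because they require network lookups.

```python
import re
from urllib.parse import urlparse


def lexical_features(url):
    """Binary bag-of-words features from the hostname and path of a URL.

    Hypothetical tokenization, chosen for illustration only.
    """
    parsed = urlparse(url if "://" in url else "http://" + url)
    feats = {"bias": 1.0}
    for tok in re.split(r"[.\-]", parsed.hostname or ""):
        if tok:
            feats["host=" + tok] = 1.0
    for tok in re.split(r"[/?.=\-_&]", parsed.path + "?" + parsed.query):
        if tok:
            feats["path=" + tok] = 1.0
    return feats


class OnlinePerceptron:
    """Sparse perceptron: the weight vector grows as new features appear."""

    def __init__(self):
        self.w = {}

    def score(self, feats):
        return sum(self.w.get(k, 0.0) * v for k, v in feats.items())

    def predict(self, feats):
        # +1 = malicious, -1 = benign
        return 1 if self.score(feats) > 0 else -1

    def update(self, feats, label):
        """Predict first, then correct the weights only on a mistake."""
        mistake = self.predict(feats) != label
        if mistake:
            for k, v in feats.items():
                self.w[k] = self.w.get(k, 0.0) + label * v
        return mistake


def train_online(stream):
    """Process (url, label) pairs one at a time, in arrival order."""
    model = OnlinePerceptron()
    mistakes = 0
    for url, label in stream:
        mistakes += model.update(lexical_features(url), label)
    return model, mistakes


# Made-up labeled stream (assumption): +1 malicious, -1 benign.
STREAM = [
    ("http://login-secure.example-bank.test/verify/account.php", 1),
    ("http://www.example.org/docs/index.html", -1),
    ("http://update.paypa1.test/signin/verify.php", 1),
    ("http://news.example.com/articles/today.html", -1),
]

if __name__ == "__main__":
    model, mistakes = train_online(STREAM)
    print("mistakes during the pass:", mistakes)
    print("features seen so far:", len(model.w))
```

The sketch reflects two properties the abstract attributes to online algorithms: each URL is processed once without revisiting earlier data, and a feature that appears for the first time gets a weight as soon as it is involved in an update.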

Funder

Office of Naval Research

National Science Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence, Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/1961189.1961202

Reference 44 articles.

1. A comparison of machine learning techniques for phishing detection

Cited by 114 articles.

1. PMANet: Malicious URL detection via post-trained language model guided multi-level feature attention network;Information Fusion;2025-01

2. One-step Bayesian example-dependent cost classification: The OsC-MLP method;Neural Networks;2024-05

3. Detection of Malicious Websites using Machine Learning;International Journal of Innovative Science and Research Technology (IJISRT);2024-03-29

4. Multi-Modal Features Representation-Based Convolutional Neural Network Model for Malicious Website Detection;IEEE Access;2024

5. Machine learning methods of sleuthing malevolent web channels;AIP Conference Proceedings;2024