DGA domain embedding with deep metric learning-Reference-Cited by-同舟云学术

DGA domain embedding with deep metric learning

Published:2024-07-29 Issue: Volume: Page:
ISSN:0010-4620
Container-title:The Computer Journal
language:en
Short-container-title:

Author:

Yang Yifan¹,Li Xionglve¹,Yang Tao¹,Hou Bingnan¹,Zeng Lingbin¹,Cai Zhiping¹,Kuang Wenyuan²

Affiliation:

1. Department of Computer Science , National University of Defense Technology, No. 109, Deya Road, Changsha City, Hunan Province, 410073, China

2. 360 Digital Security Group , Beijing, 100015, China

Abstract

Abstract Botnets currently use domain-generation algorithms to produce fast-flux domains that enable them to evade detection. Accurately categorizing these botnet domains is crucial to develop cybersecurity solutions against botnet threats. However, existing methods, requiring labeled data, are ineffective against new botnets. To address this issue, we propose Domain2Vec, a metric learning-based approach that can explore new botnets. Domain2Vec integrates a framework of metric learning, which uses individual domains from known botnets for categorization of unknown botnet domains. The training involves an attention-based encoder, and it includes a constraint to ensure that samples with the same labels are closer in the embedding space. The categorization uses the encoder to project domain names into appropriate representations (numerical vectors), even for domains from new botnets. Finally, Domain2Vec uses numerical vectors to explore botnets. Experiments showed that Domain2Vec performs well on domain retrieval and clustering tasks without labeled data, outperforming the state of the art by 13% and 100%, respectively. Real-world tests demonstrate that Domain2Vec can effectively identify unreported malicious domains and monitor botnet activities.

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Science and Technology Innovation Program of Hunan Province

Publisher

Oxford University Press (OUP)

Link

https://academic.oup.com/comjnl/advance-article-pdf/doi/10.1093/comjnl/bxae072/58677799/bxae072.pdf

Reference35 articles.

1. Kindred domains: detecting and clustering botnet domains using dns traffic;Thomas,2014

2. Understanding the mirai botnet;Antonakakis,2017

3. Real-time behavioral dga detection through machine learning;Bisio,2017

4. Unveiling zeus: automated classification of malware samples;Mohaisen,2013

5. Peerclean: Unveiling peer-to-peer botnets through dynamic group behavior analysis;Yan,2015