Affiliation:
1. Department of Computer Science , National University of Defense Technology, No. 109, Deya Road, Changsha City, Hunan Province, 410073, China
2. 360 Digital Security Group , Beijing, 100015, China
Abstract
Abstract
Botnets currently use domain-generation algorithms to produce fast-flux domains that enable them to evade detection. Accurately categorizing these botnet domains is crucial to develop cybersecurity solutions against botnet threats. However, existing methods, requiring labeled data, are ineffective against new botnets. To address this issue, we propose Domain2Vec, a metric learning-based approach that can explore new botnets. Domain2Vec integrates a framework of metric learning, which uses individual domains from known botnets for categorization of unknown botnet domains. The training involves an attention-based encoder, and it includes a constraint to ensure that samples with the same labels are closer in the embedding space. The categorization uses the encoder to project domain names into appropriate representations (numerical vectors), even for domains from new botnets. Finally, Domain2Vec uses numerical vectors to explore botnets. Experiments showed that Domain2Vec performs well on domain retrieval and clustering tasks without labeled data, outperforming the state of the art by 13% and 100%, respectively. Real-world tests demonstrate that Domain2Vec can effectively identify unreported malicious domains and monitor botnet activities.
Funder
National Key Research and Development Program of China
National Natural Science Foundation of China
Science and Technology Innovation Program of Hunan Province
Publisher
Oxford University Press (OUP)
Reference35 articles.
1. Kindred domains: detecting and clustering botnet domains using dns traffic;Thomas,2014
2. Understanding the mirai botnet;Antonakakis,2017
3. Real-time behavioral dga detection through machine learning;Bisio,2017
4. Unveiling zeus: automated classification of malware samples;Mohaisen,2013
5. Peerclean: Unveiling peer-to-peer botnets through dynamic group behavior analysis;Yan,2015