Affiliation:
1. State Key Laboratory of Astronautic Dynamics Xi'an Satellite Control Center Xi'an China
2. State Key Laboratory of Integrated Services Networks Xidian University Xi'an China
Abstract
AbstractMalicious domains provide malware with covert communication channels which poses a severe threat to cybersecurity. Despite the continuous progress in detecting malicious domains with various machine learning algorithms, maintaining up‐to‐date various samples with fine‐labeled data for training is difficult. To handle these issues and improve the detection accuracy, a novel malicious domain detection method named MDND‐SS‐PO is proposed that combines semi‐supervised learning and parameter optimization. The contributions of the study are as follows. First, the method extracts the statistical features of the IP address, TTL value, the NXDomain record, and the domain name query characteristics to discriminate Domain‐Flux and Fast‐Flux domain names simultaneously. Second, an improved DBSCAN based on the neighborhood division is designed to cluster labeled data and unlabeled data with low time consumption. Then, based on the clustering hypothesis, unlabeled data is tagged with pseudo‐label according to the cluster results, which aims to train a supervised classifier effectively. Finally, Gaussian process regression is used to optimize parameter settings of the algorithm. And the Silhouette index and F1 score are introduced to evaluate the optimization results. Experimental results show that the proposed method achieved a precise detection performance of 0.885 when the ratio of labeled data is 5%.
Funder
National Key Research and Development Program of China
National Natural Science Foundation of China
Publisher
Institution of Engineering and Technology (IET)
Reference25 articles.
1. A survey on malicious domains detection through DNS data analysis;Yury Z.;ACM Comput. Surv.,2018
2. Unsupervised malicious domain detection with less labeling effort
3. A density‐based algorithm for discovering clusters in large spatial databases with noise;Ester M.;Knowl. Discov. Data Mining,1996
4. Detect Fast‐Flux domain name with DGA through IP fluctuation;Jiang H.;Int. J. Netw. Secu,2021
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献