TSFN: A Novel Malicious Traffic Classification Method Using BERT and LSTM

Published:2023-05-19 Issue:5 Volume:25 Page:821
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Shi Zhaolei¹,Luktarhan Nurbol¹,Song Yangyang¹,Yin Huixin¹

Affiliation:

1. College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China

Abstract

Traffic classification is the first step in network anomaly detection and is essential to network security. However, existing malicious traffic classification methods have several limitations: statistics-based methods depend on hand-designed features, and deep learning-based methods are sensitive to the balance and adequacy of the data sets. In addition, existing BERT-based malicious traffic classification methods focus only on the global features of traffic and ignore its time-series features. To address these problems, we propose a BERT-based Time-Series Feature Network (TSFN) model, which consists of two modules. The first is a packet encoder module built on the BERT model, which captures the global features of the traffic using the attention mechanism. The second is a temporal feature extraction module built on the LSTM model, which captures the time-series features of the traffic. The global and time-series features of the malicious traffic are then combined into the final feature representation, which better represents the malicious traffic. Experimental results show that the proposed approach effectively improves the accuracy of malicious traffic classification on the publicly available USTC-TFC dataset, reaching an F1 value of 99.50%. This shows that the time-series features in malicious traffic can help improve classification accuracy.
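The two-branch idea in the abstract can be illustrated with a toy sketch in plain Python. It is not the authors' implementation: the attention branch is a single scaled dot-product self-attention step with identity projections, mean pooling, and no positional embeddings (a stand-in for the BERT packet encoder), the recurrent branch is one bias-free LSTM cell with random weights, and the fusion is assumed to be simple concatenation. All function names and dimensions are hypothetical.

```python
import math
import random


def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def attention_pool(packets):
    """Global feature (toy stand-in for the BERT branch): each packet vector
    attends to all packets, then the attended vectors are mean-pooled."""
    d = len(packets[0])
    attended = []
    for q in packets:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in packets]
        weights = softmax(scores)
        attended.append([sum(w * v[i] for w, v in zip(weights, packets)) for i in range(d)])
    return [sum(row[i] for row in attended) / len(attended) for i in range(d)]


def make_lstm_weights(input_dim, hidden_dim, rng):
    """One random matrix per gate (input, forget, candidate, output), acting on [x; h]."""
    def matrix():
        return [[rng.uniform(-0.5, 0.5) for _ in range(input_dim + hidden_dim)]
                for _ in range(hidden_dim)]
    return {gate: matrix() for gate in ("i", "f", "g", "o")}


def lstm_last_hidden(packets, weights, hidden_dim):
    """Time-series feature: run an LSTM cell over the packets in order and
    return the final hidden state (biases omitted for brevity)."""
    h = [0.0] * hidden_dim
    c = [0.0] * hidden_dim
    for x in packets:
        xh = list(x) + h
        pre = {g: [sum(w * v for w, v in zip(row, xh)) for row in W]
               for g, W in weights.items()}
        i_gate = [sigmoid(p) for p in pre["i"]]
        f_gate = [sigmoid(p) for p in pre["f"]]
        cand = [math.tanh(p) for p in pre["g"]]
        o_gate = [sigmoid(p) for p in pre["o"]]
        c = [f * c_old + i * g for f, c_old, i, g in zip(f_gate, c, i_gate, cand)]
        h = [o * math.tanh(c_new) for o, c_new in zip(o_gate, c)]
    return h


def fuse(packets, weights, hidden_dim):
    """Final representation: concatenation of both features (an assumption)."""
    return attention_pool(packets) + lstm_last_hidden(packets, weights, hidden_dim)


# Hypothetical flow: 5 packets, each already embedded as a 4-dimensional vector.
rng = random.Random(0)
flow = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(5)]
lstm_weights = make_lstm_weights(input_dim=4, hidden_dim=3, rng=rng)
feature = fuse(flow, lstm_weights, hidden_dim=3)
print(len(feature))  # 4 global dimensions + 3 temporal dimensions
```

In this toy version the pooled attention feature is unchanged when the packets are reordered, while the LSTM feature is not, which is one way to see why a recurrent branch can add order information. A real BERT encoder uses positional embeddings, so the contrast is weaker in practice.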

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/25/5/821/pdf

References: 31 articles.

1. Zhang, Z., Han, X., Liu, Z., Jiang, X., Sun, M., and Liu, Q. (2019). ERNIE: Enhanced language representation with informative entities. arXiv.

2. Bader, O., Lichy, A., Hajaj, C., Dubin, R., and Dvir, A. (2022, January 8–11). MalDIST: From Encrypted Traffic Classification to Malware Traffic Detection and Classification. Proceedings of the 2022 IEEE 19th Annual Consumer Communications & Networking Conference (CCNC), Las Vegas, NV, USA.

3. Wang, W., Zhu, M., Wang, J., Zeng, X., and Yang, Z. (2017, January 22–24). End-to-end encrypted traffic classification with one-dimensional convolution neural networks. Proceedings of the 2017 IEEE International Conference on Intelligence and Security Informatics (ISI), Beijing, China.

4. Lin, X., Xiong, G., Gou, G., Li, Z., Shi, J., and Yu, J. (2022, January 25–29). ET-BERT: A Contextualized Datagram Representation with Pre-training Transformers for Encrypted Traffic Classification. Proceedings of the ACM Web Conference 2022, Lyon, France.

5. Wang, W., Zhu, M., Zeng, X., Ye, X., and Sheng, Y. (2017, January 11–13). Malware traffic classification using convolutional neural network for representation learning. Proceedings of the 2017 IEEE International Conference on Information Networking (ICOIN), Da Nang, Vietnam.

Cited by 8 articles.

1. A novel approach for application classification with encrypted traffic using BERT and packet headers;Computer Networks;2024-12

2. Feasibility of State Space Models for Network Traffic Generation;Proceedings of the 2024 SIGCOMM Workshop on Networks for AI Computing;2024-08-04

3. A Review of Advancements and Applications of Pre-Trained Language Models in Cybersecurity;2024 12th International Symposium on Digital Forensics and Security (ISDFS);2024-04-29

4. A Multi-Scenario Traffic Classification Method Based on Pretrained Encoder and Text Convolutional Neural Network;2024 IEEE 7th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC);2024-03-15

5. Anomaly Detection Method for Integrated Encrypted Malicious Traffic Based on RFCNN-GRU;Communications in Computer and Information Science;2024