Affiliation:
1. Department of Electronic and Communication Technologies, School of Engineering, Universidad Autónoma de Madrid, 28049 Madrid, Spain
Abstract
Real-time traffic classification is one of the most important challenges for both Internet Service Providers and users, because correct traffic policing and planning allow for proper optimization of the network resources. However, there is no perfect solution for this problem, due to the grade of complexity of modern traffic. Nowadays, convolutional neural networks (CNNs) are believed to be the miraculous solution for network packet classification of encrypted traffic. Nevertheless, given the obscure nature of deep learning, an appropriate explanation could not be easily obtained on how the model detects each traffic category. In this paper, we present an analysis on some popular CNN-based models for network packet classification, focusing on how the model works and how it was implemented, trained, and tested. By using eXplainable Artificial Intelligence (XAI), we are able to extract the most important regions of the models and extract some reasoning to justify their decisions. Moreover, in the process, we look for possible flawed methodologies that can lead to data leakage or an unrealistic performance evaluation. The results show that CNNs mainly focus on the packet length to make a decision, which is definitely a waste of resources. As we also check, the same could also be implemented with simpler machine learning models, such as decision trees. Our findings indicate that poor experimental protocols result in an unrealistic performance evaluation. Moreover, XAI techniques are of great help in the assessment of the model, showing that CNNs do not detect significant features in encrypted payloads apart from packet length.
Funder
Spanish State Research Agency under the project AgileMon
Spanish Ministry of Science, Innovation and Universities under the program for the training of university lecturers
Reference44 articles.
1. Botnet Detection by Monitoring Similar Communication Patterns;Zeidanloo;Int. J. Comput. Sci. Inf. Secur.,2010
2. Bremler-Barr, A., Harchol, Y., Hay, D., and Koral, Y. (2014, January 2–5). Deep packet inspection as a service. Proceedings of the 10th ACM International on Conference on Emerging Networking Experiments and Technologies, Sydney, Australia.
3. An SVM-based machine learning method for accurate internet traffic classification;Yuan;Inf. Syst. Front.,2010
4. Deep learning for encrypted traffic classification: An overview;Rezaei;IEEE Commun. Mag.,2019
5. Complex network classification with convolutional neural network;Xin;Tsinghua Sci. Technol.,2020