A Look Behind the Curtain: Traffic Classification in an Increasingly Encrypted Web-Reference-Cited by-同舟云学术

A Look Behind the Curtain: Traffic Classification in an Increasingly Encrypted Web

Published:2021-02-18 Issue:1 Volume:5 Page:1-26
ISSN:2476-1249
Container-title:Proceedings of the ACM on Measurement and Analysis of Computing Systems
language:en
Short-container-title:Proc. ACM Meas. Anal. Comput. Syst.

Author:

Akbari Iman¹,Salahuddin Mohammad A.¹,Ven Leni¹,Limam Noura¹,Boutaba Raouf¹,Mathieu Bertrand²,Moteau Stephanie²,Tuffin Stephane²

Affiliation:

1. University of Waterloo, Waterloo, ON, Canada

2. Orange Labs, Lannion, France

Abstract

Traffic classification is essential in network management for operations ranging from capacity planning, performance monitoring, volumetry, and resource provisioning, to anomaly detection and security. Recently, it has become increasingly challenging with the widespread adoption of encryption in the Internet, e.g., as a de-facto in HTTP/2 and QUIC protocols. In the current state of encrypted traffic classification using Deep Learning (DL), we identify fundamental issues in the way it is typically approached. For instance, although complex DL models with millions of parameters are being used, these models implement a relatively simple logic based on certain header fields of the TLS handshake, limiting model robustness to future versions of encrypted protocols. Furthermore, encrypted traffic is often treated as any other raw input for DL, while crucial domain-specific considerations exist that are commonly ignored. In this paper, we design a novel feature engineering approach that generalizes well for encrypted web protocols, and develop a neural network architecture based on Stacked Long Short-Term Memory (LSTM) layers and Convolutional Neural Networks (CNN) that works very well with our feature design. We evaluate our approach on a real-world traffic dataset from a major ISP and Mobile Network Operator. We achieve an accuracy of 95% in service classification with less raw traffic and smaller number of parameters, out-performing a state-of-the-art method by nearly 50% fewer false classifications. We show that our DL model generalizes for different classification objectives and encrypted web protocols. We also evaluate our approach on a public QUIC dataset with finer and application-level granularity in labeling, achieving an overall accuracy of 99%.

Publisher

Association for Computing Machinery (ACM)

Subject

General Medicine

Link

https://dl.acm.org/doi/pdf/10.1145/3447382

Reference43 articles.

1. Mobile Encrypted Traffic Classification Using Deep Learning

2. Mobile Encrypted Traffic Classification Using Deep Learning: Experimental Evaluation, Lessons Learned, and Challenges

3. Machine learning based encrypted traffic classification: Identifying SSH and Skype

4. Machine Learning for Encrypted Malware Traffic Classification

Cited by 22 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Streaming traffic classification: a hybrid deep learning and big data approach;Cluster Computing;2024-01-18

2. OSF-EIMTC: An open-source framework for standardized encrypted internet traffic classification;Computer Communications;2024-01

3. Extensible Machine Learning for Encrypted Network Traffic Application Labeling via Uncertainty Quantification;IEEE Transactions on Artificial Intelligence;2024-01

4. DataZoo: Streamlining Traffic Classification Experiments;Proceedings of the 2023 on Explainable and Safety Bounded, Fidelitous, Machine Learning for Networking;2023-12-05

5. Application-layer Characterization and Traffic Analysis for Encrypted QUIC Transport Protocol;2023 IEEE Conference on Communications and Network Security (CNS);2023-10-02