Affiliation:
1. Georgia Institute of Technology
2. AT&T Labs -- Research
Abstract
Knowing the distribution of the sizes of traffic flows passing through a network link helps a network operator to characterize network resource usage, infer traffic demands, detect traffic anomalies, and accommodate new traffic demands through better traffic engineering. Previous work on estimating the flow size distribution has been focused on making inferences from sampled network traffic. Its accuracy is limited by the (typically) low sampling rate required to make the sampling operation affordable. In this paper we present a novel data streaming algorithm to provide much more accurate estimates of flow distribution, using a "lossy data structure" which consists of an array of counters fitted well into SRAM. For each incoming packet, our algorithm only needs to increment one underlying counter, making the algorithm fast enough even for 40 Gbps (OC-768) links. The data structure is lossy in the sense that sizes of multiple flows may collide into the same counter. Our algorithm uses Bayesian statistical methods such as Expectation Maximization to infer the most likely flow size distribution that results in the observed counter values after collision. Evaluations of this algorithm on large Internet traces obtained from several sources (including a tier-1 ISP) demonstrate that it has very high measurement accuracy (within 2%). Our algorithm not only dramatically improves the accuracy of flow distribution measurement, but also contributes to the field of data streaming by formalizing an existing methodology and applying it to the context of estimating the flow-distribution.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture,Software
Cited by
92 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Server-Assisted Traffic Measurement for Programmable Data Center Networks;IEEE Transactions on Network Science and Engineering;2024-09
2. Learning-Based Sketch for Adaptive and High-Performance Network Measurement;IEEE/ACM Transactions on Networking;2024-06
3. BitMatcher: Bit-level Counter Adjustment for Sketches;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13
4. Scalable Overspeed Item Detection in Streams;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13
5. Hierarchical Sketch: An Efficient Solution for Threshold-t Flows Measurement in High-Speed Networks;2023 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom);2023-12-21