Probabilistic lossy counting

Published: 2008-01-30 Issue: 1 Volume: 38 Page: 5-5
ISSN: 0146-4833
Container-title: ACM SIGCOMM Computer Communication Review
Language: en
Short-container-title: SIGCOMM Comput. Commun. Rev.

Author:

Dimitropoulos Xenofontas¹, Hurley Paul¹, Kind Andreas¹

Affiliation:

1. IBM Zurich Research Laboratory

Abstract

Knowledge of the largest traffic flows in a network is important for many network management applications. The problem of finding these flows is known as the heavy-hitter problem and has been the subject of many studies in the past years. One of the most efficient and well-known algorithms for finding heavy hitters is lossy counting [29]. In this work we introduce probabilistic lossy counting (PLC), which enhances lossy counting in computing network traffic heavy hitters. PLC uses a tighter error bound on the estimated sizes of traffic flows and provides probabilistic rather than deterministic guarantees on its accuracy. The probabilistic error bound substantially improves the memory consumption of the algorithm. In addition, PLC reduces the rate of false positives of lossy counting and achieves a low estimation error, although slightly higher than that of lossy counting. We compare PLC with state-of-the-art algorithms for finding heavy hitters. Our experiments using real traffic traces find that PLC has 1) between 34.4% and 74% lower memory consumption, 2) between 37.9% and 40.5% fewer false positives than lossy counting, and 3) a small estimation error.
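
For orientation, the following is a minimal Python sketch of the baseline the abstract builds on: classic deterministic lossy counting, not PLC itself. The class name `LossyCounter` and the parameters `epsilon` (error bound) and `support` (heavy-hitter threshold) are hypothetical names chosen for illustration. The abstract does not give PLC's probabilistic error bound, so it is not reproduced; a comment only marks the line where lossy counting assigns its deterministic per-entry bound, which is the quantity PLC is described as tightening.

```python
import math


class LossyCounter:
    """Classic lossy counting over a stream of hashable keys (e.g. flow IDs)."""

    def __init__(self, epsilon):
        if not 0 < epsilon < 1:
            raise ValueError("epsilon must be in (0, 1)")
        self.epsilon = epsilon
        self.width = math.ceil(1 / epsilon)  # bucket (window) width
        self.n = 0                           # number of items seen so far
        self.entries = {}                    # key -> [count, delta]

    def add(self, key):
        self.n += 1
        bucket = math.ceil(self.n / self.width)  # current bucket id
        entry = self.entries.get(key)
        if entry is not None:
            entry[0] += 1
        else:
            # Deterministic error bound: the key may have been seen (and
            # pruned) at most bucket - 1 times before this insertion.
            # Assumption: this is the bound PLC replaces with a tighter,
            # probabilistic one; that bound is not shown here.
            self.entries[key] = [1, bucket - 1]
        if self.n % self.width == 0:
            # Bucket boundary: drop entries that cannot exceed the bound.
            self.entries = {
                k: e for k, e in self.entries.items() if e[0] + e[1] > bucket
            }

    def heavy_hitters(self, support):
        """Return keys whose counted frequency is at least (support - epsilon) * n."""
        threshold = (support - self.epsilon) * self.n
        return {k: e[0] for k, e in self.entries.items() if e[0] >= threshold}


if __name__ == "__main__":
    # Toy stream: two large "flows" and twenty one-packet flows.
    stream = ["a"] * 50 + ["b"] * 30 + [f"x{i}" for i in range(20)]
    counter = LossyCounter(epsilon=0.1)
    for key in stream:
        counter.add(key)
    print(counter.heavy_hitters(support=0.2))
```

In this toy stream the twenty one-packet keys are pruned at bucket boundaries, so only the two large keys remain in the table. The size of that table is the memory consumption the abstract refers to, and entries reported despite being below the true threshold are the false positives.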

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications, Software

Link

https://dl.acm.org/doi/pdf/10.1145/1341431.1341433

References: 34 articles.

1. L. A. Adamic. Zipf Power-laws and Pareto - a ranking tutorial. http://www.hpl.hp.com/research/idl/papers/ranking/ranking.html.

2. Tracking join and self-join sizes in limited storage

3. The space complexity of approximating the frequency moments

4. Distributed top-k monitoring

5. Ranking flows from sampled traffic

Cited by 62 articles.

1. Learning-Based Sketch for Adaptive and High-Performance Network Measurement;IEEE/ACM Transactions on Networking;2024-06

2. CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models;Proceedings of the ACM on Management of Data;2024-03-12

3. Single Update Sketch with Variable Counter Structure;Proceedings of the VLDB Endowment;2023-09

4. From CountMin to Super kJoin Sketches for Flow Spread Estimation;IEEE Transactions on Network Science and Engineering;2023

5. TalentSketch: LSTM-based Sketch for Adaptive and High-Precision Network Measurement;2022 IEEE 30th International Conference on Network Protocols (ICNP);2022-10-30