Affiliation:
1. AT&T Labs-Research, Florham Park, NJ, USA
Abstract
Measurement, collection, and interpretation of network usage data commonly involve multiple stages of sampling and aggregation. Examples include sampling packets and aggregating them into flow statistics at a router, and sampling and aggregating usage records in a network data repository for reporting, querying, and archiving. Although unbiased estimates of packet, byte, and flow usage can be formed for each sampling operation, for many applications it is crucial to know the inherent estimation error. Previous work in this area has been limited mainly to analyzing the estimator variance for particular methods, e.g., independent packet sampling. However, the variance is of limited use for more general sampling methods, where the estimate may not be well approximated by a Gaussian distribution.
This motivates our paper, in which we establish Chernoff bounds on the likelihood of estimation error in a general multistage combination of measurement sampling and aggregation. We derive the scale against which errors are measured, in terms of the constituent sampling and aggregation operations. In particular, this enables us to obtain rigorous confidence intervals around any given estimate. We apply our method to a number of sampling schemes, both in the literature and currently deployed, including sampling of packet-sampled NetFlow records, Sample and Hold, and Flow Slicing. We obtain one particularly striking result in the first case: for a range of parameterizations, packet sampling has no additional impact on the estimator confidence derived from our bound beyond that already imposed by flow sampling.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications, Hardware and Architecture, Software
Cited by
6 articles.
1. Continuously Distinct Sampling over Centralized and Distributed High Speed Data Streams;IEEE Transactions on Parallel and Distributed Systems;2019-02-01
2. Statistical Research in Networks: Looking Forward;Encyclopedia of Social Network Analysis and Mining;2018
3. Statistical Research in Networks: Looking Forward;Encyclopedia of Social Network Analysis and Mining;2017
4. Statistical Research in Networks – Looking Forward;Encyclopedia of Social Network Analysis and Mining;2014
5. A survey of network flow applications;Journal of Network and Computer Applications;2013-03