Abstract
Estimation of Rényi entropy is of fundamental importance to many applications in cryptography, statistical inference, and machine learning. This paper aims to improve the existing estimators with regard to: (a) the sample size, (b) the estimator adaptiveness, and (c) the simplicity of the analyses. The contribution is a novel analysis of the generalized “birthday paradox” collision estimator. The analysis is simpler than in prior works, gives clear formulas, and strengthens existing bounds. The improved bounds are used to develop an adaptive estimation technique that outperforms previous methods, particularly in regimes of low or moderate entropy. Last but not least, to demonstrate that the developed techniques are of broader interest, a number of applications concerning theoretical and practical properties of “birthday estimators” are discussed.
Subject
General Physics and Astronomy
Reference48 articles.
1. Rényi, A. (July, January 20). On measures of information and entropy. Proceedings of the 4th Berkeley Symposium on Mathematics, Statistics and Probability, Berkeley, CA, USA.
2. Masisi, L., Nelwamondo, V., and Marwala, T. (2008, January 27–29). The use of entropy to measure structural diversity. Proceedings of the 2008 IEEE International Conference on Computational Cybernetics, Stara Lesna, Slovakia.
3. Generalised entropy indices have a long history in ecology—A comment;Community Ecol.,2005
4. Leinster, T., and Meckes, M. (2016). Maximizing Diversity in Biology and Beyond. Entropy, 18.
5. Statistical mechanics based on Renyi entropy;Lenzi;Phys. A Stat. Mech. Its Appl.,2000