Affiliation:
1. School of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield AL10 9AB, UK
Abstract
The information bottleneck (IB) framework formalises the essential requirement for efficient information processing systems to achieve an optimal balance between the complexity of their representation and the amount of information extracted about relevant features. However, since the representation complexity affordable by real-world systems may vary in time, the processing cost of updating the representations should also be taken into account. A crucial question is thus the extent to which adaptive systems can leverage the information content of already existing IB-optimal representations for producing new ones, which target the same relevant features but at a different granularity. We investigate the information-theoretic optimal limits of this process by studying and extending, within the IB framework, the notion of successive refinement, which describes the ideal situation where no information needs to be discarded for adapting an IB-optimal representation’s granularity. Thanks in particular to a new geometric characterisation, we analytically derive the successive refinability of some specific IB problems (for binary variables, for jointly Gaussian variables, and for the relevancy variable being a deterministic function of the source variable), and provide a linear-programming-based tool to numerically investigate, in the discrete case, the successive refinement of the IB. We then soften this notion into a quantification of the loss of information optimality induced by several-stage processing through an existing measure of unique information. Simple numerical experiments suggest that this quantity is typically low, though not entirely negligible. These results could have important implications for (i) the structure and efficiency of incremental learning in biological and artificial agents, (ii) the comparison of IB-optimal observation channels in statistical decision problems, and (iii) the IB theory of deep neural networks.
Subject
General Physics and Astronomy
Reference78 articles.
1. Tishby, N., Pereira, F., and Bialek, W. (1999, January 22–24). The Information Bottleneck Method. Proceedings of the 37th Allerton Conference on Communication, Control and Computation, Monticello, IL, USA.
2. Gilad-Bachrach, R., Navot, A., and Tishby, N. (2003). Learning Theory and Kernel Machines, Springer. Lecture Notes in Computer Science.
3. Bialek, W., De Ruyter Van Steveninck, R.R., and Tishby, N. (2006, January 9–14). Efficient representation as a design principle for neural coding and computation. Proceedings of the 2006 IEEE International Symposium on Information Theory, Seattle, WA, USA.
4. Past-future information bottleneck in dynamical systems;Creutzig;Phys. Rev. E,2009
5. Amir, N., Tiomkin, S., and Tishby, N. (2015, January 15–18). Past-future Information Bottleneck for linear feedback systems. Proceedings of the 2015 54th IEEE Conference on Decision and Control (CDC), Osaka, Japan.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Semantic Compression with Information Lattice Learning;2024 IEEE International Symposium on Information Theory Workshops (ISIT-W);2024-07-07