Affiliation:
1. Department of Physics, Princeton University, Princeton, NJ 08544, U.S.A.
2. Department of Physics, Northwestern University, Evanston, IL 60208, U.S.A.
Abstract
Lossy compression and clustering fundamentally involve a decision about which features are relevant and which are not. The information bottleneck method (IB) of Tishby, Pereira, and Bialek (1999) formalized this notion as an information-theoretic optimization problem and proposed an optimal trade-off between throwing away as many bits as possible and selectively keeping those that are most important. In the IB, compression is measured by mutual information. Here, we introduce an alternative formulation that replaces mutual information with entropy, which we call the deterministic information bottleneck (DIB) and which we argue better captures this notion of compression. As suggested by its name, the solution to the DIB problem turns out to be a deterministic encoder, or hard clustering, as opposed to the stochastic encoder, or soft clustering, that is optimal under the IB. We compare the IB and DIB on synthetic data, showing that the two perform similarly in terms of the IB cost function, but that the DIB significantly outperforms the IB in terms of the DIB cost function. We also empirically find that the DIB offers a considerable gain in computational efficiency over the IB across a range of convergence parameters. Our derivation of the DIB also suggests a method for continuously interpolating between the soft clustering of the IB and the hard clustering of the DIB.
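For concreteness, the two objectives contrasted in the abstract are the IB cost L_IB = I(X;T) - beta * I(T;Y) and the DIB cost L_DIB = H(T) - beta * I(T;Y), where T is the compressed representation of X and beta sets the compression/relevance trade-off. The following Python sketch is not the authors' code; it simply evaluates both costs for a fixed soft encoder q(t|x), with the toy inputs p_xy, q_t_given_x, and beta chosen purely for illustration.

```python
import numpy as np

def entropy(p):
    """Shannon entropy in bits of a probability vector."""
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

def mutual_information(p_joint):
    """I(A;B) = H(A) + H(B) - H(A,B) for a joint distribution matrix."""
    p_a = p_joint.sum(axis=1)
    p_b = p_joint.sum(axis=0)
    return entropy(p_a) + entropy(p_b) - entropy(p_joint.ravel())

def ib_and_dib_costs(p_xy, q_t_given_x, beta):
    """Evaluate the IB cost I(X;T) - beta*I(T;Y) and the DIB cost
    H(T) - beta*I(T;Y) for a given encoder q(t|x).

    p_xy:        joint distribution over (x, y), shape (|X|, |Y|)
    q_t_given_x: encoder, shape (|X|, |T|), rows sum to 1
    """
    p_x = p_xy.sum(axis=1)                 # p(x)
    p_xt = p_x[:, None] * q_t_given_x      # p(x, t) = p(x) q(t|x)
    p_t = p_xt.sum(axis=0)                 # p(t)
    p_ty = q_t_given_x.T @ p_xy            # p(t, y) = sum_x q(t|x) p(x, y)
    i_xt = mutual_information(p_xt)
    i_ty = mutual_information(p_ty)
    h_t = entropy(p_t)
    return i_xt - beta * i_ty, h_t - beta * i_ty

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    p_xy = rng.random((4, 3)); p_xy /= p_xy.sum()                 # toy joint p(x, y)
    q = rng.random((4, 2)); q /= q.sum(axis=1, keepdims=True)     # random soft encoder
    print(ib_and_dib_costs(p_xy, q, beta=2.0))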
Subject
Cognitive Neuroscience, Arts and Humanities (miscellaneous)
Cited by
69 articles.