Abstract
A double-sided variant of the information bottleneck method is considered. Let (X,Y) be a bivariate source characterized by a joint pmf PXY. The problem is to find two independent channels PU|X and PV|Y (setting the Markovian structure U→X→Y→V), that maximize I(U;V) subject to constraints on the relevant mutual information expressions: I(U;X) and I(V;Y). For jointly Gaussian X and Y, we show that Gaussian channels are optimal in the low-SNR regime but not for general SNR. Similarly, it is shown that for a doubly symmetric binary source, binary symmetric channels are optimal when the correlation is low and are suboptimal for high correlations. We conjecture that Z and S channels are optimal when the correlation is 1 (i.e., X=Y) and provide supporting numerical evidence. Furthermore, we present a Blahut–Arimoto type alternating maximization algorithm and demonstrate its performance for a representative setting. This problem is closely related to the domain of biclustering.
Subject
General Physics and Astronomy
Reference70 articles.
1. The information bottleneck method;Tishby;Proceedings of the 37th Annual Allerton Conference on Communication, Control and Computing,1999
2. Distributed information-theoretic clustering
3. Algorithms for Clustering Data;Jain,1988
4. Modeling Biclustering as an optimization problem using Mutual Information;Gupta;Proceedings of the International Conference on Methods and Models in Computer Science (ICM2CS),2009
5. Direct Clustering of a Data Matrix