1. Achille A, Soatto S. Emergence of invariance and disentanglement in deep representations. J Mach Learn Res. 2018;19(1):1947–80.
2. Shamir O. Learning and generalization in neural networks: a statistical physics view [J]. Adv Neural Inf Process Syst. 2010;23:1–9.
3. Wainwright MJ. High-dimensional statistics: A non-asymptotic viewpoint [M]. Cambridge University Press, 2019.
4. Tishby N, Pereira FC, Bialek W. The information bottleneck method [J]. arXiv preprint physics/0004057, 2000
5. Cover TM. Elements of information theory [M]. Wiley, 1999