Reviewing Evolution of Learning Functions and Semantic Information Measures for Understanding Deep Learning

Author:

Lu Chenguang 1,2 (ORCID)

Affiliation:

1. Intelligence Engineering and Mathematics Institute, Liaoning Technical University, Fuxin 123000, China

2. School of Computer Engineering and Applied Mathematics, Changsha University, Changsha 410022, China

Abstract

A new trend in deep learning, represented by Mutual Information Neural Estimation (MINE) and Information Noise-Contrastive Estimation (InfoNCE), is emerging. In this trend, similarity functions and Estimated Mutual Information (EMI) are used as learning and objective functions. Coincidentally, EMI is essentially the same as the Semantic Mutual Information (SeMI) measure proposed by the author 30 years ago. This paper first reviews the evolutionary histories of semantic information measures and learning functions. It then briefly introduces the author's semantic information G theory with the rate-fidelity function R(G) (where G denotes SeMI and R(G) extends the rate-distortion function R(D)) and its applications to multi-label learning, maximum Mutual Information (MI) classification, and mixture models. Next, it discusses how we should understand the relationships between SeMI and Shannon's MI, two generalized entropies (fuzzy entropy and coverage entropy), Autoencoders, Gibbs distributions, and partition functions from the perspective of the R(G) function or the G theory. An important conclusion is that mixture models and Restricted Boltzmann Machines converge because SeMI is maximized while Shannon's MI is minimized, making the information efficiency G/R approach 1. A potential opportunity is to simplify deep learning by using Gaussian channel mixture models to pre-train the latent layers of deep neural networks without computing gradients. The paper also discusses how the SeMI measure can be used as the reward function (reflecting purposiveness) for reinforcement learning. The G theory helps interpret deep learning but is far from sufficient by itself; combining semantic information theory and deep learning will accelerate the development of both.
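
For readers unfamiliar with the two estimators named in the abstract, the sketch below states the standard bounds they optimize, using the notation of refs. 1 and 2 below (the critic $T_\theta$, the similarity function $f_k$, and the sample count $N$ come from those papers, not from this record). MINE maximizes the Donsker-Varadhan lower bound on Shannon's MI over a neural-network critic:

$$ I(X;Z) \;\ge\; \sup_{\theta}\, \mathbb{E}_{P_{XZ}}\!\left[T_\theta(x,z)\right] \;-\; \log \mathbb{E}_{P_X\otimes P_Z}\!\left[e^{T_\theta(x,z)}\right], $$

while InfoNCE minimizes a contrastive loss over one positive pair and $N-1$ negative samples, which likewise maximizes a lower bound on MI:

$$ \mathcal{L}_N = -\,\mathbb{E}\!\left[\log \frac{f_k(x_{t+k}, c_t)}{\sum_{x_j\in X} f_k(x_j, c_t)}\right], \qquad I(x_{t+k}; c_t) \;\ge\; \log N - \mathcal{L}_N. $$

For comparison, the SeMI measure as it appears in the author's earlier publications (a sketch only; consult the paper for the exact definition) has the same log-ratio form, with a fuzzy truth function $T(\theta_j\mid x)$ in place of the learned critic:

$$ I(X;\Theta) = \sum_j \sum_i P(x_i, y_j)\,\log \frac{T(\theta_j\mid x_i)}{T(\theta_j)}, \qquad T(\theta_j) = \sum_i P(x_i)\,T(\theta_j\mid x_i), $$

which is why the abstract can say that EMI is essentially the same as SeMI.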

Publisher

MDPI AG

Subject

General Physics and Astronomy

References (84 articles; first 5 shown)

1. Belghazi, M.I., Baratin, A., Rajeswar, S., Ozair, S., Bengio, Y., Courville, A., and Hjelm, R.D. (2018, July 10–15). MINE: Mutual information neural estimation. Proceedings of the 35th International Conference on Machine Learning, Stockholm, Sweden.

2. Oord, A.V.D., Li, Y., and Vinyals, O. (2018). Representation Learning with Contrastive Predictive Coding. arXiv.

3. Shannon, C.E. (1948). A mathematical theory of communication. Bell Syst. Tech. J., 27, 379–423, 623–656.

4. Hjelm, R.D., Fedorov, A., Lavoie-Marchildon, S., Grewal, K., Trischler, A., and Bengio, Y. (2018). Learning Deep Representations by Mutual Information Estimation and Maximization. arXiv.

5. Bachman, P., Hjelm, R.D., and Buchwalter, W. (2019). Learning Representations by Maximizing Mutual Information Across Views. arXiv.

Cited by 1 article.
