Author:
Shibata Hiroki, ,Ishikawa Hiroshi,Takama Yasufumi
Abstract
This paper proposes a method to estimate the posterior distribution of a Boltzmann machine. Due to high feature extraction ability, a Boltzmann machine is often used for both of supervised and unsupervised learning. It is expected to be suitable for multimodal data because of its bi-directional connection property. However, it needs a sampling method to estimate the posterior distribution, which becomes a problem during an inference period because of the computation time and instability. Therefore, it is usually converted to feedforward neural networks, which means to lose its bi-directional property. To deal with these problems, this paper proposes a method to estimate the posterior distribution of a Boltzmann machine fast and stably without converting it to feedforward neural networks. The key idea of the proposed method is to estimate the posterior distribution using a simulated annealing on non-uniform temperature distribution. The advantage of the proposed method against Gibbs sampling and conventional simulated annealing is shown through experiments with artificial dataset and MNIST. Furthermore, this paper also gives the mathematical analysis of Boltzmann machine’s behaviour with regard to temperature distribution.
Publisher
Fuji Technology Press Ltd.
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Human-Computer Interaction
Reference18 articles.
1. Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” Proc. of The IEEE, Vol.86, No.11, pp. 2278-2324, 1998.
2. T. Mikolov, M. Karafiát, L. Burget, J. Cernoký, and S. Khudanpur, “Recurrent neural network based language model,” Proc. of Interspeech, 2010.
3. D. H. Ackley, et. al., “A learning algorithm for Boltzmann machines,” Cognitive Science, Vol.9, No.1, pp. 147-169, 1985.
4. R. Salakhutdinov and G. Hinton, “Deep Boltzmann machines,” Proc. of AI and Statistics, Vol.5, pp. 448-455, 2009.
5. N. T. Kuong, E. Uchino, and N. Suetake, “IVUS tissue characterization of coronary plaque by classification restricted Boltzmann machine,” J. Adv. Comput. Intell. Intell. Inform., Vol.21, No.1, pp. 67-73, 2017.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Application of Exchanging Monte Carlo Method to Sample Deep Boltzmann Machines;2020 International Conference on Technologies and Applications of Artificial Intelligence (TAAI);2020-12
2. Improvements on Probability Model for Sightseeing Route Recommendation Method Employing Generalized Formulation in terms of Edges;2019 International Conference on Technologies and Applications of Artificial Intelligence (TAAI);2019-11