Neuro-SERKET: Development of Integrative Cognitive System Through the Composition of Deep Probabilistic Generative Models
-
Published:2020-01-22
Issue:1
Volume:38
Page:23-48
-
ISSN:0288-3635
-
Container-title:New Generation Computing
-
language:en
-
Short-container-title:New Gener. Comput.
Author:
Taniguchi Tadahiro,Nakamura Tomoaki,Suzuki Masahiro,Kuniyasu Ryo,Hayashi Kaede,Taniguchi Akira,Horii Takato,Nagai Takayuki
Abstract
AbstractThis paper describes a framework for the development of an integrative cognitive system based on probabilistic generative models (PGMs) called Neuro-SERKET. Neuro-SERKET is an extension of SERKET, which can compose elemental PGMs developed in a distributed manner and provide a scheme that allows the composed PGMs to learn throughout the system in an unsupervised way. In addition to the head-to-tail connection supported by SERKET, Neuro-SERKET supports tail-to-tail and head-to-head connections, as well as neural network-based modules, i.e., deep generative models. As an example of a Neuro-SERKET application, an integrative model was developed by composing a variational autoencoder (VAE), a Gaussian mixture model (GMM), latent Dirichlet allocation (LDA), and automatic speech recognition (ASR). The model is called VAE + GMM + LDA + ASR. The performance of VAE + GMM + LDA + ASR and the validity of Neuro-SERKET were demonstrated through a multimodal categorization task using image data and a speech signal of numerical digits.
Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications,Hardware and Architecture,Theoretical Computer Science,Software
Reference50 articles.
1. Taniguchi, T., Ugur, E., Hoffmann, M., Jamone, L., Nagai, T., Rosman, B., Matsuka, T., Iwahashi, N., Oztop, E., Piater, J. et al.: Symbol emergence in cognitive developmental systems: a survey. IEEE Trans. Cogn. Dev. Syst. (2018) 2. Nakamura, T., Nagai, T., Iwahashi, N.: Multimodal object categorization by a robot. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 2415–2420 (2007) 3. Shun, N., Tetsuya, O., Jun, T., Kazunori, K., Hiroshi, O.G.: Predicting object dynamics from visual images through active sensing experiences. Adv. Robot. 22(5), 527 (2008) 4. Ogata, T., Nishide, S., Kozima, H., Komatani, K., Okuno, H.: Inter-modality mapping in robot with recurrent neural network. Pattern Recogn. Lett. 31(12), 1560 (2010) 5. Mangin, O., Filliat, D., Ten Bosch, L., Oudeyer, P.Y.: MCA-NMF: multimodal concept acquisition with non-negative matrix factorization. PLoS One 10, 10, e0140732 (2015)
Cited by
40 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|