Learning to Represent Spatial Transformations with Factored Higher-Order Boltzmann Machines-Reference-Cited by-同舟云学术

Learning to Represent Spatial Transformations with Factored Higher-Order Boltzmann Machines

Published:2010-06 Issue:6 Volume:22 Page:1473-1492
ISSN:0899-7667
Container-title:Neural Computation
language:en
Short-container-title:Neural Computation

Author:

Memisevic Roland¹,Hinton Geoffrey E.¹

Affiliation:

1. Department of Computer Science, University of Toronto, Toronto M5S 3G4, Canada

Abstract

To allow the hidden units of a restricted Boltzmann machine to model the transformation between two successive images, Memisevic and Hinton ( 2007 ) introduced three-way multiplicative interactions that use the intensity of a pixel in the first image as a multiplicative gain on a learned, symmetric weight between a pixel in the second image and a hidden unit. This creates cubically many parameters, which form a three-dimensional interaction tensor. We describe a low-rank approximation to this interaction tensor that uses a sum of factors, each of which is a three-way outer product. This approximation allows efficient learning of transformations between larger image patches. Since each factor can be viewed as an image filter, the model as a whole learns optimal filter pairs for efficiently representing transformations. We demonstrate the learning of optimal filter pairs from various synthetic and real image sequences. We also show how learning about image transformations allows the model to perform a simple visual analogy task, and we show how a completely unsupervised network trained on transformations perceives multiple motions of transparent dot patterns in the same way as humans.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/neco.2010.01-09-953

Reference18 articles.

1. The “independent components” of natural scenes are edge filters

2. Learning Invariance from Transformation Sequences

3. Bilinear Sparse Coding for Invariant Vision

4. Training Products of Experts by Minimizing Contrastive Divergence

Cited by 89 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Energy-Based Models;Deep Generative Modeling;2024

2. A Literature Review on Image Preprocessing Methods Used in Deep Learning Studies Using Tomosynthesis Images;European Journal of Science and Technology;2023-07-06

3. Learning Rotation-Equivariant Features for Visual Correspondence;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06

4. A scoping review on deep learning for next-generation RNA-Seq. data analysis;Functional & Integrative Genomics;2023-04-21

5. In-memory factorization of holographic perceptual representations;Nature Nanotechnology;2023-03-30