Learning Optimized Features for Hierarchical Models of Invariant Object Recognition

Published: 2003-07-01, Volume: 15, Issue: 7, Pages: 1559-1588
ISSN: 0899-7667
Container-title: Neural Computation
Language: en

Author:

Wersing Heiko¹, Körner Edgar¹

Affiliation:

1. HONDA Research Institute Europe GmbH, 63073 Offenbach/Main, Germany

Abstract

There is an ongoing debate over the capabilities of hierarchical neural feedforward architectures for performing real-world invariant object recognition. Although a variety of hierarchical models exists, appropriate supervised and unsupervised learning methods are still an issue of intense research. We propose a feedforward model for recognition that shares components like weight sharing, pooling stages, and competitive nonlinearities with earlier approaches but focuses on new methods for learning optimal feature-detecting cells in intermediate stages of the hierarchical network. We show that principles of sparse coding, which were previously mostly applied to the initial feature detection stages, can also be employed to obtain optimized intermediate complex features. We suggest a new approach to optimize the learning of sparse features under the constraints of a weight-sharing or convolutional architecture that uses pooling operations to achieve gradual invariance in the feature hierarchy. The approach explicitly enforces symmetry constraints like translation invariance on the feature set. This leads to a dimension reduction in the search space of optimal features and allows determining more efficiently the basis representatives, which achieve a sparse decomposition of the input. We analyze the quality of the learned feature representation by investigating the recognition performance of the resulting hierarchical network on object and face databases. We show that a hierarchy with features learned on a single object data set can also be applied to face recognition without parameter changes and is competitive with other recent machine learning recognition approaches. To investigate the effect of the interplay between sparse coding and processing nonlinearities, we also consider alternative feedforward pooling nonlinearities such as presynaptic maximum selection and sum-of-squares integration. 
The comparison shows that a combination of strong competitive nonlinearities with sparse coding offers the best recognition performance in the difficult scenario of segmentation-free recognition in cluttered surround. We demonstrate that for both learning and recognition, a precise segmentation of the objects is not necessary.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience, Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/089976603321891800

References: 44 articles.

1. A Neural Network Architecture for Visual Selection

2. Single Units and Sensation: A Neuron Doctrine for Perceptual Psychology?

3. The Twelfth Bartlett Memorial Lecture: The Role of Single Neurons in the Psychology of Perception

4. The “independent components” of natural scenes are edge filters

5. Learning the invariance properties of complex cells from their responses to natural stimuli

Cited by 108 articles.

1. The impact of artificial intelligence methods on drug design;Cheminformatics, QSAR and Machine Learning Applications for Novel Drug Development;2023

2. Brain-inspired models for visual object recognition: an overview;Artificial Intelligence Review;2022-01-10

3. A brain-inspired multibranch parallel interactive vision mechanism for advanced driver assistance systems;International Journal of Sensor Networks;2022

4. Hierarchical Models of the Visual System;Encyclopedia of Computational Neuroscience;2022

5. A controlled investigation of behaviorally-cloned deep neural network behaviors in an autonomous steering task;Robotics and Autonomous Systems;2021-08