Decision-Theoretic Saliency: Computational Principles, Biological Plausibility, and Implications for Neurophysiology and Psychophysics-Reference-Cited by-同舟云学术

Decision-Theoretic Saliency: Computational Principles, Biological Plausibility, and Implications for Neurophysiology and Psychophysics

Published:2009-01 Issue:1 Volume:21 Page:239-271
ISSN:0899-7667
Container-title:Neural Computation
language:en
Short-container-title:Neural Computation

Author:

Gao Dashan¹,Vasconcelos Nuno¹

Affiliation:

1. Statistical Visual Computing Laboratory, University of California San Diego, La Jolla, CA 92093, U.S.A.

Abstract

A decision-theoretic formulation of visual saliency, first proposed for top-down processing (object recognition) (Gao & Vasconcelos, 2005a ), is extended to the problem of bottom-up saliency. Under this formulation, optimality is defined in the minimum probability of error sense, under a constraint of computational parsimony. The saliency of the visual features at a given location of the visual field is defined as the power of those features to discriminate between the stimulus at the location and a null hypothesis. For bottom-up saliency, this is the set of visual features that surround the location under consideration. Discrimination is defined in an information-theoretic sense and the optimal saliency detector derived for a class of stimuli that complies with known statistical properties of natural images. It is shown that under the assumption that saliency is driven by linear filtering, the optimal detector consists of what is usually referred to as the standard architecture of V1: a cascade of linear filtering, divisive normalization, rectification, and spatial pooling. The optimal detector is also shown to replicate the fundamental properties of the psychophysics of saliency: stimulus pop-out, saliency asymmetries for stimulus presence versus absence, disregard of feature conjunctions, and Weber's law. Finally, it is shown that the optimal saliency architecture can be applied to the solution of generic inference problems. In particular, for the class of stimuli studied, it performs the three fundamental operations of statistical inference: assessment of probabilities, implementation of Bayes decision rule, and feature selection.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/neco.2009.11-06-391

Reference84 articles.

1. Spatiotemporal energy models for the perception of motion

2. Stimulus Specific Responses from Beyond the Classical Receptive Field: Neurophysiological Mechanisms for Local-Global Comparisons in Visual Neurons

3. Some informational aspects of visual perception.

Cited by 61 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep Learning based Abnormal Event Detection in Pedestrian Pathways;2024 International Conference on Intelligent and Innovative Technologies in Computing, Electrical and Electronics (IITCEE);2024-01-24

2. Optical Flow-Based Video Anomaly Detection Approaches;Cognitive Intelligence and Robotics;2024

3. Audio–visual collaborative representation learning for Dynamic Saliency Prediction;Knowledge-Based Systems;2022-11

4. Cascaded normalizations for spatial integration in the primary visual cortex of primates;Cell Reports;2022-08

5. Handwritten Annotation Spotting in Printed Documents Using Top-Down Visual Saliency Models;ACM Transactions on Asian and Low-Resource Language Information Processing;2022-05-31