Using the forest to see the trees-Reference-Cited by-同舟云学术

Using the forest to see the trees

Published:2010-03 Issue:3 Volume:53 Page:107-114
ISSN:0001-0782
Container-title:Communications of the ACM
language:en
Short-container-title:Commun. ACM

Author:

Torralba A.¹,Murphy K. P.²,Freeman W. T.¹

Affiliation:

1. Massachusetts Institute of Technology, Cambridge, MA

2. University of British Columbia, Vancouver, Canada

Abstract

Recognizing objects in images is an active area of research in computer vision. In the last two decades, there has been much progress and there are already object recognition systems operating in commercial products. However, most of the algorithms for detecting objects perform an exhaustive search across all locations and scales in the image comparing local image regions with an object model. That approach ignores the semantic structure of scenes and tries to solve the recognition problem by brute force. In the real world, objects tend to covary with other objects, providing a rich collection of contextual associations. These contextual associations can be used to reduce the search space by looking only in places in which the object is expected to be; this also increases performance, by rejecting patterns that look like the target but appear in unlikely places. Most modeling attempts so far have defined the context of an object in terms of other previously recognized objects. The drawback of this approach is that inferring the context becomes as difficult as detecting each object. An alternative view of context relies on using the entire scene information holistically. This approach is algorithmically attractive since it dispenses with the need for a prior step of individual object recognition. In this paper, we use a probabilistic framework for encoding the relationships between context and object properties and we show how an integrated system provides improved performance. We view this as a significant step toward general purpose machine vision systems.

Funder

NGA

Office of Naval Research

Division of Information and Intelligent Systems

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/1666420.1666446

Reference20 articles.

1. A Bayesian Hierarchical Model for Learning Natural Scene Categories

2. Pyramid-based texture analysis/synthesis

3. Geometric context from a single image

4. Hierarchical Mixtures of Experts and the EM Algorithm

Cited by 65 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A face retrieval technique combining large models and artificial neural networks;Concurrency and Computation: Practice and Experience;2024-03-25

2. Driving Environment Inference from POI of Navigation Map: Fuzzy Logic and Machine Learning Approaches;Sensors;2023-11-13

3. Context understanding in computer vision: A survey;Computer Vision and Image Understanding;2023-03

4. Research Review of Dispensing Based on Machine Vision;2022 4th International Conference on Applied Machine Learning (ICAML);2022-07

5. From Node to Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection;2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2022-01