Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects-Reference-Cited by-同舟云学术

Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects

Published:2016-03-05 Issue:1 Volume:30 Page:
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Bagherinezhad Hessam,Hajishirzi Hannaneh,Choi Yejin,Farhadi Ali

Abstract

Human vision greatly benefits from the information about sizes of objects. The role of size in several visual reasoning tasks has been thoroughly explored in human perception and cognition. However, the impact of the information about sizes of objects is yet to be determined in AI. We postulate that this is mainly attributed to the lack of a comprehensive repository of size information. In this paper, we introduce a method to automatically infer object sizes, leveraging visual and textual information from web. By maximizing the joint likelihood of textual and visual observations, our method learns reliable relative size estimates, with no explicit human supervision. We introduce the relative size dataset and show that our method outperforms competitive textual and visual baselines in reasoning about size comparisons.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Grounding spatial relations in text-only language models;Neural Networks;2024-02

2. Learning Visual Representations via Language-Guided Sampling;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06

3. Spatial Commonsense Reasoning for Machine Reading Comprehension;Advanced Data Mining and Applications;2023

4. Can Visual Linguistic Models become Knowledge Bases: A Prompt Learning Framework for Size Perception;2022 IEEE 21st International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC);2022-12-08