Physical scene understanding-Reference-Cited by-同舟云学术

Physical scene understanding

Published:2024-02-09 Issue:1 Volume:45 Page:156-164
ISSN:0738-4602
Container-title:AI Magazine
language:en
Short-container-title:AI Magazine

Author:

Wu Jiajun¹^ORCID

Affiliation:

1. Stanford University Stanford California USA

Abstract

AbstractCurrent AI systems still fail to match the flexibility, robustness, and generalizability of human intelligence: how even a young child can manipulate objects to achieve goals of their own invention or in cooperation, or can learn the essentials of a complex new task within minutes. We need AI with such embodied intelligence: transforming raw sensory inputs to rapidly build a rich understanding of the world for seeing, finding, and constructing things, achieving goals, and communicating with others. This problem of physical scene understanding is challenging because it requires a holistic interpretation of scenes, objects, and humans, including their geometry, physics, functionality, semantics, and modes of interaction, building upon studies across vision, learning, graphics, robotics, and AI. My research aims to address this problem by integrating bottom‐up recognition models, deep networks, and inference algorithms with top‐down structured graphical models, simulation engines, and probabilistic programs.

Funder

Stanford University

National Science Foundation

Office of Naval Research

Air Force Office of Scientific Research

Massachusetts Institute of Technology

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/aaai.12148

Reference70 articles.

1. Ajay Anurag MariaBauza JiajunWu NimaFazeli Joshua B.Tenenbaum AlbertoRodriguez andLeslieP Kaelbling.2019. “Combining Physical Simulators and Object‐Based Networks for Control.” InIEEE International Conference on Robotics and Automation (ICRA).

2. Ajay Anurag JiajunWu NimaFazeli MariaBauza Leslie P.Kaelbling Joshua B.Tenenbaum andAlbertoRodriguez.2018. “Augmenting Physical Simulators with Stochastic Neural Networks: Case Study of Planar Pushing and Bouncing.” InIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

3. Simulation as an engine of physical scene understanding

4. Chan Eric R. MarcoMonteiro PetrKellnhofer JiajunWu andGordonWetzstein.2021. “pi‐GAN: Periodic Implicit Generative Adversarial Networks for 3D‐Aware Image Synthesis.” InIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

5. Chen Zhenfang JiayuanMao JiajunWu Kwan‐YeeKenneth Wong Joshua B.Tenenbaum andChuangGan.2021. “Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning.” InInternational Conference on Learning Representations (ICLR).