Efficient grounding of abstract spatial concepts for natural language interaction with robot platforms-Reference-Cited by-同舟云学术

Efficient grounding of abstract spatial concepts for natural language interaction with robot platforms

Published:2018-06-25 Issue:10 Volume:37 Page:1269-1299
ISSN:0278-3649
Container-title:The International Journal of Robotics Research
language:en
Short-container-title:The International Journal of Robotics Research

Author:

Paul Rohan¹,Arkin Jacob²,Aksaray Derya¹,Roy Nicholas¹,Howard Thomas M.²

Affiliation:

1. Computer Science and Artificial Intelligence Laboratory (CSAIL), Massachusetts Institute of Technology, Cambridge, MA, USA

2. Robotics and Artificial Intelligence Laboratory (RAIL), Electrical and Computer Engineering, University of Rochester, Rochester, NY, USA

Abstract

Our goal is to develop models that allow a robot to efficiently understand or “ground” natural language instructions in the context of its world representation. Contemporary approaches estimate correspondences between language instructions and possible groundings such as objects, regions, and goals for actions that the robot should execute. However, these approaches typically reason in relatively small domains and do not model abstract spatial concepts such as as “rows,” “columns,” or “groups” of objects and, hence, are unable to interpret an instruction such as “pick up the middle block in the row of five blocks.” In this paper, we introduce two new models for efficient natural language understanding of robot instructions. The first model, which we call the adaptive distributed correspondence graph (ADCG), is a probabilistic model for interpreting abstract concepts that require hierarchical reasoning over constituent concrete entities as well as notions of cardinality and ordinality. Abstract grounding variables form a Markov boundary over concrete groundings, effectively de-correlating them from the remaining variables in the graph. This structure reduces the complexity of model training and inference. Inference in the model is posed as an approximate search procedure that orders factor computation such that the estimated probable concrete groundings focus the search for abstract concepts towards likely hypothesis, pruning away improbable portions of the exponentially large space of abstractions. Further, we address the issue of scalability to complex domains and introduce a hierarchical extension to a second model termed the hierarchical adaptive distributed correspondence graph (HADCG). The model utilizes the abstractions in the ADCG but infers a coarse symbolic structure from the utterance and the environment model and then performs fine-grained inference over the reduced graphical model, further improving the efficiency of inference. Empirical evaluation demonstrates accurate grounding of abstract concepts embedded in complex natural language instructions commanding a robotic torso and a mobile robot. Further, the proposed approximate inference method allows significant efficiency gains compared with the baseline, with minimal trade-off in accuracy.

Funder

Robotics Con- sortium of the U.S Army Research Laboratory under the Collaborative Technology Alliance Program

National Science Foundation

Publisher

SAGE Publications

Subject

Applied Mathematics,Artificial Intelligence,Electrical and Electronic Engineering,Mechanical Engineering,Modeling and Simulation,Software

Link

http://journals.sagepub.com/doi/pdf/10.1177/0278364918777627

Reference51 articles.

1. Alignment-Based Compositional Semantics for Instruction Following

2. Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions

Cited by 33 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Advances in Flexible Robotic Manipulator Systems — Part II: Planning, Control, Applications, and Perspectives;IEEE/ASME Transactions on Mechatronics;2024-06

2. Multimodal Attention-Based Instruction-Following Part-Level Affordance Grounding;Applied Sciences;2024-05-29

3. Statler: State-Maintaining Language Models for Embodied Reasoning;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

4. Embodied intelligence in manufacturing: leveraging large language models for autonomous industrial robotics;Journal of Intelligent Manufacturing;2024-01-09

5. State-of-the-Art Elderly Service Robot: Environmental Perception, Compliance Control, Intention Recognition, and Research Challenges;IEEE Systems, Man, and Cybernetics Magazine;2024-01