Intention Understanding in Human–Robot Interaction Based on Visual-NLP Semantics

Published: 2021-02-02
Volume: 14
ISSN: 1662-5218
Container-title: Frontiers in Neurorobotics
Short-container-title: Front. Neurorobot.

Author:

Li Zhihao, Mu Yishan, Sun Zhenglong, Song Sifan, Su Jionglong, Zhang Jiaming

Abstract

With the rapid development of robotics and AI technology in recent years, human–robot interaction has advanced greatly and has begun to have practical social impact. Verbal commands are one of the most direct and frequently used means of human–robot interaction. Currently, such technology enables robots to execute pre-defined tasks based on simple, direct, and explicit language instructions, e.g., certain keywords must be used and detected. However, that is not the natural way for humans to communicate. In this paper, we propose a novel task-based framework that enables the robot to comprehend human intentions using visual semantic information, so that the robot can satisfy human intentions based on natural language instructions (three types in total, namely clear, vague, and feeling, are defined and tested). The proposed framework includes a language semantics module to extract the keywords regardless of how explicit the command instruction is, a visual object recognition module to identify the objects in front of the robot, and a similarity computation algorithm to infer the intention based on the given task. The task is then translated into commands for the robot accordingly. Experiments are performed and validated on a humanoid robot with a defined task: to pick the desired item out of multiple objects on the table and hand it over to the desired user among multiple human participants. The results show that our algorithm can handle different types of instructions, even with unseen sentence structures.
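
The abstract names three stages: keyword extraction, visual object recognition, and a similarity computation that infers the intended item. The sketch below illustrates only the general idea of the last stage, and it is not the authors' implementation. The feature vectors, the vocabulary, and the function names (`extract_keywords`, `infer_target`) are all hypothetical, and cosine similarity is assumed here as the similarity measure because the abstract does not specify one.

```python
import math

# Hypothetical toy feature vectors, invented for illustration only.
# Dimensions: [edible, drinkable, tool-like]
TOY_VECTORS = {
    "thirsty": [0.1, 0.9, 0.0],
    "hungry": [0.9, 0.2, 0.0],
    "cut": [0.0, 0.0, 1.0],
    "bottle": [0.0, 1.0, 0.1],
    "apple": [1.0, 0.1, 0.0],
    "scissors": [0.0, 0.0, 1.0],
}


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def extract_keywords(instruction):
    """Stand-in for a language module: keep only words found in the toy vocabulary."""
    words = instruction.lower().replace(",", " ").replace(".", " ").split()
    return [w for w in words if w in TOY_VECTORS]


def infer_target(instruction, detected_objects):
    """Return the detected object label most similar to any instruction keyword.

    detected_objects stands in for the output of an object recognition module.
    Returns None when no keyword or no known object is available.
    """
    keywords = extract_keywords(instruction)
    candidates = [o for o in detected_objects if o in TOY_VECTORS]
    if not keywords or not candidates:
        return None

    def score(obj):
        return max(cosine(TOY_VECTORS[k], TOY_VECTORS[obj]) for k in keywords)

    return max(candidates, key=score)


if __name__ == "__main__":
    on_table = ["apple", "bottle", "scissors"]
    print(infer_target("Please hand me the apple.", on_table))  # explicit request
    print(infer_target("I am so thirsty", on_table))            # feeling-style request
```

In this toy setup an explicit request matches its object label directly, while a feeling-style request such as "I am so thirsty" is resolved through vector similarity. A real system would presumably replace the hand-made vectors with learned word embeddings and the label list with detector output.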

Funder

Shenzhen Municipal Science and Technology Innovation Council

Publisher

Frontiers Media SA

Subject

Artificial Intelligence, Biomedical Engineering

References: 27 articles.

1. Augmented robotics dialog system for enhancing human-robot interaction;Alonso-Martín;Sensors,2015

2. What to do and how to do it: translating natural language directives into temporal and dynamic logic representation for goal management and action execution;Dzifcak,2009

3. Exploiting deep semantics and compositionality of natural language for human-robot-interaction;Eppe,2016

4. 3d human gesture capturing and recognition by the immu-based data glove;Fang;Neurocomputing,2018

5. Skill learning for human-robot interaction using wearable device;Fang;Tsinghua Sci. Technol,2019

Cited by 14 articles.

1. HOTSPOT: An ad hoc teamwork platform for mixed human-robot teams;PLOS ONE;2024-06-28

2. Multimodal Attention-Based Instruction-Following Part-Level Affordance Grounding;Applied Sciences;2024-05-29

3. PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning;Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction;2024-03-11

4. Synergizing Natural Language Towards Enhanced Shared Autonomy;Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction;2024-03-11

5. Voice Command Recognition for Explicit Intent Elicitation in Collaborative Object Transportation Tasks: a ROS-based Implementation;Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction;2024-03-11