Author:
Nakadai Kazuhiro, Hidai Ken-ichi, Okuno Hiroshi G., Mizoguchi Hiroshi, Kitano Hiroaki
Abstract
This paper addresses real-time multiple-speaker tracking, which is essential for robot perception and human-robot social interaction. The difficulty lies in handling a mixture of sounds, occlusion (some speakers are hidden), and real-time processing. Our approach consists of three components: (1) extraction of each speaker's direction using interaural phase difference and interaural intensity difference; (2) resolution of each speaker's direction by multimodal integration of audition, vision, and motion, canceling the inevitable motor noise during motion, even when a speaker is unseen or silent; and (3) a distributed implementation on three PCs connected by a TCP/IP network to attain real-time processing. As a result, we attain robust real-time speaker tracking with a delay of 200 ms in a non-anechoic room, even when multiple speakers are present and the tracked person is visually occluded. In addition, the feasibility of social interaction is demonstrated by applying our technique to a receptionist robot and a companion robot at a party.
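As a rough illustration of component (1), the following is a minimal sketch, not the authors' implementation, of estimating a source direction from the interaural phase difference between two microphone channels. It assumes a free-field plane-wave model, a hypothetical microphone spacing (mic_distance), and a single dominant source; the actual system also exploits interaural intensity difference and the robot head's geometry.

```python
import numpy as np

def estimate_azimuth(left, right, fs=16000, mic_distance=0.18, c=343.0):
    """Estimate a single-source azimuth (radians) from one stereo frame.

    Sketch only: uses interaural phase difference (IPD) with a simple
    plane-wave model; mic_distance and thresholds are assumed values.
    """
    n = len(left)
    window = np.hanning(n)
    L = np.fft.rfft(left * window)
    R = np.fft.rfft(right * window)
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)

    # Interaural phase difference per frequency bin.
    ipd = np.angle(L * np.conj(R))

    # Keep bins below the spatial-aliasing limit and with enough energy
    # for the phase estimate to be meaningful.
    valid = (freqs > 100) & (freqs < c / (2 * mic_distance))
    valid &= (np.abs(L) * np.abs(R)) > 1e-6
    if not np.any(valid):
        return None

    # Plane-wave model: ipd = 2*pi*f*d*sin(theta)/c, solved for theta,
    # with a median over bins to suppress outliers.
    sin_theta = ipd[valid] * c / (2 * np.pi * freqs[valid] * mic_distance)
    sin_theta = np.clip(sin_theta, -1.0, 1.0)
    return float(np.median(np.arcsin(sin_theta)))
```

In practice such a frame-level estimate would be smoothed over time and fused with visual face localization, as the abstract's multimodal integration step (2) describes.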
Publisher
Fuji Technology Press Ltd.
Subject
Electrical and Electronic Engineering, General Computer Science
Cited by
8 articles.