Author:
Sasaki Yoko, ,Kaneyoshi Masahito,Kagami Satoshi,Mizoguchi Hiroshi,Enomoto Tadashi, ,
Abstract
This paper presents a sound identification method for a mobile robot in home and office environments. We propose a short-term sound recognition method using Pitch-Cluster-Maps (PCMs) sound database (DB) based on a Vector Quantization approach. A binarized frequency spectrum is used to generate PCMs codebook, which describes a variety of sound sources, not only voice, from short-term sound input. PCMs sound identification requires several tens of milliseconds of sound input, and is suitable for mobile robot applications in which conditions are continuously and dynamically changing. We implemented this in mobile robot audition system using a 32-channel microphone array. Robot noise reduction and sound source tracking using our proposal are applied to robot audition system, and we evaluate daily sound recognition performance for separated sound sources from a moving robot.
Publisher
Fuji Technology Press Ltd.
Subject
Electrical and Electronic Engineering,General Computer Science
Reference15 articles.
1. S. Furui, “50 years of progress in speech and speaker recognition,” In Proc. of SPECOM2005, Patras, Greece, pp. 1-9, 2005.
2. T. Matsui and K. Tanabe, “Comparative study of speaker identification methods : dplrm, svm and gmm,” IEICE Trans. on INFOMATION and SYSTEMS, Vol.89-D, No.3, pp. 1066-1073, Mar., 2006.
3. N. Roman and D. L. Wang, “Pitch-based monaural segregation of reverberant speech,” J. of Acoustics Society of America, Vol.120, No.1, pp. 458-469, Jul., 2006.
4. Y. Shao and D. L. Wang, “Model-based sequential organization in cochannel speech,” IEEE Trans. on Audio, Speech, and Language Processing, Vol.14, No.1, pp. 289-298, Jan., 2006.
5. M. Goto, “Analysis of musical audio signals. In D. L. Wang and G. J. Brown, editors, Computational Auditory Scene Analysis: Principles, Algorithms, and Applications,” Wiley-IEEE Press, pp. 251-295. 2006.
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献