Few-shot re-identification of the speaker by social robots-Reference-Cited by-同舟云学术

Few-shot re-identification of the speaker by social robots

Published:2022-11-07 Issue:2 Volume:47 Page:181-192
ISSN:0929-5593
Container-title:Autonomous Robots
language:en
Short-container-title:Auton Robot

Author:

Foggia Pasquale,Greco Antonio,Roberto Antonio^ORCID,Saggese Alessia,Vento Mario

Abstract

AbstractNowadays advanced machine learning, computer vision, audio analysis and natural language understanding systems can be widely used for improving the perceptive and reasoning capabilities of the social robots. In particular, artificial intelligence algorithms for speaker re-identification make the robot aware of its interlocutor and able to personalize the conversation according to the information gathered in real-time and in the past interactions with the speaker. Anyway, this kind of application requires to train neural networks having available only a few samples for each speaker. Within this context, in this paper we propose a social robot equipped with a microphone sensor and a smart deep learning algorithm for few-shot speaker re-identification, able to run in real time over an embedded platform mounted on board of the robot. The proposed system has been experimentally evaluated over the VoxCeleb1 dataset, demonstrating a remarkable re-identification accuracy by varying the number of samples per speaker, the number of known speakers and the duration of the samples, and over the SpReW dataset, showing its robustness in real noisy environments. Finally, a quantitative evaluation of the processing time over the embedded platform proves that the processing pipeline is almost immediate, resulting in a pleasant user experience.

Funder

Università degli Studi di Salerno

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s10514-022-10073-6.pdf

Reference42 articles.

1. Breazeal, C. (2002). Designing sociable robots. MIT Press.

2. Broekens, J., Heerink, M., & Rosendal, H. (2009). Assistive social robots in elderly care: A review. Gerontechnology, 8(2), 94–103. https://doi.org/10.4017/gt.2009.08.02.002.00.

3. Burger, B., Ferrané, I., Lerasle, F., & Infantes, G. (2011). Two-handed gesture recognition and fusion with speech to command a robot. Autonomous Robots, 32(2), 129–147. https://doi.org/10.1007/s10514-011-9263-y.