1. Kyubyong Park. 2018. KSS Dataset: Korean Single Speaker Speech Dataset. https://www.kaggle.com/bryanpark/korean-single-speaker-speech-dataset
2. Henny Admoni. 2017. Social eye gaze in human-robot interaction: A review. Journal of Human-Robot Interaction (2017).
3. Chaitanya Ahuja, Dong Won Lee, Yukiko I. Nakano, and Louis-Philippe Morency. 2020. Style transfer for co-speech gesture animation: A multi-speaker conditional-mixture approach. In Proceedings of the European Conference on Computer Vision (ECCV’20), Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer International Publishing, Cham, 248–265.
4. Niki Aifanti, Christos Papachristou, and Anastasios Delopoulos. 2010. The MUG facial expression database. In Proceedings of the 11th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS’10). IEEE, Desenzano del Garda, Italy, 1–4.
5. Simon Alexanderson, Gustav Eje Henter, Taras Kucherenko, and Jonas Beskow. 2020. Style-controllable speech-driven gesture synthesis using normalising flows. Computer Graphics Forum 39, 2 (2020), 487–496.