Author:
Vázquez Marynel,Lew Alexander,Gorevoy Eden,Connolly Joe
Abstract
We study two approaches for predicting an appropriate pose for a robot to take part in group formations typical of social human conversations subject to the physical layout of the surrounding environment. One method is model-based and explicitly encodes key geometric aspects of conversational formations. The other method is data-driven. It implicitly models key properties of spatial arrangements using graph neural networks and an adversarial training regimen. We evaluate the proposed approaches through quantitative metrics designed for this problem domain and via a human experiment. Our results suggest that the proposed methods are effective at reasoning about the environment layout and conversational group formations. They can also be used repeatedly to simulate conversational spatial arrangements despite being designed to output a single pose at a time. However, the methods showed different strengths. For example, the geometric approach was more successful at avoiding poses generated in nonfree areas of the environment, but the data-driven method was better at capturing the variability of conversational spatial formations. We discuss ways to address open challenges for the pose generation problem and other interesting avenues for future work.
Subject
Artificial Intelligence,Computer Science Applications
Reference58 articles.
1. Wasserstein Generative Adversarial Networks;Arjovsky,2017
2. Let Me Join You! Real-Time F-Formation Recognition by a Socially Aware Robot;Barua,2020
3. Relational Inductive Biases, Deep Learning, and Graph Networks;Battaglia,2018
4. Orthogonal Distance Regression;Boggs;Contemp. Mathematics,1990
5. A Study in Scene Shaping: Adjusting F-Formations in the Wild;Bohus,2017
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献