1. Admoni, H., Srinivasa, S.: Predicting user intent through eye gaze for shared autonomy. In: Proceedings of the AAAI Fall Symposium Series: Shared Autonomy in Research and Practice (AAAI Fall Symposium), pp. 298–303 (2016)
2. Bavelas, J., Gerwing, J., Healing, S.: Hand and facial gestures in conversational interaction. In: Holtgraves, T.M. (ed.) The Oxford Handbook of Language and Social Psychology, pp. 111–130. Oxford University Press, Oxford (2014)
3. Bolt, R.A.: “Put-that-there”: voice and gesture at the graphics interface, vol. 14. ACM (1980)
4. Chai, J.Y., et al.: Collaborative effort towards common ground in situated human-robot dialogue. In: Proceedings of the 2014 ACM/IEEE International Conference on Human-Robot Interaction, pp. 33–40. ACM (2014)
5. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint