Author:
Greco Claudio,Bagade Diksha,Le Dieu-Thu,Bernardi Raffaella
Abstract
Communication is a dynamic process through which interlocutors adapt to each other. In the development of conversational agents, this core aspect has been put aside for several years since the main challenge was to obtain conversational neural models able to produce utterances and dialogues that at least at the surface level are human-like. Now that this milestone has been achieved, the importance of paying attention to the dynamic and adaptive interactive aspects of language has been advocated in several position papers. In this paper, we focus on how a Speaker adapts to an interlocutor with different background knowledge. Our models undergo a pre-training phase, through which they acquire grounded knowledge by learning to describe an image, and an adaptive phase through which a Speaker and a Listener play a repeated reference game. Using a similar setting, previous studies focus on how conversational models create new conventions; we are interested, instead, in studying whether the Speaker learns from the Listener's mistakes to adapt to his background knowledge. We evaluate models based on Rational Speech Act (RSA), a likelihood loss, and a combination of the two. We show that RSA could indeed work as a backbone to drive the Speaker toward the Listener: in the combined model, apart from the improved Listener's accuracy, the language generated by the Speaker features the changes that signal adaptation to the Listener's background knowledge. Specifically, captions to unknown object categories contain more adjectives and less direct reference to the unknown objects.
Reference46 articles.
1. “Reasoning about pragmatics with neural listeners and speakers,”;Andreas;Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing,2016
2. “Learning to mediate disparities towards pragmatic communication,”;Bao;Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers),2022
3. “MindCraft: theory of mind modeling for situated dialogue in collaborative tasks,”;Bara;Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing,2021
4. “Grounding as a collaborative process,”;Benotti;Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume,2021