Author:
Wu Yu, Wei Furu, Huang Shaohan, Wang Yunli, Li Zhoujun, Zhou Ming
Abstract
Open-domain response generation has achieved remarkable progress in recent years, but sometimes yields short and uninformative responses. We propose a new paradigm, prototype-then-edit, for response generation: it first retrieves a prototype response from a pre-defined index and then edits the prototype response according to the differences between the prototype context and the current context. Our motivation is that the retrieved prototype provides a good starting point for generation because it is grammatical and informative, and the post-editing process further improves the relevance and coherence of the prototype. In practice, we design a context-aware editing model that is built upon an encoder-decoder framework augmented with an edit vector. We first generate the edit vector by considering lexical differences between the prototype context and the current context. After that, the edit vector and the prototype response representation are fed to a decoder to generate a new response. Experimental results on a large-scale dataset demonstrate that our new paradigm significantly increases the relevance, diversity and originality of generation results, compared to traditional generative models. Furthermore, our model outperforms retrieval-based methods in terms of relevance and originality.
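As a rough illustration of the "lexical differences" step described in the abstract, the following minimal sketch computes the words that appear only in the current context and the words that appear only in the prototype context, then concatenates a summed vector for each group into a toy edit vector. Everything here is an assumption made for illustration: the function names (`lexical_diff`, `toy_embedding`, `edit_vector`), whitespace tokenization, and the character-code embedding are hypothetical stand-ins, and the abstract does not specify how the actual model turns these differences into its learned edit vector.

```python
from typing import List, Set, Tuple


def lexical_diff(prototype_context: str, current_context: str) -> Tuple[Set[str], Set[str]]:
    """Return (insertion_words, deletion_words) between the two contexts.

    Assumption: simple lowercased whitespace tokenization.
    """
    proto = set(prototype_context.lower().split())
    curr = set(current_context.lower().split())
    insertion = curr - proto  # words only in the current context
    deletion = proto - curr   # words only in the prototype context
    return insertion, deletion


def toy_embedding(word: str, dim: int = 4) -> List[float]:
    """Hypothetical deterministic embedding built from character codes.

    A stand-in for learned word embeddings; values lie in [0, 1).
    """
    return [(sum(ord(c) * (i + 1) for c in word) % 7) / 7.0 for i in range(dim)]


def sum_vectors(words: Set[str], dim: int = 4) -> List[float]:
    """Sum the toy embeddings of a set of words (zero vector if empty)."""
    total = [0.0] * dim
    for word in sorted(words):
        for i, value in enumerate(toy_embedding(word, dim)):
            total[i] += value
    return total


def edit_vector(prototype_context: str, current_context: str, dim: int = 4) -> List[float]:
    """Concatenate insertion and deletion summaries into one vector.

    Assumption: plain sums and concatenation, not the paper's learned model.
    """
    insertion, deletion = lexical_diff(prototype_context, current_context)
    return sum_vectors(insertion, dim) + sum_vectors(deletion, dim)


if __name__ == "__main__":
    ins, dele = lexical_diff("do you like sports", "do you like movies")
    print("insertion words:", ins)
    print("deletion words:", dele)
    print("edit vector length:", len(edit_vector("do you like sports", "do you like movies")))
```

In a full system, a vector of this kind would be passed to the decoder together with the encoded prototype response; the sketch stops at the difference computation and does not attempt generation.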
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
26 articles.
1. Dynamic Demonstration Retrieval and Cognitive Understanding for Emotional Support Conversation;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10
2. Leveraging Intent Entity Enhancement for Task-Oriented Dialogue;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30
3. Enhancing User Experience in Chinese Initial Text Conversations with Personalised AI-Powered Assistant;Extended Abstracts of the CHI Conference on Human Factors in Computing Systems;2024-05-11
4. Math Word Problem Generation via Disentangled Memory Retrieval;ACM Transactions on Knowledge Discovery from Data;2024-03-26
5. Unsupervised Disentanglement Learning Model for Exemplar-Guided Paraphrase Generation;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024