Polite Dialogue Generation Without Parallel Data-Reference-Cited by-同舟云学术

Polite Dialogue Generation Without Parallel Data

Published:2018-12 Issue: Volume:6 Page:373-389
ISSN:2307-387X
Container-title:Transactions of the Association for Computational Linguistics
language:en
Short-container-title:TACL

Author:

Niu Tong¹,Bansal Mohit¹

Affiliation:

1. UNC Chapel Hill,

Abstract

Stylistic dialogue response generation, with valuable applications in personality-based conversational agents, is a challenging task because the response needs to be fluent, contextually-relevant, as well as paralinguistically accurate. Moreover, parallel datasets for regular-to-stylistic pairs are usually unavailable. We present three weakly-supervised models that can generate diverse, polite (or rude) dialogue responses without parallel data. Our late fusion model (Fusion) merges the decoder of an encoder-attention-decoder dialogue model with a language model trained on stand-alone polite utterances. Our label-finetuning (LFT) model prepends to each source sequence a politeness-score scaled label (predicted by our state-of-the-art politeness classifier) during training, and at test time is able to generate polite, neutral, and rude responses by simply scaling the label embedding by the corresponding score. Our reinforcement learning model (Polite-RL) encourages politeness generation by assigning rewards proportional to the politeness classifier score of the sampled response. We also present two retrievalbased, polite dialogue model baselines. Human evaluation validates that while the Fusion and the retrieval-based models achieve politeness with poorer context-relevance, the LFT and Polite-RL models can produce significantly more polite responses without sacrificing dialogue quality.

Publisher

MIT Press - Journals

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/tacl_a_00027

Reference11 articles.

1. Inter-Coder Agreement for Computational Linguistics

2. Amazon's Mechanical Turk

3. Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit.

4. Long Short-Term Memory

Cited by 39 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Two in One: A multi-task framework for politeness turn identification and phrase extraction in goal-oriented conversations;Computer Speech & Language;2024-11

2. JH-Ranker: Enhancing Keigo Recognition in Japanese Sentences through Multi-Task Learning;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

3. Computational Politeness in Natural Language Processing: A Survey;ACM Computing Surveys;2024-05-08

4. Emotion-and-knowledge grounded response generation in an open-domain dialogue setting;Knowledge-Based Systems;2024-01

5. Please Donate to Save a Life: Inducing Politeness to Handle Resistance in Persuasive Dialogue Agents;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024