Reinforcing personalized persuasion in task-oriented virtual sales assistant-Reference-Cited by-同舟云学术

Reinforcing personalized persuasion in task-oriented virtual sales assistant

Published:2023-01-05 Issue:1 Volume:18 Page:e0275750
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Raut Aritra,Tiwari Abhisek^ORCID,Das Subrata,Saha Sriparna,Maitra Anutosh,Ramnani Roshni,Sengupta Shubhashis

Abstract

Purpose Existing task-oriented virtual agents can assist users with simple tasks like ticket booking, hotel reservations, etc. effectively and with high confidence. These virtual assistants, however, assume specific, predictable end-user behavior, such as predefined/servable objectives, which results in conversation failures in challenging situations, such as when goals are unavailable. Methodology Inspired by the practice and its efficacy, we propose an end-to-end framework for task-oriented persuasive dialogue generation that combines pre-training and reinforcement learning for generating context-aware persuasive responses. We utilize four novel rewards to improve consistency and repetitiveness in generated responses. Additionally, a meta-learning strategy has also been utilized to make the model parameters better for domain adaptation. Furthermore, we also curate a personalized persuasive dialogue (PPD) corpus, which contains utterance-level intent, slot, sentiment, and persuasion strategy annotation. Findings The obtained results and detailed analysis firmly establish the effectiveness of the proposed persuasive virtual assistant over traditional task-oriented virtual assistants. The proposed framework considerably increases the quality of dialogue generation in terms of consistency and repetitiveness. Additionally, our experiment with a few shot and zero-shot settings proves that our meta-learned model learns to quickly adopt new domains with a few or even zero no. of training epochs. It outperforms the non-meta-learning-based approaches keeping the base model constant. Originality To the best of our knowledge, this is the first effort to improve a task-oriented virtual agent’s persuasiveness and domain adaptation.

Funder

Accenture

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference67 articles.

1. Lipton Z, Li X, Gao J, Li L, Ahmed F, Deng L. BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems. Proceedings of the AAAI Conference on Artificial Intelligence. 2018;32(1).

2. Li X, Chen YN, Li L, Gao J. End-to-End Task-Completion Neural Dialogue Systems. 2017;.

3. Liu B, Lane I. End-to-End Learning of Task-Oriented Dialogs. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop. New Orleans, Louisiana, USA: Association for Computational Linguistics; 2018. p. 67–73. Available from: https://aclanthology.org/N18-4010.

4. Chen W, Chen J, Qin P, Yan X, Wang WY. Semantically conditioned dialog response generation via hierarchical disentangled self-attention. arXiv preprint arXiv:190512866. 2019;.

5. Wang K, Tian J, Wang R, Quan X, Yu J. Multi-domain dialogue acts and response co-generation. arXiv preprint arXiv:200412363. 2020;.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Dynamic Negotiation Landscapes: Mbps and the Interplay of Buyer Personalities;2024