Learning and Evaluation of Dialogue Strategies for New Applications: Empirical Methods for Optimization from Small Data Sets-Reference-Cited by-同舟云学术

Learning and Evaluation of Dialogue Strategies for New Applications: Empirical Methods for Optimization from Small Data Sets

Published:2011-03 Issue:1 Volume:37 Page:153-196
ISSN:0891-2017
Container-title:Computational Linguistics
language:en
Short-container-title:Computational Linguistics

Author:

Rieser Verena¹,Lemon Oliver²

Affiliation:

1. School of GeoSciences/University of Edinburgh

2. School of Mathematical and Computer Sciences/Heriot-Watt University

Abstract

We present a new data-driven methodology for simulation-based dialogue strategy learning, which allows us to address several problems in the field of automatic optimization of dialogue strategies: learning effective dialogue strategies when no initial data or system exists, and determining a data-driven reward function. In addition, we evaluate the result with real users, and explore how results transfer between simulated and real interactions. We use Reinforcement Learning (RL) to learn multimodal dialogue strategies by interaction with a simulated environment which is “bootstrapped” from small amounts of Wizard-of-Oz (WOZ) data. This use of WOZ data allows data-driven development of optimal strategies for domains where no working prototype is available. Using simulation-based RL allows us to find optimal policies which are not (necessarily) present in the original data. Our results show that simulation-based RL significantly outperforms the average (human wizard) strategy as learned from the data by using Supervised Learning. The bootstrapped RL-based policy gains on average 50 times more reward when tested in simulation, and almost 18 times more reward when interacting with real users. Users also subjectively rate the RL-based policy on average 10% higher. We also show that results from simulated interaction do transfer to interaction with real users, and we explicitly evaluate the stability of the data-driven reward function.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Language and Linguistics

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/coli_a_00038

Reference22 articles.

1. Recent research advances in Reinforcement Learning in Spoken Dialogue Systems

2. Simulating speech systems

3. Automatic annotation of context and speech acts for dialogue corpora

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Conversational AI for multi-agent communication in Natural Language;AI Communications;2022-09-30

2. Understanding Dialogue for Human Communication;Handbook of Cognitive Mathematics;2022

3. Holding out the promise of Lasswell's dream: Big data analytics in public policy research and teaching;Review of Policy Research;2021-09-09

4. Understanding Dialogue for Human Communication;Handbook of Cognitive Mathematics;2021

5. Conversational AI: Dialogue Systems, Conversational Agents, and Chatbots;Synthesis Lectures on Human Language Technologies;2020-10-30