A survey on metrics for the evaluation of user simulations-Reference-Cited by-同舟云学术

A survey on metrics for the evaluation of user simulations

Published:2012-11-28 Issue:1 Volume:28 Page:59-73
ISSN:0269-8889
Container-title:The Knowledge Engineering Review
language:en
Short-container-title:The Knowledge Engineering Review

Author:

Pietquin Olivier,Hastie Helen

Abstract

AbstractUser simulation is an important research area in the field of spoken dialogue systems (SDSs) because collecting and annotating real human–machine interactions is often expensive and time-consuming. However, such data are generally required for designing, training and assessing dialogue systems. User simulations are especially needed when using machine learning methods for optimizing dialogue management strategies such as Reinforcement Learning, where the amount of data necessary for training is larger than existing corpora. The quality of the user simulation is therefore of crucial importance because it dramatically influences the results in terms of SDS performance analysis and the learnt strategy. Assessment of the quality of simulated dialogues and user simulation methods is an open issue and, although assessment metrics are required, there is no commonly adopted metric. In this paper, we give a survey of User Simulations Metrics in the literature, propose some extensions and discuss these metrics in terms of a list of desired features.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Software

Reference48 articles.

1. Data-driven user simulation for automated evaluation of spoken dialog systems

2. Recent research advances in Reinforcement Learning in Spoken Dialogue Systems

3. Partially observable Markov decision processes for spoken dialog systems

4. A stochastic model of human-machine interaction for learning dialog strategies

Cited by 43 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Towards a Formal Characterization of User Simulation Objectives in Conversational Information Access;Proceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval;2024-08-02

2. Exploring the Utility of Emotion Recognition Systems in Healthcare;Advances in Psychology, Mental Health, and Behavioral Studies;2024-04-12

3. Metaphorical User Simulators for Evaluating Task-oriented Dialogue Systems;ACM Transactions on Information Systems;2023-08-18

4. Development of a Trust-Aware User Simulator for Statistical Proactive Dialog Modeling in Human-AI Teams;Adjunct Proceedings of the 31st ACM Conference on User Modeling, Adaptation and Personalization;2023-06-16

5. A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning;Machine Intelligence Research;2023-01-07