Assessing user simulation for dialog systems using human judges and automatic evaluation measures-Reference-Cited by-同舟云学术

Assessing user simulation for dialog systems using human judges and automatic evaluation measures

Published:2011-02-01 Issue:4 Volume:17 Page:511-540
ISSN:1351-3249
Container-title:Natural Language Engineering
language:en
Short-container-title:Nat. Lang. Eng.

Author:

AI HUA,LITMAN DIANE

Abstract

AbstractWhile different user simulations are built to assist dialog system development, there is an increasing need to quickly assess the quality of the user simulations reliably. Previous studies have proposed several automatic evaluation measures for this purpose. However, the validity of these evaluation measures has not been fully proven. We present an assessment study in which human judgments are collected on user simulation qualities as the gold standard to validate automatic evaluation measures. We show that a ranking model can be built using the automatic measures to predict the rankings of the simulations in the same order as the human judgments. We further show that the ranking model can be improved by using a simple feature that utilizes time-series analysis.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Linguistics and Language,Language and Linguistics,Software

Reference47 articles.

1. Data-driven user simulation for automated evaluation of spoken dialog systems

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Linking Dialogue with Student Modelling to Create an Adaptive Tutoring System for Conceptual Physics;International Journal of Artificial Intelligence in Education;2021-01-04

2. Promoting Effects of Computer Scoring on English Learning of College Students;International Journal of Emerging Technologies in Learning (iJET);2020-04-08