Open-Domain Dialogue Quality Evaluation: Deriving Nugget-level Scores from Turn-level Scores-Reference-Cited by-同舟云学术

Open-Domain Dialogue Quality Evaluation: Deriving Nugget-level Scores from Turn-level Scores

Published:2023-11-26 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region
language:
Short-container-title:

Author:

Takehi Rikiya¹^ORCID,Watanabe Akihisa²^ORCID,Sakai Tetsuya³^ORCID

Affiliation:

1. The Department of Computer Science and Engineering, Waseda University, Japan

2. Waseda University, Japan

3. Department of Computer Science and Engineering, Waseda University, Japan

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3624918.3625338

Reference23 articles.

1. Yi Chen Rui Wang Haiyun Jiang Shuming Shi and Ruifeng Xu. 2023. Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: A Preliminary Empirical Study. arxiv:2304.00723 [cs.CL] Yi Chen Rui Wang Haiyun Jiang Shuming Shi and Ruifeng Xu. 2023. Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: A Preliminary Empirical Study. arxiv:2304.00723 [cs.CL]

2. Survey on evaluation methods for dialogue systems

3. Sarik Ghazarian , Ralph Weischedel , Aram Galstyan , and Nanyun Peng . 2020 . Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems. arxiv:1911.01456 [cs.CL] Sarik Ghazarian, Ralph Weischedel, Aram Galstyan, and Nanyun Peng. 2020. Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems. arxiv:1911.01456 [cs.CL]

4. Weakly Supervised Turn-level Engagingness Evaluator for Dialogues

5. Longxuan Ma , Ziyu Zhuang , Weinan Zhang , Mingda Li , and Ting Liu . 2022 . SelF-Eval: Self-supervised Fine-grained Dialogue Evaluation . In Proceedings of the 29th International Conference on Computational Linguistics. International Committee on Computational Linguistics, Gyeongju, Republic of Korea, 485–495 . https://aclanthology.org/2022.coling-1.39 Longxuan Ma, Ziyu Zhuang, Weinan Zhang, Mingda Li, and Ting Liu. 2022. SelF-Eval: Self-supervised Fine-grained Dialogue Evaluation. In Proceedings of the 29th International Conference on Computational Linguistics. International Committee on Computational Linguistics, Gyeongju, Republic of Korea, 485–495. https://aclanthology.org/2022.coling-1.39