<i>CRS-Que</i> : A User-Centric Evaluation Framework for Conversational Recommender Systems-Reference-Cited by-同舟云学术

CRS-Que : A User-Centric Evaluation Framework for Conversational Recommender Systems

Published:2023-11-02 Issue: Volume: Page:
ISSN:2770-6699
Container-title:ACM Transactions on Recommender Systems
language:en
Short-container-title:ACM Trans. Recomm. Syst.

Author:

Jin Yucheng¹,Chen Li¹,Cai Wanling¹,Zhao Xianglin¹

Affiliation:

1. Hong Kong Baptist University, China

Abstract

An increasing number of recommendation systems try to enhance the overall user experience by incorporating conversational interaction. However, evaluating conversational recommender systems (CRSs) from the user’s perspective remains elusive. The GUI-based system evaluation criteria may be inadequate for their conversational counterparts. This paper presents our proposed unifying framework, CRS-Que , to evaluate the user experience of CRSs. This new evaluation framework is developed based on ResQue , a popular user-centric evaluation framework for recommender systems. Additionally, it includes user experience metrics of conversation (e.g., understanding, response quality, humanness) under two dimensions of ResQue (i.e., Perceived Qualities and User Beliefs). Following the psychometric modeling method, we validate our framework by evaluating two conversational recommender systems in different scenarios: music exploration and mobile phone purchase . The results of the two studies support the validity and reliability of the constructs in our framework and reveal how conversation constructs and recommendation constructs interact and influence the overall user experience of the CRS. We believe this framework could help researchers conduct standardized user-centric research for conversational recommender systems and provide practitioners with insights into designing and evaluating a CRS from users’ perspectives.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3631534

Reference141 articles.

1. Ahmad Abdellatif , Khaled Badran , Diego Elias Costa , and Emad Shihab . 2021. A Comparison of Natural Language Understanding Platforms for Chatbots in Software Engineering . IEEE Transactions on Software Engineering(May 2021 ), 3087–3102. https://doi.org/10.1109/tse.2021.3078384 10.1109/tse.2021.3078384 Ahmad Abdellatif, Khaled Badran, Diego Elias Costa, and Emad Shihab. 2021. A Comparison of Natural Language Understanding Platforms for Chatbots in Software Engineering. IEEE Transactions on Software Engineering(May 2021), 3087–3102. https://doi.org/10.1109/tse.2021.3078384

2. Investigating User Confidence for Uncertainty Presentation in Predictive Decision Making

3. Peter M Bentler . 1990. Comparative fit indexes in structural models.Psychological bulletin 107, 2 ( 1990 ), 238. https://doi.org/10.1037/0033-2909.107.2.238 10.1037/0033-2909.107.2.238 Peter M Bentler. 1990. Comparative fit indexes in structural models.Psychological bulletin 107, 2 (1990), 238. https://doi.org/10.1037/0033-2909.107.2.238

4. How to Recommend?

5. Simone Borsci , Alessio Malizia , Martin Schmettow , Frank Van Der Velde , Gunay Tariverdiyeva, Divyaa Balaji, and Alan Chamberlain. 2022 . The Chatbot usability scale: The design and pilot of a usability scale for interaction with AI-based conversational agents. Personal and ubiquitous computing 26, 1 (2022), 95–119. https://doi.org/10.1007/s00779-021-01582-9 10.1007/s00779-021-01582-9 Simone Borsci, Alessio Malizia, Martin Schmettow, Frank Van Der Velde, Gunay Tariverdiyeva, Divyaa Balaji, and Alan Chamberlain. 2022. The Chatbot usability scale: The design and pilot of a usability scale for interaction with AI-based conversational agents. Personal and ubiquitous computing 26, 1 (2022), 95–119. https://doi.org/10.1007/s00779-021-01582-9

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Usability and User Experience Evaluation in Intelligent Environments: A Review and Reappraisal;International Journal of Human–Computer Interaction;2024-09-12

2. What Did I Say Again? Relating User Needs to Search Outcomes in Conversational Commerce;Proceedings of Mensch und Computer 2024;2024-09

3. Introduction to the Special Issue on Perspectives on Recommender Systems Evaluation;ACM Transactions on Recommender Systems;2024-03-07