Knowledge-aware Conversational Preference Elicitation with Bandit Feedback

Published: 2022-04-25
Container-title: Proceedings of the ACM Web Conference 2022

Author:

Zhao Canzhe¹, Yu Tong², Xie Zhihui¹, Li Shuai¹

Affiliation:

1. Shanghai Jiao Tong University, China

2. Carnegie Mellon University, USA

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3485447.3512152

References: 52 articles.

1. Yasin Abbasi-Yadkori, Dávid Pál, and Csaba Szepesvári. 2011. Improved Algorithms for Linear Stochastic Bandits. In Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, Granada, Spain. 2312–2320.

2. Naoki Abe and Philip M. Long. 1999. Associative Reinforcement Learning using Linear Probabilistic Concepts. In Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27-30, 1999. Morgan Kaufmann, 3–11.

3. Charu C Aggarwal. 2016. Recommender systems. Vol. 1. Springer.

4. Noga Alon, Nicolò Cesa-Bianchi, Ofer Dekel, and Tomer Koren. 2015. Online Learning with Feedback Graphs: Beyond Bandits. In Proceedings of The 28th Conference on Learning Theory, COLT 2015, Paris, France, July 3-6, 2015 (JMLR Workshop and Conference Proceedings, Vol. 40). JMLR.org, 23–35.

5. Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback

Cited by 14 articles.

1. The effect of preference elicitation methods on the user experience in conversational recommender systems;Computer Speech & Language;2025-01

2. Conversational Dueling Bandits in Generalized Linear Models;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

3. Doing Personal LAPS: LLM-Augmented Dialogue Construction for Personalized Multi-Session Conversational Search;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

4. Robust and efficient algorithms for conversational contextual bandit;Information Sciences;2024-02

5. Toward joint utilization of absolute and relative bandit feedback for conversational recommendation;User Modeling and User-Adapted Interaction;2024-01-27