On Leveraging Large Language Models for Multilingual Intent Discovery-Reference-Cited by-同舟云学术

On Leveraging Large Language Models for Multilingual Intent Discovery

Published:2024-08-13 Issue: Volume: Page:
ISSN:2158-656X
Container-title:ACM Transactions on Management Information Systems
language:en
Short-container-title:ACM Trans. Manage. Inf. Syst.

Author:

Chow Rudolf¹^ORCID,Suen King Yiu²^ORCID,Lam Albert Y.S.²^ORCID

Affiliation:

1. Fano Labs, Hong Kong, Hong Kong

2. Fano Labs, Hong Kong Hong Kong

Abstract

Intent discovery is vital for any real-world dialogue systems such as chatbot. Since the intents of users naturally change over time, models only trained on a static training set of intents will inevitably fail to detect new intents. While this topic has been widely studied, existing work only focuses on monolingual datasets, rendering it less practical for international businesses where it is far more common to work with multilingual data. In this work, we present a method for multilingual intent discovery through leveraging the multilingual capabilities of recent large language models. By performing joint extraction of intent and keyphrases, as well as a chain-of-thought styled reasoning, our method is able to efficiently produce clustering results that are easy to interpret. Experimental results on two different datasets show that our proposed method consistently surpasses all baselines, with up to 15% gain in adjusted rand index.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3688400

Reference37 articles.

1. K. Ahuja, H. Diddee, R. Hada, M. Ochieng, K. Ramesh, P. Jain, A. Nambi, T. Ganu, S. Segal, M. Axmed, K. Bali, and S. Sitaram. 2023. MEGA: Multilingual Evaluation of Generative AI. In Proceedings of the 2023 Conference on EMNLP. Association for Computational Linguistics, Singapore.

2. T. Brown B. Mann N. Ryder M. Subbiah J. Kaplan P. Dhariwal A. Neelakantan P. Shyam G. Sastry A. Askell S. Agarwal A. Herbert-Voss G. Krueger T. Henighan R. Child A. Ramesh D. Ziegler J. Wu C. Winter C. Hesse M. Chen E. Sigler M. Litwin S. Gray B. Chess J. Clark C. Berner S. McCandlish A. Radford I. Sutskever and D. Amodei. 2020. Language Models are Few-Shot Learners. In Advances in Neural Information Processing Systems 32. 1877–1901.

3. M. Caron, P. Bojanowski, A. Joulin, and M. Douze. 2018. Deep clustering for unsupervised learning of visual features. In European Conference on Computer Vision. 132–149.

4. A. Chatterjee and S. Sengupta. 2020. Intent Mining from past conversations for Conversational Agent. In Proceedings of the 28th International Conference on Computational Linguistics. International Committee on Computational Linguistics, Barcelona, Spain, 4140–4152.

5. J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT 2019. Association for Computational Linguistics, Minneapolis, Minnesota, 4171–4186.