Affiliation:
1. School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100083, China
2. Li Auto Inc., Beijing 101399, China
Abstract
Cross-domain dialogue state tracking (DST) uses labeled data from source domains to train a DST model for target domains, which is important for transferring a dialogue system to new domains. Most existing cross-domain DST models track each slot independently. This ignores the correlations among slots, which hurts performance, and it makes both training and inference inefficient. This paper therefore proposes a prompt-based end-to-end cross-domain DST method that tracks all slots simultaneously. A dynamic prompt template shuffle method is proposed to alleviate bias toward slot order, and a dynamic prompt template sampling method is proposed to alleviate bias toward the number of slots. Experimental results on the MultiWOZ 2.0 and MultiWOZ 2.1 datasets show that the approach consistently outperforms state-of-the-art baselines in all target domains and improves both training and inference efficiency by at least a factor of 5.
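As a rough illustration of the two template strategies named in the abstract, the Python sketch below builds a single prompt that covers several slots at once, either shuffling their order or sampling a random subset of them. The abstract does not specify the template format or the sampling scheme, so the function name `build_prompt`, the `slot: <mask>` layout, and the uniform choice of subset size are all assumptions made for illustration, not the authors' implementation.

```python
import random


def build_prompt(dialogue, slots, rng, sample=False):
    """Return (prompt, chosen_slots) for one training example.

    Hypothetical helper: the template layout and sampling scheme are
    assumptions, not details taken from the paper.
    """
    chosen = list(slots)
    if not chosen:
        return dialogue, []
    if sample:
        # Sampling: keep a random, non-empty subset so the model does not
        # come to expect a fixed number of slots. rng.sample also returns
        # the subset in random order.
        k = rng.randint(1, len(chosen))
        chosen = rng.sample(chosen, k)
    else:
        # Shuffle: keep every slot but randomize the order so the model
        # does not come to expect a fixed slot position.
        rng.shuffle(chosen)
    template = " ".join(f"{slot}: <mask>" for slot in chosen)
    return f"{dialogue} {template}", chosen


if __name__ == "__main__":
    rng = random.Random(0)
    slots = ["hotel-area", "hotel-stars", "hotel-parking"]
    prompt, chosen = build_prompt("I need a cheap hotel in the north.", slots, rng)
    print(prompt)
```

Calling the function anew for each training example (rather than once per dataset) is what would make the template "dynamic" in this sketch: the same dialogue can appear with a different slot order or slot count in different epochs.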
Funder
National Natural Science Foundation of China