Parameter-Efficient Fine-Tuning Method for Task-Oriented Dialogue Systems-Reference-Cited by-同舟云学术

Parameter-Efficient Fine-Tuning Method for Task-Oriented Dialogue Systems

Published:2023-07-10 Issue:14 Volume:11 Page:3048
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Mo Yunho¹^ORCID,Yoo Joon¹^ORCID,Kang Sangwoo¹^ORCID

Affiliation:

1. School of Computing, Gachon University, 1342, Seongnam-daero, Sujeong-gu, Seongnam-si 13120, Republic of Korea

Abstract

The use of Transformer-based pre-trained language models has become prevalent in enhancing the performance of task-oriented dialogue systems. These models, which are pre-trained on large text data to grasp the language syntax and semantics, fine-tune the entire parameter set according to a specific task. However, as the scale of the pre-trained language model increases, several challenges arise during the fine-tuning process. For example, the training time escalates as the model scale grows, since the complete parameter set needs to be trained. Furthermore, additional storage space is required to accommodate the larger model size. To address these challenges, we propose a new new task-oriented dialogue system called PEFTTOD. Our proposal leverages a method called the Parameter-Efficient Fine-Tuning method (PEFT), which incorporates an Adapter Layer and prefix tuning into the pre-trained language model. It significantly reduces the overall parameter count used during training and efficiently transfers the dialogue knowledge. We evaluated the performance of PEFTTOD on the Multi-WOZ 2.0 dataset, a benchmark dataset commonly used in task-oriented dialogue systems. Compared to the traditional method, PEFTTOD utilizes only about 4% of the parameters for training, resulting in a 4% improvement in the combined score compared to the existing T5-based baseline. Moreover, PEFTTOD achieved an efficiency gain by reducing the training time by 20% and saving up to 95% of the required storage space.

Funder

National Research Foundation of Korea

Gachon University

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/14/3048/pdf

Reference48 articles.

1. Probabilistic methods in spoken–dialogue systems;Young;Philos. Trans. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci.,2000

2. Su, Y., Shu, L., Mansimov, E., Gupta, A., Cai, D., Lai, Y.A., and Zhang, Y. (2021). Multi-task pre-training for plug-and-play task-oriented dialogue system. arXiv.

3. Lin, Z., Madotto, A., Winata, G.I., and Fung, P. (2020). Mintl: Minimalist transfer learning for task-oriented dialogue systems. arXiv.

4. Lee, Y. (2021, January 7). Improving end-to-end task-oriented dialog system with a simple auxiliary task. Findings of the Association for Computational Linguistics. Proceedings of the EMNLP 2021, Punta Cana, Dominican Republic.

5. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A., Kaiser, Ł., and Polosukhin, I. (2007). Advances in Neural Information Processing Systems 30, Proceedings of the NIPS, Long Beach, CA, USA, 4–9 December 2007, MIT Press.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing Task-Oriented Dialogue Modeling through Coreference-Enhanced Contrastive Pre-Training;Applied Sciences;2024-08-28

2. Structure-Aware Low-Rank Adaptation for Parameter-Efficient Fine-Tuning;Mathematics;2023-10-17