Dialogue-Rewriting Model Based on Transformer Pointer Extraction

Published: 2024-06-17 Issue: 12 Volume: 13 Page: 2362
ISSN: 2079-9292
Container-title: Electronics
Language: en

Author:

Pu Chenyang¹, Sun Zhangjie¹, Li Chuan¹, Song Jianfeng¹

Affiliation:

1. School of Computer Science and Technology, Xidian University, Xi’an 710071, China

Abstract

In multi-turn dialogue, user utterances often contain pronouns whose referents are unresolved and omit information stated in earlier turns, leaving them semantically incomplete. Unclear referents and missing components make the text incoherent and hinder machine understanding of spoken-style utterances. Multi-turn dialogue rewriting is commonly used to address this problem, but existing dialogue-rewriting methods often suffer from low precision and high latency on such texts. To mitigate these shortcomings, this paper proposes a Transformer-based dialogue-rewriting model that uses pointer extraction. The method uses a pre-trained Transformer model to extract the latent semantic features of the text and locates its key information through pointer positions. By extracting keywords and replacing or inserting them at the appropriate places, the model restores referents and missing information. Experiments on an open-source Chinese multi-turn dialogue-rewriting dataset show that the proposed method improves both the accuracy and the efficiency of rewriting compared with existing methods. Specifically, the ROUGE-1 value increased by 2.9%, while the time consumption decreased by 50% compared with the baseline method.
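The replace-or-insert step described in the abstract can be illustrated with a minimal Python sketch. It covers only the application of edits, not the model: in the paper the pointer positions come from a Transformer, whereas here they are written by hand. The `PointerEdit` class, its field names, and the `apply_edits` function are hypothetical names introduced for illustration and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PointerEdit:
    """One hypothetical edit: copy context[src_start:src_end] into the utterance.

    If tgt_start == tgt_end, the copied span is inserted at that position.
    Otherwise it replaces utterance[tgt_start:tgt_end].
    All indices are token positions, with exclusive end indices.
    """
    src_start: int
    src_end: int
    tgt_start: int
    tgt_end: int


def apply_edits(context: List[str],
                utterance: List[str],
                edits: List[PointerEdit]) -> List[str]:
    """Return a rewritten copy of `utterance` with all edits applied."""
    result = list(utterance)
    # Apply edits from right to left so that earlier target
    # positions are not shifted by later insertions.
    for edit in sorted(edits, key=lambda e: e.tgt_start, reverse=True):
        span = context[edit.src_start:edit.src_end]
        result[edit.tgt_start:edit.tgt_end] = span
    return result


# Pronoun resolution: replace "it" with "the new phone".
context_a = ["I", "bought", "the", "new", "phone"]
utterance_a = ["how", "much", "did", "it", "cost"]
rewritten_a = apply_edits(context_a, utterance_a, [PointerEdit(2, 5, 3, 4)])
print(" ".join(rewritten_a))

# Omission recovery: insert "how is the weather" after "and".
context_b = ["how", "is", "the", "weather", "in", "Beijing"]
utterance_b = ["and", "in", "Shanghai"]
rewritten_b = apply_edits(context_b, utterance_b, [PointerEdit(0, 4, 1, 1)])
print(" ".join(rewritten_b))
```

Because the rewrite only copies spans that the pointers select, no token-by-token generation is needed at this step, which is one plausible reason an extraction-based approach could be faster than a generative one.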

Funder

Continuing Education Teaching Reform Research Program of Xidian University

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/12/2362/pdf

References: 19 articles.

1. Hao, J., Song, L., Wang, L., Xu, K., Tu, Z., and Yu, D. (2022). Robust Dialogue Utterance Rewriting as Sequence Tagging. U.S. Patent 17/192,260.

2. DuReSE: Rewriting Incomplete Utterances via Neural Sequence Editing;Jiang;Neural Process. Lett.,2023

3. Su, H., Shen, X., Zhang, R., Sun, F., Hu, P., Niu, C., and Zhou, J. (2019). Improving Multi-turn Dialogue Modelling with Utterance ReWriter. arXiv.

4. Niehues, J., Cho, E., Ha, T.L., and Waibel, A. (2016). Pre-Translation for Neural Machine Translation. arXiv.

5. Junczys-Dowmunt, M., and Grundkiewicz, R. (2017). An Exploration of Neural Sequence-to-Sequence Architectures for Automatic Post-Editing. arXiv.