Research on Applying the “Shift” Concept to Deep Attention Matching

Published:2023-03-20 Issue:6 Volume:13 Page:3934
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Hu Kai¹, Xiao Nanfeng¹

Affiliation:

1. School of Computer Science and Engineering, South China University of Technology, Guangzhou 510006, China

Abstract

The main purpose of chatbots is to enable intelligent interaction between humans and machines. A complete conversation generally involves context: before responding, people extract information from the preceding conversation, and they are very good at doing so. How to make chatbots extract the appropriate contextual information more completely has therefore become a key issue in multi-round dialogue. Most recent research has focused on point-to-point matching of responses and utterances at corresponding levels of granularity, as in SMN (Sequential Matching Network) and DAM (Deep Attention Matching Network), and the way these networks parse the text affects their effectiveness to some extent. DAM, for example, introduces an attention mechanism: a self-attention module parses the response and each utterance at five levels of granularity, and representations at the same level are then matched to mine similar spatial information, which yields good results. Based on the structure of DAM, this paper proposes an information-mining idea for improving DAM that also applies to other models with a multi-layer matching structure. Building on this idea, the paper presents two improved methods for DAM that increase the accuracy of information extraction from the context of multi-round chats. Experiments show that the improved DAM outperforms the original DAM: R_2@1 increased by 0.425%, R_10@1 by 0.515%, R_10@2 by 0.341%, MAP (Mean Average Precision) by 0.358% on the Douban data set, P@1 by 1.16%, and R_10@1 by 1.17%. These results are better than those of state-of-the-art methods, and the improved methods can be used not only in DAM but also in other models with similar point-to-point matching structures.
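
The abstract reports R_n@k, P@1, and MAP without defining them. The following is a minimal sketch of how such ranking metrics are commonly computed for response selection. It assumes that each context comes with n scored candidate responses and 0/1 relevance labels. The function names and the sample scores are hypothetical and are not taken from the paper, whose exact evaluation code is not given here.

```python
from typing import Sequence


def _ranking(scores: Sequence[float]) -> list:
    # Candidate indices ordered from highest to lowest model score.
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


def recall_at_k(scores: Sequence[float], labels: Sequence[int], k: int) -> float:
    """R_n@k for one context: share of correct responses ranked in the top k of n candidates."""
    positives = sum(labels)
    if positives == 0:
        return 0.0
    hits = sum(labels[i] for i in _ranking(scores)[:k])
    return hits / positives


def precision_at_1(scores: Sequence[float], labels: Sequence[int]) -> float:
    """P@1 for one context: 1.0 if the top-ranked candidate is correct, else 0.0."""
    return float(labels[_ranking(scores)[0]])


def average_precision(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Average precision for one context; MAP is the mean of this value over contexts."""
    hits = 0
    total = 0.0
    for rank, i in enumerate(_ranking(scores), start=1):
        if labels[i]:
            hits += 1
            total += hits / rank  # precision at the rank of each correct response
    return total / hits if hits else 0.0


def mean_average_precision(contexts) -> float:
    """MAP over an iterable of (scores, labels) pairs, one pair per context."""
    values = [average_precision(s, l) for s, l in contexts]
    return sum(values) / len(values) if values else 0.0


# Hypothetical example: 4 candidates, of which candidates 2 and 3 are correct.
example_scores = [0.9, 0.2, 0.75, 0.4]
example_labels = [0, 0, 1, 1]
r_at_2 = recall_at_k(example_scores, example_labels, k=2)
p_at_1 = precision_at_1(example_scores, example_labels)
ap = average_precision(example_scores, example_labels)
```

In this sketch, a data set with exactly one correct response per context makes R_n@1 and P@1 coincide, while data sets with several correct responses per context make them differ.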

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/13/6/3934/pdf

Reference: 49 articles.

1. Turing, A.M. (2009). Parsing the Turing Test, Springer.

2. Ji, Z., Lu, Z., and Li, H. (2014). An information retrieval approach to short text conversation. arXiv.

3. Wang, H., Lu, Z., Li, H., and Chen, E. (2013, October 18–21). A dataset for research on short-text conversations. Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, WA, USA.

4. Young et al. (2010). The hidden information state model: A practical framework for POMDP-based spoken dialogue management. Comput. Speech Lang.

5. Zhou, X., Dong, D., Wu, H., Zhao, S., Yu, D., Tian, H., Liu, X., and Yan, R. (2016, November 1–5). Multi-view response selection for human-computer conversation. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, TX, USA.