A Sentence-Matching Model Based on Multi-Granularity Contextual Key Semantic Interaction

Authors:

Li Jinhang 1, Li Yingna 1,2

Affiliations:

1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China

2. Computer Technology Application Key Lab of the Yunnan Province, Kunming 650500, China

Abstract

In Chinese sentence matching, the key semantics within each sentence and the depth of interaction between sentences strongly affect matching performance. Previous studies, however, relied mainly on shallow interactions at a single semantic granularity, leaving them vulnerable to interference from overlapping terms; distinguishing positive from negative pairs drawn from the same thematic domain is particularly challenging. This paper proposes a sentence-matching model based on multi-granularity contextual key semantic interaction. The model combines multi-scale and multi-level convolutions to extract contextual semantic information at word, phrase, and sentence granularities, and it employs multi-head self-attention and cross-attention mechanisms to align the key semantics between sentences. It then fuses the original, similarity, and dissimilarity representations of the two sentences to establish deep semantic interaction. Experiments on both open- and closed-domain datasets show that the proposed model outperforms existing baseline models and, while using a lightweight encoder, achieves matching performance comparable to large-scale pre-trained language models.
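The pipeline the abstract describes (multi-granularity convolutional encoding, attention-based alignment, and fusion of original, similarity, and dissimilarity features) can be sketched as below. This is a minimal PyTorch sketch under stated assumptions, not the paper's exact architecture: the kernel sizes (1/3/5), the head count, the max-pooling, and the class names (`MultiGranularityEncoder`, `KeySemanticInteraction`) are illustrative choices, and the multi-level (stacked) convolutions are collapsed into a single layer for brevity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiGranularityEncoder(nn.Module):
    """Parallel multi-scale convolutions over the token sequence.
    Kernel sizes are assumptions: 1 ~ word, 3 ~ phrase, 5 ~ wider
    sentence-span context. Outputs are concatenated and projected."""
    def __init__(self, dim: int):
        super().__init__()
        self.convs = nn.ModuleList(
            nn.Conv1d(dim, dim, k, padding=k // 2) for k in (1, 3, 5)
        )
        self.proj = nn.Linear(3 * dim, dim)

    def forward(self, x):                       # x: (batch, seq, dim)
        h = x.transpose(1, 2)                   # convolve along the sequence axis
        feats = [F.relu(conv(h)) for conv in self.convs]
        h = torch.cat(feats, dim=1).transpose(1, 2)
        return self.proj(h)                     # (batch, seq, dim)

class KeySemanticInteraction(nn.Module):
    """Aligns two sentences with self- and cross-attention, then fuses
    original, aligned, dissimilarity (a - b), and similarity (a * b)
    features, in the spirit of the interaction the abstract describes."""
    def __init__(self, dim: int, heads: int = 4):   # dim must divide by heads
        super().__init__()
        self.encoder = MultiGranularityEncoder(dim)
        self.self_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.fuse = nn.Linear(4 * dim, dim)
        self.classify = nn.Linear(2 * dim, 2)       # match / no-match logits

    def interact(self, a, b):
        a, _ = self.self_attn(a, a, a)              # key semantics within a
        aligned, _ = self.cross_attn(a, b, b)       # a attends to b
        fused = torch.cat([a, aligned, a - aligned, a * aligned], dim=-1)
        return F.relu(self.fuse(fused)).max(dim=1).values  # pool over tokens

    def forward(self, sent_a, sent_b):          # pre-embedded: (batch, seq, dim)
        a, b = self.encoder(sent_a), self.encoder(sent_b)
        va, vb = self.interact(a, b), self.interact(b, a)
        return self.classify(torch.cat([va, vb], dim=-1))

# Usage with random embeddings; the two sentences may differ in length.
model = KeySemanticInteraction(dim=128)
logits = model(torch.randn(8, 20, 128), torch.randn(8, 24, 128))
```

Running the interaction in both directions and concatenating the pooled vectors keeps the sketch symmetric in the two sentences; the `[a; aligned; a - aligned; a * aligned]` fusion is one common way to expose original, dissimilarity, and similarity signals to the classifier.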

Funder

Key Projects of the Science and Technology Plan of the Yunnan Provincial Department of Science and Technology

Publisher

MDPI AG

