Attention-based Stacked Bidirectional Long Short-term Memory Model for Word Sense Disambiguation-Reference-Cited by-同舟云学术

Attention-based Stacked Bidirectional Long Short-term Memory Model for Word Sense Disambiguation

Published:2023-05-18 Issue: Volume: Page:
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Sun Yujia¹²^ORCID,Platoš Jan¹

Affiliation:

1. Department of Computer Science, Technical University of Ostrava, 17. listopadu 2172/15, Ostrava-Poruba, 70800, Czech Republic

2. Institute of Network Information Security, Hebei GEO University, No. 136 East Huai΄an Road, Shijiazhuang Hebei, 050031, China

Abstract

Word sense disambiguation is a basic task in Natural Language Processing which aims to identify the most appropriate sense of ambiguous words in different contexts by applying algorithm models. In this work, we propose a model that uses a stacked bidirectional Long Short-Term Memory neural network and attention mechanism to determine the sense of ambiguous words. First, the stacked bidirectional Long Short-Term Memory is employed for deep embedding-based representation of sentences containing ambiguous words. Then, we utilize the self-attention mechanism to highlight the contextual features of ambiguous words, and then construct the overall semantic representation of sentences. Finally, the sentence semantic representation is applied to the multilayer perception classifier to generate the appropriate category of the ambiguous word sense items. This model is tested on the Semeval-2007 task-17: English lexical samples dataset and using examples of ambiguous words sourced from Oxford, Cambridge, and Collins dictionaries as extra test datasets. The effectiveness of the proposed approach is demonstrated via comparison with existing word sense disambiguation approaches. Our experimental results show that the proposed model outperforms other word sense disambiguation methods in terms of the evaluation metrics (Average Accuracy, Micro F1-Score, Kappa, and Matthews Correlation Coefficient), and exhibits strong interpretability.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3594780

Reference48 articles.

1. WordNet

2. Semantic Wikipedia

3. BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network

4. Maru , M. , Scozzafava , F. , Martelli , F. , Navigli , R. : SyntagNet: Challenging supervised word sense disambiguation with lexical-semantic combinations . In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) , pp. 3525– 3531 . Association for Computational Linguistics, Hong Kong, China (2019). https://aclanthology.org/D19-1359 Maru, M., Scozzafava, F., Martelli, F., Navigli, R.: SyntagNet: Challenging supervised word sense disambiguation with lexical-semantic combinations. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 3525–3531. Association for Computational Linguistics, Hong Kong, China (2019). https://aclanthology.org/D19-1359

5. Scarlini , B. , Pasini , T. , Navigli , R. : SensEmBERT: Context-enhanced sense embeddings for multilingual word sense disambiguation . In: Proceedings of the AAAI Conference on Artificial Intelligence , vol. 34 , pp. 8758 - 8765 ( 2020 ). Scarlini, B., Pasini, T., Navigli, R.: SensEmBERT: Context-enhanced sense embeddings for multilingual word sense disambiguation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 8758-8765 (2020).

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Abstractive text summarization model combining a hierarchical attention mechanism and multiobjective reinforcement learning;Expert Systems with Applications;2024-08