LB-BMBC: MHBiaffine-CNN to Capture Span Scores with BERT Injected with Lexical Information for Chinese NER
-
Published:2024-06-10
Issue:1
Volume:17
Page:
-
ISSN:1875-6883
-
Container-title:International Journal of Computational Intelligence Systems
-
language:en
-
Short-container-title:Int J Comput Intell Syst
Author:
Guo Tao, Zhang ZhichaoORCID
Abstract
AbstractA substantial body of research has shown that introducing lexical information in Chinese Named Entity Recognition (NER) tasks can enhance the semantic and boundary information of Chinese words. However, in most methods, the introduction of lexical information occurs at the model architecture level, which cannot fully leverage the lexicon learning capability of pre-trained models. Therefore, we propose seamless integration of external Lexicon knowledge into the Transformer layer of BERT. Additionally, we have observed that in span-based recognition, adjacent spans have special spatial relationships. To capture this relationship, we extend the work after Biaffine and use Convolutional Neural Networks (CNN) to treat the score matrix as an image, allowing us to interact with the spatial relationships of spans. Our proposed LB-BMBC model was experimented on four publicly available Chinese NER datasets: Resume, Weibo, OntoNotes v4, and MSRA. In particular, during ablation experiments, we found that CNN can significantly improve performance.
Funder
National Natural Science Foundation of China Fundamental Research Funds for the Central Universities
Publisher
Springer Science and Business Media LLC
Reference57 articles.
1. Ji, B., Yu, J., Li, S., Ma, J., Wu, Q., Tan, Y., Liu, H.: Span-based joint entity and relation extraction with attention-based span-specific and contextual semantic representations. In: Proceedings of the 28th International Conference on Computational Linguistics. International Committee on Computational Linguistics, Barcelona, Spain (Online) (2020) 2. Yu, Y., Wang, Y., Mu, J., Li, W., Jiao, S., Wang, Z., Lv, P., Zhu, Y.: Chinese mineral named entity recognition based on bert model. Expert Syst. Appl. 206, 117727 (2022) 3. Liu, Y., Wei, S., Huang, H., Lai, Q., Li, M., Guan, L.: Naming entity recognition of citrus pests and diseases based on the bert-bilstm-crf model. Expert Syst. Appl. 234, 121103 (2023) 4. Xi, Q., Ren, Y., Yao, S., Wu, G., Miao, G., Zhang, Z..: In: Jia, Y., Gu, Z., Li, A. (eds.) Chinese Named Entity Recognition: Applications and Challenges, pp. 51–81. Springer, Cham (2021) 5. Zhang, Y., Yang, J.: Chinese NER using lattice LSTM. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1554–1564. Association for Computational Linguistics, Melbourne, Australia (2018)
|
|