Research on Chinese Nested Entity Recognition Based on IDCNNLR and GlobalPointer
-
Published:2024-01-08
Issue:1
Volume:7
Page:8
-
ISSN:2571-5577
-
Container-title:Applied System Innovation
-
language:en
-
Short-container-title:ASI
Author:
Li Weijun12ORCID, Liu Jintong1, Gao Yuxiao1, Zhang Xinyong1, Gu Jianlai1
Affiliation:
1. School of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China 2. State Ethnic Affairs Commission Key Laboratory of Graphic Image Intelligent Processing, North Minzu University, Yinchuan 750021, China
Abstract
The task of named entity recognition (NER) is to identify entities in the text and predict their categories. In real-life scenarios, the context of the text is often complex, and there may exist nested entities within an entity. This kind of entity is called a nested entity, and the task of recognizing entities with nested structures is referred to as nested named entity recognition. Most existing NER models can only handle flat entities, and there has been limited research progress in Chinese nested named entity recognition, resulting in relatively few models in this direction. General NER models have limited semantic extraction capabilities and cannot capture deep semantic information between nested entities in the text. To address these issues, this paper proposes a model that uses the GlobalPointer module to identify nested entities in the text and constructs the IDCNNLR semantic extraction module to extract deep semantic information. Furthermore, multiple-head self-attention mechanisms are incorporated into the model at multiple positions to achieve data denoising, enhancing the quality of semantic features. The proposed model considers each possible entity boundary through the GlobalPointer module, and the IDCNNLR semantic extraction module and multi-position attention mechanism are introduced to enhance the model’s semantic extraction capability. Experimental results demonstrate that the proposed model achieves F1 scores of 69.617% and 79.285% on the CMeEE Chinese nested entity recognition dataset and CLUENER2020 Chinese fine-grained entity recognition dataset, respectively. The model exhibits improvement compared to baseline models, and each innovation point shows effective performance enhancement in ablative experiments.
Funder
Ningxia Natural Science Foundation Project Basic Scientific Research in Central Universities of North Minzu University National Natural Science Foundation of China
Reference39 articles.
1. Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., and Dyer, C. (2016, January 12–17). Neural architectures for named entity recognition. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, CA, USA. 2. Dai, J., Feng, C., Bai, X., Dai, J., and Zhang, H. (2019, January 4–6). AERNs: Attention-based entity region networks for multi-grained named entity recognition. Proceedings of the 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI), Portland, OR, USA. 3. Geological entity recognition method based on Deep Belief Networks;Zhang;Acta Petrol. Sin.,2018 4. Wang, C., Shang, W., Huang, W., and Lin, W. (2021, January 28–30). BiLSTM-CRF with Compensation Method for Spatial Entity Recognition. Proceedings of the 2021 21st ACIS International Winter Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD-Winter), Ho Chi Minh City, Vietnam. 5. Phan, R., Luu, T.M., Davey, R., and Chetty, G. Biomedical named entity recognition based on hybrid multistage CNN-RNN learner. In Proceedings of the 2018 International Conference on Machine Learning and Data Engineering (iCMLDE), Sydney, Australia, 3–7 December 2018; IEEE: New York, NY, USA, 2018.
|
|