Abstract
Named entity recognition of forest diseases plays a key role in knowledge extraction in the field of forestry. The aim of this paper is to propose a named entity recognition method based on multi-feature embedding, a transformer encoder, a bi-gated recurrent unit (BiGRU), and conditional random fields (CRF). According to the characteristics of the forest disease corpus, several features are introduced here to improve the method’s accuracy. In this paper, we analyze the characteristics of forest disease texts; carry out pre-processing, labeling, and extraction of multiple features; and construct forest disease texts. In the input representation layer, the method integrates multi-features, such as characters, radicals, word boundaries, and parts of speech. Then, implicit features (e.g., sentence context features) are captured through the transformer’s encoding layer. The obtained features are transmitted to the BiGRU layer for further deep feature extraction. Finally, the CRF model is used to learn constraints and output the optimal annotation of disease names, damage sites, and drug entities in the forest disease texts. The experimental results on the self-built data set of forest disease texts show that the precision of the proposed method for entity recognition reached more than 93%, indicating that it can effectively solve the task of named entity recognition in forest disease texts.
Funder
This work was supported by the National Natural Science Foundation of China
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference25 articles.
1. A Survey of Chinese Named Entity Recognition;Zhao;J. Front. Comput. Sci. Technol.,2021
2. A Review on Named Entity Recognition;Liu;J. China Soc. Sci. Tech. Inf.,2018
3. Research on named entity recognition of Chinese electronic medical records based on multifeatured embedding and attention mechanism;Gong;Chin. J. Eng.,2021
4. Bridge Inspection Named Entity Recognition Based on Transformer-BiLSTM-CRF;Li;J. Chin. Inf. Process.,2021
5. Research on named entity recognition technology in military software testing;Han;J. Front. Comput. Sci. Technol.,2020
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Few-shot named entity recognition framework for forestry science metadata extraction;Journal of Ambient Intelligence and Humanized Computing;2024-02-01
2. Exploring the Value of Pre-trained Language Models for Clinical Named Entity Recognition;2023 IEEE International Conference on Big Data (BigData);2023-12-15
3. Named Entity Recognition in Fire Control Texts Based on BERT;2023 12th International Conference of Information and Communication Technology (ICTech);2023-04