Research on Named Entity Recognition Methods in Chinese Forest Disease Texts

Author:

Wang Qi,Su Xiyou

Abstract

Named entity recognition of forest diseases plays a key role in knowledge extraction in the field of forestry. The aim of this paper is to propose a named entity recognition method based on multi-feature embedding, a transformer encoder, a bi-gated recurrent unit (BiGRU), and conditional random fields (CRF). According to the characteristics of the forest disease corpus, several features are introduced here to improve the method’s accuracy. In this paper, we analyze the characteristics of forest disease texts; carry out pre-processing, labeling, and extraction of multiple features; and construct forest disease texts. In the input representation layer, the method integrates multi-features, such as characters, radicals, word boundaries, and parts of speech. Then, implicit features (e.g., sentence context features) are captured through the transformer’s encoding layer. The obtained features are transmitted to the BiGRU layer for further deep feature extraction. Finally, the CRF model is used to learn constraints and output the optimal annotation of disease names, damage sites, and drug entities in the forest disease texts. The experimental results on the self-built data set of forest disease texts show that the precision of the proposed method for entity recognition reached more than 93%, indicating that it can effectively solve the task of named entity recognition in forest disease texts.

Funder

This work was supported by the National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Reference25 articles.

1. A Survey of Chinese Named Entity Recognition;Zhao;J. Front. Comput. Sci. Technol.,2021

2. A Review on Named Entity Recognition;Liu;J. China Soc. Sci. Tech. Inf.,2018

3. Research on named entity recognition of Chinese electronic medical records based on multifeatured embedding and attention mechanism;Gong;Chin. J. Eng.,2021

4. Bridge Inspection Named Entity Recognition Based on Transformer-BiLSTM-CRF;Li;J. Chin. Inf. Process.,2021

5. Research on named entity recognition technology in military software testing;Han;J. Front. Comput. Sci. Technol.,2020

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Few-shot named entity recognition framework for forestry science metadata extraction;Journal of Ambient Intelligence and Humanized Computing;2024-02-01

2. Exploring the Value of Pre-trained Language Models for Clinical Named Entity Recognition;2023 IEEE International Conference on Big Data (BigData);2023-12-15

3. Named Entity Recognition in Fire Control Texts Based on BERT;2023 12th International Conference of Information and Communication Technology (ICTech);2023-04

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3