Affiliation:
1. The Academy of Digital China (Fujian), Fuzhou University, Fuzhou 350116, China
2. State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
Abstract
This paper proposes a novel model for named entity recognition of Chinese crop diseases and pests. The model is intended to solve the problems of uneven entity distribution, incomplete recognition of complex terms, and unclear entity boundaries. First, a robustly optimized BERT pre-training approach-whole word masking (RoBERTa-wwm) model is used to extract diseases and pests’ text semantics, acquiring dynamic word vectors to solve the problem of incomplete word recognition. Adversarial training is then introduced to address unclear boundaries of diseases and pest entities and to improve the generalization ability of models in an effective manner. The context features are obtained by the bi-directional gated recurrent unit (BiGRU) neural network. Finally, the optimal tag sequence is obtained by conditional random fields (CRF) decoding. A focal loss function is introduced to optimize conditional random fields (CRF) and thus solve the problem of unbalanced label classification in the sequence. The experimental results show that the model’s precision, recall, and F1 values on the crop diseases and pests corpus reached 89.23%, 90.90%, and 90.04%, respectively, demonstrating effectiveness at improving the accuracy of named entity recognition for Chinese crop diseases and pests. The named entity recognition model proposed in this study can provide a high-quality technical basis for downstream tasks such as crop diseases and pests knowledge graphs and question-answering systems.
Funder
the National Key Research and Development Project
Subject
Agronomy and Crop Science
Reference65 articles.
1. Biology and Control of the Khapra Beetle, Trogoderma granarium, a Major Quarantine Threat to Global Food Security;Athanassiou;Annu. Rev. Entomol.,2019
2. Zhao, J.S. (2022). Construction and Application of Knowledge Map of Crop Diseases and Pests Based on ALBERT. [Master’s Thesis, Anhui Agricultural University].
3. The Future of Digital Agriculture: Technologies and Opportunities;Fountas;IT Prof.,2020
4. Overview of Chinese Named Entity Recognition;Zhao;Comput. Sci. Explor.,2022
5. A survey of the applications of text mining for agriculture;Drury;Comput. Electron. Agric.,2019
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献