BIBC: A Chinese Named Entity Recognition Model for Diabetes Research-Reference-Cited by-同舟云学术

BIBC: A Chinese Named Entity Recognition Model for Diabetes Research

Published:2021-10-16 Issue:20 Volume:11 Page:9653
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Yang Lei,Fu Yufan,Dai Yu

Abstract

In the medical field, extracting medical entities from text by Named Entity Recognition (NER) has become one of the research hotspots. This thesis takes the chapter-level diabetes literature as the research object and uses a deep learning method to extract medical entities in the literature. Based on the deep and bidirectional transformer network structure, the pre-training language model BERT model can solve the problem of polysemous word representation, and supplement the features by large-scale unlabeled data, combined with BiLSTM-CRF model extracts of the long-distance features of sentences. On this basis, in view of the problem that the model cannot focus on the local information of the sentence, resulting in insufficient feature extraction, and considering the characteristics of Chinese data mainly in words, this thesis proposes a Named Entity Recognition method based on BIBC. This method combines Iterated Dilated CNN to enable the model to take into account global and local features at the same time, and uses the BERT-WWM model based on whole word masking to further extract semantic information from Chinese data. In the experiment of diabetic entity recognition in Ruijin Hospital, the accuracy rate, recall rate, and F1 score are improved to 79.58%, 80.21%, and 79.89%, which are better than the evaluation indexes of existing studies. It indicates that the method can extract the semantic information of diabetic text more accurately and obtain good entity recognition results, which can meet the requirements of practical applications.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/11/20/9653/pdf

Reference30 articles.

1. Message Understanding Conference-6

2. A Survey on Recent Advances in Named Entity Recognition from Deep Learning models;Yadav;arXiv,2019

3. Application of Pre-training Models in Named Entity Recognition;Wang;arXiv,2020

4. Improving Language Understanding by Generative Pre-Training. In Technical Report, OpenAIhttps://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A hybrid methodology for knowledge organization and application of Chinese civil aviation regulations from mission safety support perspective;Proceedings of the Institution of Mechanical Engineers, Part G: Journal of Aerospace Engineering;2023-10-05

2. An Easy Partition Approach for Joint Entity and Relation Extraction;Applied Sciences;2023-06-27

3. Task-Specific Transformer-Based Language Models in Medicine: A Survey (Preprint);2023-06-07

4. Named Entity Recognition of Diabetes Online Health Community Data Using Multiple Machine Learning Models;Bioengineering;2023-05-29

5. Prompt-Based Word-Level Information Injection BERT for Chinese Named Entity Recognition;Applied Sciences;2023-03-06