Affiliation:
1. College of Electronic Information, Qingdao University, Qingdao, Shandong 266071, China
2. Qingdao Lanzhi Modern Service Industry Digital Engineering Technology Research Center, Qingdao, Shandong 266071, China
Abstract
Medical text data records detailed clinical data; named entity recognition is the basis of text information processing and an important part of mining valuable information in medical texts. The named entity recognition technology can accurately identify the information needed in medical texts and help medical staff make clinical decision-making, evidence-based medicine, and epidemic disease monitoring. This paper proposes a hybrid neural network medical text named entity recognition model. First, a coding method based on a fully self-attentive mechanism is proposed. The vector representation of each word is related to the entire sentence through the attention mechanism. It determines the weight distribution by scoring the characters or words in all positions and obtains the position information in the sentence that needs the most attention. The encoding vector at each position is integrated with the context information of full sentence, which solves the ambiguity problem. Second, a multivariate convolutional decoding method is proposed. This method can effectively pay attention to the characteristics of medical text named entity recognition in the decoding process. It uses two-dimensional convolutional decoding to associate the current position word with surrounding words to improve decoding efficiency while extracting features from the logic of the preceding and following words. Using the same number of convolution kernels as the entity category, it can effectively extract effective features from the label dimension. Besides, according to the characteristics of the named entity recognition task, a special mixed loss is designed. The experimental results verify that the proposed method is effective, and it is improved compared with some existing medical text named entity recognition methods.
Subject
Health Informatics,Biomedical Engineering,Surgery,Biotechnology
Reference35 articles.
1. Transformers: state-of-the-art natural language processing;T. Wolf
2. Huggingface’s transformers: state-of-the-art natural language processin. Huggingface’s transformers: state-of-the-art natural language processing;T. Wolf,2019
3. Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing
4. ECNN: evaluating a cluster-neural network model for city innovation capability
5. GluonCV and GluonNLP: deep learning in computer vision and natural language processing;J. Guo;Journal of Machine Learning Research,2020
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献