A Hybrid Model for Named Entity Recognition on Chinese Electronic Medical Records-Reference-Cited by-同舟云学术

A Hybrid Model for Named Entity Recognition on Chinese Electronic Medical Records

Published:2021-04-23 Issue:2 Volume:20 Page:1-12
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Wang Yu¹,Sun Yining¹,Ma Zuchang²,Gao Lisheng²,Xu Yang³

Affiliation:

1. Institute of Intelligent Machines and University of Science and Technology of China, Hefei City, Anhui Province, China

2. Institute of Intelligent Machines

3. Institute of Intelligent Machines, Anhui Province, China

Abstract

Electronic medical records (EMRs) contain valuable information about the patients, such as clinical symptoms, diagnostic results, and medications. Named entity recognition (NER) aims to recognize entities from unstructured text, which is the initial step toward the semantic understanding of the EMRs. Extracting medical information from Chinese EMRs could be a more complicated task because of the difference between English and Chinese. Some researchers have noticed the importance of Chinese NER and used the recurrent neural network or convolutional neural network (CNN) to deal with this task. However, it is interesting to know whether the performance could be improved if the advantages of the RNN and CNN can be both utilized. Moreover, RoBERTa-WWM, as a pre-training model, can generate the embeddings with word-level features, which is more suitable for Chinese NER compared with Word2Vec. In this article, we propose a hybrid model. This model first obtains the entities identified by bidirectional long short-term memory and CNN, respectively, and then uses two hybrid strategies to output the final results relying on these entities. We also conduct experiments on raw medical records from real hospitals. This dataset is provided by the China Conference on Knowledge Graph and Semantic Computing in 2019 (CCKS 2019). Results demonstrate that the hybrid model can improve performance significantly.

Funder

Major Special Project of Anhui Science and Technology Department

Science and Technology Service Network Initiative

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3436819

Reference39 articles.

1. A Hybrid Method to Extract Clinical Information From Chinese Electronic Medical Records

2. Adversarial training based lattice LSTM for Chinese clinical named entity recognition;Zhao Shan;Journal of Biomedical Informatics,2019

3. An Ontology-Based Interpretable Fuzzy Decision Support System for Diabetes Diagnosis

4. GrEDeL: A Knowledge Graph Embedding Based Method for Drug Discovery From Biomedical Literatures

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Electronic Medical Record Data Mining and Processing Based on Natural Language Processing;2024 International Conference on Machine Intelligence and Digital Applications;2024-05-30

2. Think More Ambiguity Less: A Novel Dual Interactive Model with Local and Global Semantics for Chinese Named Entity Recognition;ACM Transactions on Asian and Low-Resource Language Information Processing;2023-06-17

3. A Transformer-Based Longer Entity Attention Model for Chinese Named Entity Recognition in Aerospace;2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE);2022-04