Chinese-Named Entity Recognition From Adverse Drug Event Records: Radical Embedding-Combined Dynamic Embedding–Based BERT in a Bidirectional Long Short-term Conditional Random Field (Bi-LSTM-CRF) Model

Published: 2021-12-01 Issue: 12 Volume: 9 Page: e26407
ISSN: 2291-9694
Container-title: JMIR Medical Informatics
Language: en
Short-container-title: JMIR Med Inform

Author:

Wu Hong, Ji Jiatong, Tian Haimei, Chen Yao, Ge Weihong, Zhang Haixia, Yu Feng, Zou Jianjun, Nakamura Mitsuhiro, Liao Jun

Abstract

Background: With the increasing variety of drugs, the incidence of adverse drug events (ADEs) is increasing year by year. Massive numbers of ADEs are recorded in electronic medical records and adverse drug reaction (ADR) reports, which are important sources of potential ADR information. Meanwhile, it is essential to make latent ADR information automatically available for better postmarketing drug safety reevaluation and pharmacovigilance.

Objective: This study describes how to identify ADR-related information from Chinese ADE reports.

Methods: Our study established an efficient automated tool, named BBC-Radical. BBC-Radical is a model that consists of 3 components: Bidirectional Encoder Representations from Transformers (BERT), bidirectional long short-term memory (bi-LSTM), and conditional random field (CRF). The model identifies ADR-related information from Chinese ADR reports. Token features and radical features of Chinese characters were used to represent the common meaning of a group of words. BERT and Bi-LSTM-CRF were novel models that combined these features to conduct named entity recognition (NER) tasks in the free-text section of 24,890 ADR reports from the Jiangsu Province Adverse Drug Reaction Monitoring Center from 2010 to 2016. Moreover, the man-machine comparison experiment on the ADE records from Drum Tower Hospital was designed to compare the NER performance between the BBC-Radical model and a manual method.

Results: The NER model achieved relatively high performance, with a precision of 96.4%, recall of 96.0%, and F1 score of 96.2%. In the man-machine comparison, the BBC-Radical model (precision 87.2%, recall 85.7%, and F1 score 86.4%) performed much better than the manual method (precision 86.1%, recall 73.8%, and F1 score 79.5%) in the recognition task of each kind of entity.

Conclusions: The proposed model was competitive in extracting ADR-related information from ADE reports, and the results suggest that the application of our method to extract ADR-related information is of great significance in improving the quality of ADR reports and postmarketing drug safety evaluation.
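
The F1 scores quoted in the abstract can be checked against the stated precision and recall values, since F1 is the harmonic mean of the two. The sketch below is only an illustrative check, not code from the paper; the function name `f1_score` and the dictionary labels are assumptions chosen for this example.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both given in percent)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall) pairs as reported in the abstract, in percent.
# The labels are illustrative, not taken from the paper.
reported = {
    "NER model": (96.4, 96.0),
    "BBC-Radical, man-machine comparison": (87.2, 85.7),
    "Manual method": (86.1, 73.8),
}

for name, (p, r) in reported.items():
    print(f"{name}: F1 = {f1_score(p, r):.1f}%")
```

Rounded to one decimal place, the three results are 96.2%, 86.4%, and 79.5%, which agree with the figures in the abstract.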

Publisher

JMIR Publications Inc.

Subject

Health Information Management,Health Informatics

References: 33 articles.

1. Ontological Organization and Bioinformatic Analysis of Adverse Drug Reactions From Package Inserts: Development and Usability Study

2. WHO Strategy for Collecting Safety Data in Public Health Programmes: Complementing Spontaneous Reporting Systems

3. Adverse drug reactions: definitions, diagnosis, and management

4. Impact of Medicine Withdrawal on Reporting of Adverse Events Involving Therapeutic Alternatives: A Study from the French Spontaneous Reporting Database

5. A signal for an abuse liability for pregabalin—results from the Swedish spontaneous adverse drug reaction reporting system

Cited by 11 articles.

1. Assessing domain adaptation in adverse drug event extraction on real-world breast cancer records;International Journal of Medical Informatics;2024-11

2. Automated System to Capture Patient Symptoms from Multi-type Japanese Clinical Texts: Natural Language Processing Approach (Preprint);JMIR Medical Informatics;2024-03-29

3. Automated System to Capture Patient Symptoms from Multimodal Texts: Natural Language Processing Approach (Preprint);2024-03-29

4. exKidneyBERT: a language model for kidney transplant pathology reports and the crucial role of extended vocabularies;PeerJ Computer Science;2024-02-28

5. A Deep Learning Model for the Normalization of Institution Names by Multisource Literature Feature Fusion: Algorithm Development Study;JMIR Formative Research;2023-08-18