Extracting Drug Names and Associated Attributes From Discharge Summaries: Text Mining Study-Reference-Cited by-同舟云学术

Extracting Drug Names and Associated Attributes From Discharge Summaries: Text Mining Study

Published:2021-05-05 Issue:5 Volume:9 Page:e24678
ISSN:2291-9694
Container-title:JMIR Medical Informatics
language:en
Short-container-title:JMIR Med Inform

Author:

Alfattni Ghada^ORCID,Belousov Maksim^ORCID,Peek Niels^ORCID,Nenadic Goran^ORCID

Abstract

Background Drug prescriptions are often recorded in free-text clinical narratives; making this information available in a structured form is important to support many health-related tasks. Although several natural language processing (NLP) methods have been proposed to extract such information, many challenges remain. Objective This study evaluates the feasibility of using NLP and deep learning approaches for extracting and linking drug names and associated attributes identified in clinical free-text notes and presents an extensive error analysis of different methods. This study initiated with the participation in the 2018 National NLP Clinical Challenges (n2c2) shared task on adverse drug events and medication extraction. Methods The proposed system (DrugEx) consists of a named entity recognizer (NER) to identify drugs and associated attributes and a relation extraction (RE) method to identify the relations between them. For NER, we explored deep learning-based approaches (ie, bidirectional long-short term memory with conditional random fields [BiLSTM-CRFs]) with various embeddings (ie, word embedding, character embedding [CE], and semantic-feature embedding) to investigate how different embeddings influence the performance. A rule-based method was implemented for RE and compared with a context-aware long-short term memory (LSTM) model. The methods were trained and evaluated using the 2018 n2c2 shared task data. Results The experiments showed that the best model (BiLSTM-CRFs with pretrained word embeddings [PWE] and CE) achieved lenient micro F-scores of 0.921 for NER, 0.927 for RE, and 0.855 for the end-to-end system. NER, which relies on the pretrained word and semantic embeddings, performed better on most individual entity types, but NER with PWE and CE had the highest classification efficiency among the proposed approaches. Extracting relations using the rule-based method achieved higher accuracy than the context-aware LSTM for most relations. Interestingly, the LSTM model performed notably better in the reason-drug relations, the most challenging relation type. Conclusions The proposed end-to-end system achieved encouraging results and demonstrated the feasibility of using deep learning methods to extract medication information from free-text data.

Publisher

JMIR Publications Inc.

Subject

Health Information Management,Health Informatics

Reference59 articles.

1. Combining structured and unstructured data to identify a cohort of ICU patients who received dialysis

2. KarystianisGExtraction and representation of key characteristics from epidemiological literatureThe University of Manchester20142021-03-31https://tinyurl.com/bv927sfthttps://tinyurl.com/645sksnd

3. Extracting structured information from free-text medication prescriptions using dependencies

4. MedXN: an open source medication extraction and normalization tool for clinical text

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Utilising NLP for Enhanced Clinical Text Mining;2024 3rd International Conference on Applied Artificial Intelligence and Computing (ICAAIC);2024-06-05

2. Exploring Biomedical Named Entity Recognition via SciSpaCy and BioBERT Models;The Open Biomedical Engineering Journal;2024-06-05

3. Sample Size Considerations for Fine-Tuning Large Language Models for Named Entity Recognition Tasks: Methodological Study;JMIR AI;2024-05-16

4. Extracting adverse drug events from clinical Notes: A systematic review of approaches used;Journal of Biomedical Informatics;2024-03

5. Sequence-Model-Based Medication Extraction from Clinical Narratives in German;Lecture Notes in Computer Science;2024