Using the Natural Language Processing System Medical Named Entity Recognition-Japanese to Analyze Pharmaceutical Care Records: Natural Language Processing Analysis (Preprint)-Reference-Cited by-同舟云学术

Using the Natural Language Processing System Medical Named Entity Recognition-Japanese to Analyze Pharmaceutical Care Records: Natural Language Processing Analysis (Preprint)

Published:2023-12-26 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Ohno Yukiko^ORCID,Kato Riri^ORCID,Ishikawa Haruki^ORCID,Nishiyama Tomohiro^ORCID,Isawa Minae^ORCID,Mochizuki Mayumi^ORCID,Aramaki Eiji^ORCID,Aomori Tohru^ORCID

Abstract

BACKGROUND

Large language models have propelled recent advances in artificial intelligence technology, facilitating the extraction of medical information from unstructured data such as medical records. Although named entity recognition (NER) is used to extract data from physicians’ records, it has yet to be widely applied to pharmaceutical care records.

OBJECTIVE

In this study, we aimed to investigate the feasibility of automatic extraction of the information regarding patients’ diseases and symptoms from pharmaceutical care records. The verification was performed using Medical Named Entity Recognition-Japanese (MedNER-J), a Japanese disease-extraction system designed for physicians’ records.

METHODS

MedNER-J was applied to subjective, objective, assessment, and plan data from the care records of 49 patients who received cefazolin sodium injection at Keio University Hospital between April 2018 and March 2019. The performance of MedNER-J was evaluated in terms of precision, recall, and F1-score.

RESULTS

The F1-scores of NER for subjective, objective, assessment, and plan data were 0.46, 0.70, 0.76, and 0.35, respectively. In NER and positive-negative classification, the F1-scores were 0.28, 0.39, 0.64, and 0.077, respectively. The F1-scores of NER for objective (0.70) and assessment data (0.76) were higher than those for subjective and plan data, which supported the superiority of NER performance for objective and assessment data. This might be because objective and assessment data contained many technical terms, similar to the training data for MedNER-J. Meanwhile, the F1-score of NER and positive-negative classification was high for assessment data alone (F1-score=0.64), which was attributed to the similarity of its description format and contents to those of the training data.

CONCLUSIONS

MedNER-J successfully read pharmaceutical care records and showed the best performance for assessment data. However, challenges remain in analyzing records other than assessment data. Therefore, it will be necessary to reinforce the training data for subjective data in order to apply the system to pharmaceutical care records.

Publisher

JMIR Publications Inc.

Reference10 articles.

1. Natural Language Processing: from Bedside to Everywhere

2. A systematic review of natural language processing and text mining of symptoms from electronic patient-authored text data

3. Preliminary development of a deep learning-based automated primary headache diagnosis model using Japanese natural language processing of medical questionnaire

4. Using Natural Language Processing Techniques to Detect Adverse Events From Progress Notes Due to Chemotherapy

5. Extraction and Standardization of Patient Complaints from Electronic Medication Histories for Pharmacovigilance: Natural Language Processing Analysis in Japanese