Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning

Published: 2021-08-31 Issue: 8 Volume: 9 Page: e23230
ISSN: 2291-9694
Container-title: JMIR Medical Informatics
Language: en
Short-container-title: JMIR Med Inform

Author:

Chen Pei-Fu, Wang Ssu-Ming, Liao Wei-Chih, Kuo Lu-Cheng, Chen Kuan-Chih, Lin Yu-Cheng, Yang Chi-Yu, Chiu Chi-Hao, Chang Shu-Chih, Lai Feipei

Abstract

Background: The International Classification of Diseases (ICD) code is widely used as a reference in medical systems and for billing purposes. However, classifying diseases into ICD codes still relies mainly on humans reading a large amount of written material as the basis for coding. Coding is both laborious and time-consuming. Since the conversion from ICD-9 to ICD-10, the coding task has become much more complicated, and deep learning– and natural language processing–related approaches have been studied to assist disease coders.

Objective: This paper aims to construct a deep learning model for ICD-10 coding that automatically determines the corresponding diagnosis and procedure codes based solely on free-text medical notes, in order to improve accuracy and reduce human effort.

Methods: We used diagnosis records from National Taiwan University Hospital as resources and applied natural language processing techniques, including global vectors (GloVe), word to vectors (Word2Vec), embeddings from language models (ELMo), bidirectional encoder representations from transformers (BERT), and single head attention recurrent neural network (SHA-RNN), to a deep neural network architecture to implement ICD-10 auto-coding. In addition, we introduced an attention mechanism into the classification model to extract keywords from diagnoses and to visualize the coding reference for training newcomers to ICD-10 coding. Sixty discharge notes were randomly selected to examine the change in the F1-score and in the coding time of coders before and after using our model.

Results: In experiments on the medical data set of National Taiwan University Hospital, our predictions achieved F1-scores of 0.715 and 0.618 for the ICD-10 Clinical Modification code and Procedure Coding System code, respectively, with a bidirectional encoder representations from transformers embedding approach in the gated recurrent unit classification model. The trained models were deployed in an ICD-10 web service for coding and for training ICD-10 users. With this service, the coders' F1-score increased significantly from a median of 0.832 to 0.922 (P<.05), but coding time was not reduced.

Conclusions: The proposed model significantly improved the F1-score but did not decrease the time consumed in coding by disease coders.
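The abstract reports coder performance as a median F1-score over the sampled discharge notes but does not state how the score was computed. As a minimal sketch, assuming a set-based F1 per note (the harmonic mean of precision and recall between the codes a coder assigned and the reference codes), the calculation might look like the following. The function name `note_f1`, the sample notes, and the ICD-10 codes in them are illustrative assumptions, not data from the study.

```python
from statistics import median


def note_f1(predicted, gold):
    """Set-based F1 for one note: harmonic mean of precision and recall."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    if true_positives == 0:
        # No overlap (or an empty code set): F1 is defined as 0 here.
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


# Hypothetical reference codes and coder-assigned codes for three notes.
gold_codes = [{"I10", "E11.9"}, {"J18.9"}, {"I10", "J18.9", "E11.9"}]
coder_codes = [{"I10"}, {"J18.9"}, {"I10", "E11.9", "N39.0"}]

# One F1-score per note, then the median across notes.
scores = [note_f1(p, g) for p, g in zip(coder_codes, gold_codes)]
median_f1 = median(scores)
print(scores, median_f1)
```

Computing the same per-note scores for a before and an after condition and comparing the two medians would mirror the kind of comparison the abstract describes, although the statistical test behind the reported P value is not specified there.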

Publisher

JMIR Publications Inc.

Subject

Health Information Management, Health Informatics

References: 22 articles.

1. The International Classification of Diseases, 10th Revision. World Health Organization. 2015. 2021-08-04. https://icd.who.int/browse10/2015/en

2. Handbook of Research on Informatics in Healthcare and Biomedicine

3. Automatic construction of rule-based ICD-9-CM coding systems

4. LEAP

5. MedSTS: a resource for clinical semantic textual similarity

Cited by 34 articles.

1. Using Enhanced Representations to Predict Medical Procedures from Clinician Notes;Applied Sciences;2024-07-24

2. Automating surgical procedure extraction for society of surgeons adult cardiac surgery registry using pretrained language models;JAMIA Open;2024-07-01

3. EXAMINATION OF SUMMARIZED MEDICAL RECORDS FOR ICD CODE CLASSIFICATION VIA BERT;Applied Computer Science;2024-06-30

4. A Unified Review of Deep Learning for Automated Medical Coding;ACM Computing Surveys;2024-05-17

5. Validating the Application of Clinical Department-specific Artificial Intelligence-assisted Coding using TwDRGs (Preprint);2024-04-30