Training a Deep Contextualized Language Model for International Classification of Diseases, 10th Revision Classification via Federated Learning: Model Development and Validation Study-Reference-Cited by-同舟云学术

Training a Deep Contextualized Language Model for International Classification of Diseases, 10th Revision Classification via Federated Learning: Model Development and Validation Study

Published:2022-11-10 Issue:11 Volume:10 Page:e41342
ISSN:2291-9694
Container-title:JMIR Medical Informatics
language:en
Short-container-title:JMIR Med Inform

Author:

Chen Pei-Fu^ORCID,He Tai-Liang^ORCID,Lin Sheng-Che^ORCID,Chu Yuan-Chia^ORCID,Kuo Chen-Tsung^ORCID,Lai Feipei^ORCID,Wang Ssu-Ming^ORCID,Zhu Wan-Xuan^ORCID,Chen Kuan-Chih^ORCID,Kuo Lu-Cheng^ORCID,Hung Fang-Ming^ORCID,Lin Yu-Cheng^ORCID,Tsai I-Chang^ORCID,Chiu Chi-Hao^ORCID,Chang Shu-Chih^ORCID,Yang Chi-Yu^ORCID

Abstract

Background The automatic coding of clinical text documents by using the International Classification of Diseases, 10th Revision (ICD-10) can be performed for statistical analyses and reimbursements. With the development of natural language processing models, new transformer architectures with attention mechanisms have outperformed previous models. Although multicenter training may increase a model’s performance and external validity, the privacy of clinical documents should be protected. We used federated learning to train a model with multicenter data, without sharing data per se. Objective This study aims to train a classification model via federated learning for ICD-10 multilabel classification. Methods Text data from discharge notes in electronic medical records were collected from the following three medical centers: Far Eastern Memorial Hospital, National Taiwan University Hospital, and Taipei Veterans General Hospital. After comparing the performance of different variants of bidirectional encoder representations from transformers (BERT), PubMedBERT was chosen for the word embeddings. With regard to preprocessing, the nonalphanumeric characters were retained because the model’s performance decreased after the removal of these characters. To explain the outputs of our model, we added a label attention mechanism to the model architecture. The model was trained with data from each of the three hospitals separately and via federated learning. The models trained via federated learning and the models trained with local data were compared on a testing set that was composed of data from the three hospitals. The micro F1 score was used to evaluate model performance across all 3 centers. Results The F1 scores of PubMedBERT, RoBERTa (Robustly Optimized BERT Pretraining Approach), ClinicalBERT, and BioBERT (BERT for Biomedical Text Mining) were 0.735, 0.692, 0.711, and 0.721, respectively. The F1 score of the model that retained nonalphanumeric characters was 0.8120, whereas the F1 score after removing these characters was 0.7875—a decrease of 0.0245 (3.11%). The F1 scores on the testing set were 0.6142, 0.4472, 0.5353, and 0.2522 for the federated learning, Far Eastern Memorial Hospital, National Taiwan University Hospital, and Taipei Veterans General Hospital models, respectively. The explainable predictions were displayed with highlighted input words via the label attention architecture. Conclusions Federated learning was used to train the ICD-10 classification model on multicenter clinical text while protecting data privacy. The model’s performance was better than that of models that were trained locally.

Publisher

JMIR Publications Inc.

Subject

Health Information Management,Health Informatics

Reference27 articles.

1. Impact of the Transition to ICD-10 on Medicare Inpatient Hospital Payments

2. A narrative review of the impact of the transition to ICD-10 and ICD-10-CM/PCS

3. Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality?;Computational and Structural Biotechnology Journal;2024-12

2. Large language models in physical therapy: time to adapt and adept;Frontiers in Public Health;2024-05-24

3. A Unified Review of Deep Learning for Automated Medical Coding;ACM Computing Surveys;2024-05-17

4. Assessing the Implications of Data Heterogeneity on Privacy-Enhanced Federated Learning: A Comprehensive Examination Using CIFAR-10;TENCON 2023 - 2023 IEEE Region 10 Conference (TENCON);2023-10-31

5. Road traffic death coding quality in the WHO Mortality Database;Bulletin of the World Health Organization;2023-10-01