Exploring the performance and explainability of fine-tuned BERT models for neuroradiology protocol assignment-Reference-Cited by-同舟云学术

Exploring the performance and explainability of fine-tuned BERT models for neuroradiology protocol assignment

Published:2024-02-07 Issue:1 Volume:24 Page:
ISSN:1472-6947
Container-title:BMC Medical Informatics and Decision Making
language:en
Short-container-title:BMC Med Inform Decis Mak

Author:

Talebi Salmonn,Tong Elizabeth,Li Anna,Yamin Ghiam,Zaharchuk Greg,Mofrad Mohammad R. K.

Abstract

Abstract Background Deep learning has demonstrated significant advancements across various domains. However, its implementation in specialized areas, such as medical settings, remains approached with caution. In these high-stake environments, understanding the model's decision-making process is critical. This study assesses the performance of different pretrained Bidirectional Encoder Representations from Transformers (BERT) models and delves into understanding its decision-making within the context of medical image protocol assignment. Methods Four different pre-trained BERT models (BERT, BioBERT, ClinicalBERT, RoBERTa) were fine-tuned for the medical image protocol classification task. Word importance was measured by attributing the classification output to every word using a gradient-based method. Subsequently, a trained radiologist reviewed the resulting word importance scores to assess the model’s decision-making process relative to human reasoning. Results The BERT model came close to human performance on our test set. The BERT model successfully identified relevant words indicative of the target protocol. Analysis of important words in misclassifications revealed potential systematic errors in the model. Conclusions The BERT model shows promise in medical image protocol assignment by reaching near human level performance and identifying key words effectively. The detection of systematic errors paves the way for further refinements to enhance its safety and utility in clinical settings.

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1186/s12911-024-02444-z.pdf

Reference38 articles.

1. Shen D, Wu G, Suk H-I. Deep learning in medical image analysis. Annual review of biomedical engineering. 2017;19:221.

2. Miotto R, Wang F, Wang S, Jiang X, Dudley JT. Deep learning for healthcare: review, opportunities and challenges. Briefings in bioinformatics. 2018;19(6):1236–46.

3. Madani A, Ong JR, Tibrewal A, Mofrad MR. Deep echocardiography: data-efficient supervised and semi- supervised deep learning towards automated diagnosis of cardiac disease. NPJ digital medicine. 2018;1(1):1–11.

4. Yoojoong Kim, et al. “Predicting medical specialty from text based on a domain-specific pre-trained BERT.” Int J Med Inform. 2023;170:104956.

5. Turchin Alexander, Masharsky Stanislav, Zitnik Marinka. Comparison of BERT implementations for natural language processing of narrative medical documents. Informatics in Medicine Unlocked. 2023;36: 101139.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The Fine-Tuned Large Language Model for Extracting the Progressive Bone Metastasis from Unstructured Radiology Reports;Journal of Imaging Informatics in Medicine;2024-08-26

2. Adaption BERT for Medical Information Processing with ChatGPT and Contrastive Learning;Electronics;2024-06-21

3. Toward Clinical Generative AI: Conceptual Framework;JMIR AI;2024-06-07

4. Evaluation of a BERT Natural Language Processing Model for Automating CT and MRI Triage and Protocol Selection;Canadian Association of Radiologists Journal;2024-06-04