Fine-Tuned Large Language Model for Extracting Patients on Pretreatment for Lung Cancer from a Picture Archiving and Communication System Based on Radiological Reports

Published: 2024-07-02
ISSN: 2948-2933
Container-title: Journal of Imaging Informatics in Medicine
Language: en
Short-container-title: J Digit Imaging. Inform. med.

Author:

Yasaka Koichiro, Kanzawa Jun, Kanemaru Noriko, Koshino Saori, Abe Osamu

Abstract

This study aimed to investigate the performance of a fine-tuned large language model (LLM) in extracting patients on pretreatment for lung cancer from picture archiving and communication systems (PACS) and to compare it with that of radiologists. Patients whose radiological reports contained the term "lung cancer" (3111 for training, 124 for validation, and 288 for testing) were included in this retrospective study. Based on the clinical indication and diagnosis sections of the radiological report (used as input data), they were classified into four groups (used as reference data): group 0 (no lung cancer), group 1 (pretreatment lung cancer present), group 2 (after treatment for lung cancer), and group 3 (planning radiation therapy). Using the training and validation datasets, fine-tuning of the pretrained LLM was conducted ten times. Because of group imbalance, group 2 data were undersampled during training. The model that performed best on the validation dataset was assessed on the independent test dataset. For testing purposes, two other radiologists (readers 1 and 2) also classified the radiological reports. The overall accuracy of the fine-tuned LLM, reader 1, and reader 2 was 0.983, 0.969, and 0.969, respectively. The sensitivity for differentiating groups 0/1/2/3 was 1.000/0.948/0.991/1.000 for the LLM, 0.750/0.879/0.996/1.000 for reader 1, and 1.000/0.931/0.978/1.000 for reader 2. The time required for classification by the LLM, reader 1, and reader 2 was 46 s, 2539 s, and 1538 s, respectively. The fine-tuned LLM effectively extracted patients on pretreatment for lung cancer from PACS, with performance comparable to that of radiologists and in a shorter time.
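The abstract reports three computations without showing them: undersampling of the majority group, overall accuracy, and per-group sensitivity. The following is a minimal sketch of how such quantities could be computed, not the authors' code. The function names (`undersample`, `overall_accuracy`, `per_group_sensitivity`), the `keep` and `seed` parameters, and the toy labels are all assumptions made for illustration; only the four group definitions come from the abstract.

```python
import random
from collections import Counter

# Group codes as defined in the abstract.
GROUPS = (0, 1, 2, 3)  # 0: no lung cancer, 1: pretreatment, 2: after treatment, 3: planning RT


def undersample(reports, labels, group, keep, seed=0):
    """Randomly keep at most `keep` examples of `group`; other groups are untouched.

    Hypothetical helper: the abstract only states that group 2 was undersampled,
    not how many reports were kept or how they were chosen.
    """
    rng = random.Random(seed)
    in_group = [i for i, y in enumerate(labels) if y == group]
    kept = set(rng.sample(in_group, min(keep, len(in_group))))
    selected = [i for i, y in enumerate(labels) if y != group or i in kept]
    return [reports[i] for i in selected], [labels[i] for i in selected]


def overall_accuracy(y_true, y_pred):
    """Fraction of reports whose predicted group equals the reference group."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_group_sensitivity(y_true, y_pred, groups=GROUPS):
    """Sensitivity per group: correctly classified reports of a group / all reports of that group."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {g: hits[g] / totals[g] for g in groups if totals[g]}


# Toy data invented for this example (not study data).
y_true = [0, 1, 1, 2, 2, 2, 2, 3]
y_pred = [0, 1, 2, 2, 2, 2, 2, 3]
toy_reports = [f"report {i}" for i in range(len(y_true))]

balanced_reports, balanced_labels = undersample(toy_reports, y_true, group=2, keep=2)
accuracy = overall_accuracy(y_true, y_pred)             # 7 of 8 correct -> 0.875
sensitivity = per_group_sensitivity(y_true, y_pred)     # {0: 1.0, 1: 0.5, 2: 1.0, 3: 1.0}
```

In the toy data, one group 1 report is misclassified as group 2, so overall accuracy stays high (0.875) while the group 1 sensitivity drops to 0.5. This is why the abstract reports sensitivity per group alongside overall accuracy: with imbalanced groups, accuracy alone can hide weak performance on the group of interest.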

Funder

The University of Tokyo

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s10278-024-01186-8.pdf

Cited by 1 article.

1. The Fine-Tuned Large Language Model for Extracting the Progressive Bone Metastasis from Unstructured Radiology Reports. Journal of Imaging Informatics in Medicine. 2024-08-26.