Use of Natural Language Processing to Infer Sites of Metastatic Disease From Radiology Reports at Scale

Author:

Tay See Boon12ORCID,Low Guat Hwa13ORCID,Wong Gillian Jing En2,Tey Han Jieh13,Leong Fun Loon13ORCID,Li Constance3ORCID,Chua Melvin Lee Kiang345ORCID,Tan Daniel Shao Weng146ORCID,Thng Choon Hua47,Tan Iain Bee Huat134,Tan Ryan Shea Ying Cong1348ORCID

Affiliation:

1. Division of Medical Oncology, National Cancer Centre Singapore, Singapore, Singapore

2. NUS Yong Loo Lin School of Medicine, Singapore, Singapore

3. Data and Computational Science Core, National Cancer Centre Singapore, Singapore, Singapore

4. Singapore Duke-NUS Medical School, Singapore, Singapore

5. Division of Radiation Oncology, National Cancer Centre Singapore, Singapore, Singapore

6. Division of Clinical Trials and Epidemiological Sciences, National Cancer Centre Singapore, Singapore, Singapore

7. Division of Oncologic Imaging, National Cancer Centre Singapore, Singapore, Singapore

8. Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY

Abstract

PURPOSE To evaluate natural language processing (NLP) methods to infer metastatic sites from radiology reports. METHODS A set of 4,522 computed tomography (CT) reports of 550 patients with 14 types of cancer was used to fine-tune four clinical large language models (LLMs) for multilabel classification of metastatic sites. We also developed an NLP information extraction (IE) system (on the basis of named entity recognition, assertion status detection, and relation extraction) for comparison. Model performances were measured by F1 scores on test and three external validation sets. The best model was used to facilitate analysis of metastatic frequencies in a cohort study of 6,555 patients with 53,838 CT reports. RESULTS The RadBERT, BioBERT, GatorTron-base, and GatorTron-medium LLMs achieved F1 scores of 0.84, 0.87, 0.89, and 0.91, respectively, on the test set. The IE system performed best, achieving an F1 score of 0.93. F1 scores of the IE system by individual cancer type ranged from 0.89 to 0.96. The IE system attained F1 scores of 0.89, 0.83, and 0.81, respectively, on external validation sets including additional cancer types, positron emission tomography-CT ,and magnetic resonance imaging scans, respectively. In our cohort study, we found that for colorectal cancer, liver-only metastases were higher in de novo stage IV versus recurrent patients (29.7% v 12.2%; P < .001). Conversely, lung-only metastases were more frequent in recurrent versus de novo stage IV patients (17.2% v 7.3%; P < .001). CONCLUSION We developed an IE system that accurately infers metastatic sites in multiple primary cancers from radiology reports. It has explainable methods and performs better than some clinical LLMs. The inferred metastatic phenotypes could enhance cancer research databases and clinical trial matching, and identify potential patients for oligometastatic interventions.

Publisher

American Society of Clinical Oncology (ASCO)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3