Synoptic Reporting by Summarizing Cancer Pathology Reports using Large Language Models

Author:

Rajaganapathy SivaramanORCID,Chowdhury Shaika,Buchner Vincent,He ZheORCID,Jiang Xiaoqian,Yang Ping,Cerhan James R.,Zong Nansu

Abstract

AbstractBackgroundSynoptic reporting, the documenting of clinical information in a structured manner, is known to improve patient care by reducing errors, increasing readability, interoperability, and report completeness. Despite its advantages, manually synthesizing synoptic reports from narrative reports is expensive and error prone when the number of structured fields are many. While the recent revolutionary developments in Large Language Models (LLMs) have significantly advanced natural language processing, their potential for innovations in medicine is yet to be fully evaluated.ObjectivesIn this study, we explore the strengths and challenges of utilizing the state-of-the-art language models in the automatic synthesis of synoptic reports.Materials and MethodsWe use a corpus of 7,774 cancer related, narrative pathology reports, which have annotated reference synoptic reports from Mayo Clinic EHR. Using these annotations as a reference, we reconfigure the state-of-the-art large language models, such as LLAMA-2, to generate the synoptic reports. Our annotated reference synoptic reports contain 22 unique data elements. To evaluate the accuracy of the reports generated by the LLMs, we use several metrics including the BERT F1 Score and verify our results by manual validation.ResultsWe show that using fine-tuned LLAMA-2 models, we can obtain BERT Score F1 of 0.86 or higher across all data elements and BERT F1 scores of 0.94 or higher on over 50% (11 of 22) of the questions. The BERT F1 scores translate to average accuracies of 76% and as high as 81% for short clinical reports.ConclusionsWe demonstrate successful automatic synoptic report generation by fine-tuning large language models.

Publisher

Cold Spring Harbor Laboratory

Reference41 articles.

1. Renshaw AA , Mena-Allauca M , Gould EW , et al. Synoptic Reporting: Evidence-Based Review and Future Directions. JCO Clinical Cancer Informatics. 2018;1–9.

2. Establishing Synoptic Cancer Pathology Reporting in Low- and Middle-Income Countries: A Nicaraguan Experience;JCO Glob Oncol,2022

3. Comparison of Accuracy and Speed of Information Identification by Nonpathologists in Synoptic Reports With Different Formats;Archives of Pathology & Laboratory Medicine,2016

4. College of American Pathologists Cancer Protocols: Optimizing Format for Accuracy and Efficiency;Arch Pathol Lab Med,2016

5. Advancement in the quality of operative documentation: A systematic review and meta-analysis of synoptic versus narrative operative reporting;The American Journal of Surgery,2019

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3