Development and Testing of a Novel Large Language Model-Based Clinical Decision Support Systems for Medication Safety in 12 Clinical Specialties

Author:

Ting Daniel Shu Wei1,Ong Jasmine Chiat Ling2ORCID,Jin Liyuan3,Kabilan Elangovan4,Lim Gilbert Yong San4,Lim Daniel Yan Zheng2,Sng Gerald Gui Ren2,Ke Yuhe2,Tung Joshua Yi Min2,Zhong Ryan Jian2,Koh Christopher Ming Yao2,Lee Keane Zhi Hao2,Chen Xiang2,Ch'ng Jack Kian2,Aung Than2,Goh Ken Junyang2

Affiliation:

1. Singapore National Eye Centre

2. Singapore General Hospital

3. Duke-NUS Medical School

4. Singapore National Eye Centre, Singapore Eye Research Institute

Abstract

Abstract

Importance: We introduce a novel Retrieval Augmented Generation (RAG)-Large Language Model (LLM) framework as a Clinical Decision Support Systems (CDSS) to support safe medication prescription, a critical aspect of patient safety. This overcomes existing challenges of irrelevancy of alerts in rules-based CDSS in provision of prescribing error alerts that is relevant to the patient’s context and institutional medication use guides. Objective: To evaluate the efficacy of LLM-based CDSS in correctly identifying medication errors in different patient case vignettes from diverse medical and surgical sub-disciplines, against a human expert panel derived ground truth. We compared performance for under 2 different CDSS practical healthcare integration modalities: LLM-based CDSS alone (fully autonomous mode) vs junior pharmacist + LLM-based CDSS (co-pilot, assistive mode). Design, Setting, and Participants: Utilizing a RAG model with state-of-the-art medically-related LLMs (GPT-4, Gemini Pro 1.0 and Med-PaLM 2), this study used 61 prescribing error scenarios embedded into 23 complex clinical vignettes across 12 different medical and surgical specialties. A multidisciplinary expert panel assessed these cases for Drug-Related Problems (DRPs) using the PCNE classification and graded severity / potential for harm using revised NCC MERP medication error index. We compared. Main Outcomes and Measures: This study compares the performance of an LLM-based CDSS in identifying DRPs. Key metrics include accuracy, precision, recall, and F1 scores. We also compare the performance of LLM-CDSS alone and junior hospital pharmacists (less than 2 years post licensure) + LLM-CDSS (co-pilot, assistive mode) in the provision of recommendations to clinicians. In addition, we present comparative results from different LLMs: GPT-4, Gemini Pro 1.0 and Med-PaLM 2. Results RAG-LLM performed better compared to LLM alone. When employed in a co-pilot mode, accuracy, recall, and F1 scores were optimized, indicating effectiveness in identifying moderate to severe DRPs. The accuracy of DRP detection with RAG-LLM improved in several categories but at the expense of lower precision. Conclusions This study established that a RAG-LLM based CDSS significantly boosts the accuracy of medication error identification when used alongside junior pharmacists (co-pilot), with notable improvements in detecting severe DRPs. This study also illuminates the comparative performance of current state-of-the-art LLMs in RAG-based CDSS systems.

Publisher

Research Square Platform LLC

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3