Algorithmic identification of treatment-emergent adverse events from clinical notes using large language models: a pilot study in inflammatory bowel disease-Reference-Cited by-同舟云学术

Algorithmic identification of treatment-emergent adverse events from clinical notes using large language models: a pilot study in inflammatory bowel disease

Published:2023-09-08 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Silverman Anna L,Sushil Madhumita,Bhasuran Balu,Ludwig Dana,Buchanan James,Racz Rebecca,Parakala Mahalakshmi,El-Kamary Samer,Ahima Ohenewaa,Belov Artur,Choi Lauren,Billings Monisha,Li Yan,Habal Nadia,Liu Qi,Tiwari Jawahar,Butte Atul J,Rudrapatna Vivek A

Abstract

AbstractBackground and AimsOutpatient clinical notes are a rich source of information regarding drug safety. However, data in these notes are currently underutilized for pharmacovigilance due to methodological limitations in text mining. Large language models (LLM) like BERT have shown progress in a range of natural language processing tasks but have not yet been evaluated on adverse event detection.MethodsWe adapted a new clinical LLM, UCSF BERT, to identify serious adverse events (SAEs) occurring after treatment with a non-steroid immunosuppressant for inflammatory bowel disease (IBD). We compared this model to other language models that have previously been applied to AE detection.ResultsWe annotated 928 outpatient IBD notes corresponding to 928 individual IBD patients for all SAE-associated hospitalizations occurring after treatment with a non-steroid immunosuppressant. These notes contained 703 SAEs in total, the most common of which was failure of intended efficacy. Out of 8 candidate models, UCSF BERT achieved the highest numerical performance on identifying drug-SAE pairs from this corpus (accuracy 88-92%, macro F1 61-68%), with 5-10% greater accuracy than previously published models. UCSF BERT was significantly superior at identifying hospitalization events emergent to medication use (p < 0.01).ConclusionsLLMs like UCSF BERT achieve numerically superior accuracy on the challenging task of SAE detection from clinical notes compared to prior methods. Future work is needed to adapt this methodology to improve model performance and evaluation using multi-center data and newer architectures like GPT. Our findings support the potential value of using large language models to enhance pharmacovigilance.

Publisher

Cold Spring Harbor Laboratory

Reference27 articles.

1. Questions and Answers on FDA’s Adverse Event Reporting System (FAERS). https://www.fda.gov/drugs/surveillance/questions-and-answers-fdas-adverse-event-reporting-system-faers#:~:text=The%20FDA%20Adverse%20Event%20Reporting,that%20were%20submitted%20to%20FDA. Accessed 05/19/2023.

2. Thein D , Egeberg A , Skov L , Loft N . Absolute and Relative Risk of New-Onset Psoriasis Associated With Tumor Necrosis Factor-α Inhibitor Treatment in Patients With Immune-Mediated Inflammatory Diseases: A Danish Nationwide Cohort Study. JAMA dermatology. 2022.

3. Short and long-term effectiveness and safety of vedolizumab in inflammatory bowel disease: results from the ENEIDA registry;Alimentary pharmacology & therapeutics,2018

4. Under-Reporting of Adverse Drug Reactions

5. Causes for the underreporting of adverse drug events by health professionals: a systematic review;Revista da Escola de Enfermagem da USP,2014