A Custom GPT Health Recommender System Informed by Causal Bayesian Networks and Authoritative Ontologies (Preprint)

Author:

Hu HengyiORCID,Kerschberg Larry

Abstract

BACKGROUND

ChatGPT and other large language models (LLMs) are trained on extensive text data; they learn patterns and associations within the training text without an inherent understanding of underlying causal mechanisms. Establishing causation necessitates controlled experiments and observations, and as of May 2024, ChatGPT lacks access to experimental data and the capacity to learn analytical models from data. Recent advancements from OpenAI enable the creation of custom Generative Pre-trained Transformers (GPT) models using their GPT Builder. These custom GPTs can be tailored with causal knowledge from Causal Bayesian Networks (CBN), thereby producing a knowledgeable, health-recommender system with causal expertise. This system is not only easily accessible for patients, clinicians and other users, but also has the potential to significantly improve healthcare outcomes.

OBJECTIVE

This paper presents a practical solution –a custom GPT model integrated with Causal Bayesian Networks (CBNs) informed by Authoritative Medical Ontologies (AMOs) as prior foundational knowledge. AMOs are robust biomedical ontologies that encapsulate the expert knowledge of their creators. By utilizing structured information contained within these ontologies, we can generate an informed CBN which can be used to improve a custom GPT as a health recommender system. These enhanced GPTs offer profound insights into cause-and-effect among co-morbid symptoms within the disease domain.

METHODS

To demonstrate our recommender system, we learn a CBN using NIMH data for patients of Alzheimer’s Disease and augment this with causal knowledge from the International Classification of Diseases Version 10 Clinical Modification (ICD-10-CM). We compute the CBN using the Max-Min Hill-Climbing (MMHC) algorithm. We generate two separate CBNs using MMHC to compare predictive accuracies between a baseline and a CBN modified with causal mechanisms from ICD-10-CM. Our previous research using this method has resulted in a modified CBN that reflects the causal claims in the AMOs and agrees with both the AMOs and the observational dataset. With this causal model, we build a custom GPT using OpenAI’s GPT Builder.

RESULTS

The custom GPT contains both potentially causal and correlations among symptoms, as well as conditional probabilities for these relationships. The GPT will also contain knowledge of causal mechanisms from ICD-10-CM, extended information regarding symptom variables, and references to existing literature regarding comorbid Alzheimer’s Disease symptoms. Furthermore, because the modified CBN model establishes potentially causal relationships among symptoms which can be verified in existing epidemiological research, we can verify that the custom GPT also establishes these causal relationships. This creates a GPT that agrees with the modified CBN, which is a representation of existing subject matter expertise in the disease domain.

CONCLUSIONS

To obtain our CBN, we’ve used a previous methodology which obtains ordered variable pairs from authoritative ontology ICD-10-CM as prior expertise. In our past research, a CBN that is learned using MMHC can be improved significantly by considering prior sources of knowledge, if the algorithm is modified appropriately. This source of prior knowledge can be validated in existing literature as a sequence of events in AD progression, specific causal mechanisms among comorbid symptoms, and conditional probabilities from a Bayesian Network. The resulting modified network provides insight into the causal relationships expressed in the AD data and takes advantage of the expertise and knowledge contained in the AMOs. Since inferring causality from a CBN does not exist in a vacuum, the relationships within the CBN, regardless of the strength of the conditional probabilities, must be explored further. A custom GPT extends the causal knowledge within a CBN with general knowledge of symptoms and diseases, providing a tool capable of suggesting causal inference based on the analysis of real patient data. With a LLM, uncertainty is present but reasoned with, as it is present but reasoned with in the prior, posterior, and likelihood in Bayes Theorem. Moving forward, we would like to explore using ChatGPT to produce ordered variable pairs and to automate the validation of potentially causal information.

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3