The shaky foundations of large language models and foundation models for electronic health records-Reference-Cited by-同舟云学术

The shaky foundations of large language models and foundation models for electronic health records

Published:2023-07-29 Issue:1 Volume:6 Page:
ISSN:2398-6352
Container-title:npj Digital Medicine
language:en
Short-container-title:npj Digit. Med.

Author:

Wornow Michael^ORCID,Xu Yizhe,Thapa Rahul,Patel Birju^ORCID,Steinberg Ethan^ORCID,Fleming Scott^ORCID,Pfeffer Michael A.,Fries Jason,Shah Nigam H.^ORCID

Abstract

AbstractThe success of foundation models such as ChatGPT and AlphaFold has spurred significant interest in building similar models for electronic medical records (EMRs) to improve patient care and hospital operations. However, recent hype has obscured critical gaps in our understanding of these models’ capabilities. In this narrative review, we examine 84 foundation models trained on non-imaging EMR data (i.e., clinical text and/or structured data) and create a taxonomy delineating their architectures, training data, and potential use cases. We find that most models are trained on small, narrowly-scoped clinical datasets (e.g., MIMIC-III) or broad, public biomedical corpora (e.g., PubMed) and are evaluated on tasks that do not provide meaningful insights on their usefulness to health systems. Considering these findings, we propose an improved evaluation framework for measuring the benefits of clinical foundation models that is more closely grounded to metrics that matter in healthcare.

Funder

National Science Foundation

Publisher

Springer Science and Business Media LLC

Subject

Health Information Management,Health Informatics,Computer Science Applications,Medicine (miscellaneous)

Link

https://www.nature.com/articles/s41746-023-00879-8.pdf

Reference106 articles.

1. Bommasani, R. et al. On the opportunities and risks of foundation models. Preprint at arXiv: 2108.07258 (2021).

2. Brown, T. B. et al. Language models are few-shot learners. Preprint at arXiv:2005.14165 (2020).

3. Esser, P., Chiu, J., Atighehchian, P., Granskog, J. & Germanidis, A. Structure and content-guided video synthesis with diffusion models. Preprint at arXiv: 2302.03011 (2023).

4. Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).

5. Jiang, Y. et al. VIMA: general robot manipulation with multimodal prompts. Preprint at arXiv: 2210.03094 (2022).

Cited by 74 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Hazard analysis in the era of AI: Assessing the usefulness of ChatGPT4 in STPA hazard analysis;Safety Science;2024-10

2. From bytes to bedside: a systematic review on the use and readiness of artificial intelligence in the neonatal and pediatric intensive care unit;Intensive Care Medicine;2024-09-12

3. Large language model to multimodal large language model: A journey to shape the biological macromolecules to biological sciences and medicine;Molecular Therapy - Nucleic Acids;2024-09

4. Enhancing Diagnostic Support for Chiari Malformation and Syringomyelia: A Comparative Study of Contextualized ChatGPT Models;World Neurosurgery;2024-09

5. Advancing Chinese biomedical text mining with community challenges;Journal of Biomedical Informatics;2024-09