Recent Advances in Large Language Models for Healthcare-Reference-Cited by-同舟云学术

Recent Advances in Large Language Models for Healthcare

Published:2024-04-16 Issue:2 Volume:4 Page:1097-1143
ISSN:2673-7426
Container-title:BioMedInformatics
language:en
Short-container-title:BioMedInformatics

Author:

Nassiri Khalid¹^ORCID,Akhloufi Moulay A.¹^ORCID

Affiliation:

1. Perception, Robotics and Intelligent Machines (PRIME), Department of Computer Science, Université de Moncton, Moncton, NB E1A 3E9, Canada

Abstract

Recent advances in the field of large language models (LLMs) underline their high potential for applications in a variety of sectors. Their use in healthcare, in particular, holds out promising prospects for improving medical practices. As we highlight in this paper, LLMs have demonstrated remarkable capabilities in language understanding and generation that could indeed be put to good use in the medical field. We also present the main architectures of these models, such as GPT, Bloom, or LLaMA, composed of billions of parameters. We then examine recent trends in the medical datasets used to train these models. We classify them according to different criteria, such as size, source, or subject (patient records, scientific articles, etc.). We mention that LLMs could help improve patient care, accelerate medical research, and optimize the efficiency of healthcare systems such as assisted diagnosis. We also highlight several technical and ethical issues that need to be resolved before LLMs can be used extensively in the medical field. Consequently, we propose a discussion of the capabilities offered by new generations of linguistic models and their limitations when deployed in a domain such as healthcare.

Publisher

MDPI AG

Link

https://www.mdpi.com/2673-7426/4/2/62/pdf

Reference299 articles.

1. Ye, J., Chen, X., Xu, N., Zu, C., Shao, Z., Liu, S., Cui, Y., Zhou, Z., Gong, C., and Shen, Y. (2023). A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models. arXiv.

2. OpenAI, Achiam, J., Adler, S., Agarwal, S., Ahmad, L., Akkaya, I., Aleman, F.L., Almeida, D., Altenschmidt, J., and Altman, S. (2023). GPT-4 Technical Report. arXiv.

3. Language Models are Few-shot Learners;Brown;Adv. Neural Inf. Process. Syst.,2020

4. Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019, January 2–7). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, MN, USA.

5. A Systematic Review of Natural Language Processing in Healthcare;Iroju;Int. J. Inf. Technol. Comput. Sci.,2015

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. How to critically appraise and direct the trajectory of AI development and application in oncology;ESMO Real World Data and Digital Oncology;2024-09

2. Comparison of the Knowledge of Large Language Models and General Radiologist on RECIST (Preprint);2024-07-26

3. Exploring Temperature Effects on Large Language Models Across Various Clinical Tasks;2024-07-22

4. Contrasting the performance of mainstream Large Language Models in Radiology Board Examinations;2024-07-19

5. Contrasting the performance of mainstream Large Language Models in Radiology Board Examinations (Preprint);2024-07-14