PMC-LLaMA: toward building open-source language models for medicine-Reference-Cited by-同舟云学术

PMC-LLaMA: toward building open-source language models for medicine

Published:2024-04-13 Issue:9 Volume:31 Page:1833-1843
ISSN:1067-5027
Container-title:Journal of the American Medical Informatics Association
language:en
Short-container-title:

Author:

Wu Chaoyi¹²,Lin Weixiong¹²,Zhang Xiaoman¹²,Zhang Ya¹²,Xie Weidi¹²,Wang Yanfeng¹²

Affiliation:

1. Cooperative Medianet Innovation Center (CMIC), Shanghai Jiao Tong University , Shanghai, 200240, China

2. Shanghai AI Laboratory , Shanghai, 200232, China

Abstract

Abstract Objective Recently, large language models (LLMs) have showcased remarkable capabilities in natural language understanding. While demonstrating proficiency in everyday conversations and question-answering (QA) situations, these models frequently struggle in domains that require precision, such as medical applications, due to their lack of domain-specific knowledge. In this article, we describe the procedure for building a powerful, open-source language model specifically designed for medicine applications, termed as PMC-LLaMA. Materials and methods We adapt a general-purpose LLM toward the medical domain, involving data-centric knowledge injection through the integration of 4.8M biomedical academic papers and 30K medical textbooks, as well as comprehensive domain-specific instruction fine-tuning, encompassing medical QA, rationale for reasoning, and conversational dialogues with 202M tokens. Results While evaluating various public medical QA benchmarks and manual rating, our lightweight PMC-LLaMA, which consists of only 13B parameters, exhibits superior performance, even surpassing ChatGPT. All models, codes, and datasets for instruction tuning will be released to the research community. Discussion Our contributions are 3-fold: (1) we build up an open-source LLM toward the medical domain. We believe the proposed PMC-LLaMA model can promote further development of foundation models in medicine, serving as a medical trainable basic generative language backbone; (2) we conduct thorough ablation studies to demonstrate the effectiveness of each proposed component, demonstrating how different training data and model scales affect medical LLMs; (3) we contribute a large-scale, comprehensive dataset for instruction tuning. Conclusion In this article, we systematically investigate the process of building up an open-source medical-specific LLM, PMC-LLaMA.

Funder

National Key R&D Program of China

Science and Technology Commission of Shanghai Municipality

Higher Education Discipline Innovation Project 111

State Key Laboratory of UHD Video and Audio Production and Presentation.

Publisher

Oxford University Press (OUP)

Link

https://academic.oup.com/jamia/article-pdf/31/9/1833/58868261/ocae045.pdf

Reference49 articles.

1. Large language models encode clinical knowledge;Singhal;Nature,2023

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Large language models in biomedicine and health: current research landscape and future directions;Journal of the American Medical Informatics Association;2024-08-22

2. Large language models for medicine: a survey;International Journal of Machine Learning and Cybernetics;2024-08-19

3. Evaluating local open-source large language models for data extraction from unstructured reports on mechanical thrombectomy in patients with ischemic stroke;Journal of NeuroInterventional Surgery;2024-08-02

4. Location-enhanced syntactic knowledge for biomedical relation extraction;Journal of Biomedical Informatics;2024-08

5. Llama 3 Challenges Proprietary State-of-the-Art Large Language Models in Radiology Board–style Examination Questions;Radiology;2024-08-01