Taiyi: a bilingual fine-tuned large language model for diverse biomedical tasks-Reference-Cited by-同舟云学术

Taiyi: a bilingual fine-tuned large language model for diverse biomedical tasks

Published:2024-02-29 Issue:9 Volume:31 Page:1865-1874
ISSN:1067-5027
Container-title:Journal of the American Medical Informatics Association
language:en
Short-container-title:

Author:

Luo Ling¹^ORCID,Ning Jinzhong¹,Zhao Yingwen¹,Wang Zhijun¹,Ding Zeyuan¹,Chen Peng¹^ORCID,Fu Weiru¹,Han Qinyu¹,Xu Guangtao¹,Qiu Yunzhi¹,Pan Dinghao¹,Li Jiru¹,Li Hao¹,Feng Wenduo¹,Tu Senbo¹,Liu Yuqi¹,Yang Zhihao¹^ORCID,Wang Jian¹,Sun Yuanyuan¹,Lin Hongfei¹

Affiliation:

1. School of Computer Science and Technology, Dalian University of Technology , Dalian 116024, China

Abstract

Abstract Objective Most existing fine-tuned biomedical large language models (LLMs) focus on enhancing performance in monolingual biomedical question answering and conversation tasks. To investigate the effectiveness of the fine-tuned LLMs on diverse biomedical natural language processing (NLP) tasks in different languages, we present Taiyi, a bilingual fine-tuned LLM for diverse biomedical NLP tasks. Materials and Methods We first curated a comprehensive collection of 140 existing biomedical text mining datasets (102 English and 38 Chinese datasets) across over 10 task types. Subsequently, these corpora were converted to the instruction data used to fine-tune the general LLM. During the supervised fine-tuning phase, a 2-stage strategy is proposed to optimize the model performance across various tasks. Results Experimental results on 13 test sets, which include named entity recognition, relation extraction, text classification, and question answering tasks, demonstrate that Taiyi achieves superior performance compared to general LLMs. The case study involving additional biomedical NLP tasks further shows Taiyi’s considerable potential for bilingual biomedical multitasking. Conclusion Leveraging rich high-quality biomedical corpora and developing effective fine-tuning strategies can significantly improve the performance of LLMs within the biomedical domain. Taiyi shows the bilingual multitasking capability through supervised fine-tuning. However, those tasks such as information extraction that are not generation tasks in nature remain challenging for LLM-based generative approaches, and they still underperform the conventional discriminative approaches using smaller language models.

Funder

National Natural Science Foundation of China

Fundamental Research Funds for the Central Universities

Publisher

Oxford University Press (OUP)

Link

https://academic.oup.com/jamia/article-pdf/31/9/1865/58868142/ocae037.pdf

Reference51 articles.

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Large language model to multimodal large language model: A journey to shape the biological macromolecules to biological sciences and medicine;Molecular Therapy - Nucleic Acids;2024-09

2. Large language models in biomedicine and health: current research landscape and future directions;Journal of the American Medical Informatics Association;2024-08-22

3. LaDer: A Two-Stage Unsupervised Method for Stem Cell Entity Recognition Based on Reinforcement Learning;Arabian Journal for Science and Engineering;2024-08-17

4. A New Adapter Tuning of Large Language Model for Chinese Medical Named Entity Recognition;Applied Artificial Intelligence;2024-08-05

5. Joint extraction of Chinese medical entities and relations based on RoBERTa and single-module global pointer;BMC Medical Informatics and Decision Making;2024-07-31