Pre-trained Language Models in Biomedical Domain: A Systematic Survey

Authors:

Benyou Wang (1), Qianqian Xie (2), Jiahuan Pei (3), Zhihong Chen (4), Prayag Tiwari (5), Zhao Li (6), Jie Fu (7)

Affiliation:

1. SRIBD & SDS, The Chinese University of Hong Kong, Shenzhen, China

2. Department of Computer Science, University of Manchester, United Kingdom

3. University of Amsterdam, Netherlands

4. SRIBD & SSE, The Chinese University of Hong Kong, Shenzhen, China

5. School of Information Technology, Halmstad University, Sweden

6. The University of Texas Health Science Center at Houston, USA

7. Mila, University of Montreal, Canada

Abstract

Pre-trained language models (PLMs) have become the de facto paradigm for most natural language processing tasks. This also benefits the biomedical domain: researchers from the informatics, medicine, and computer science communities have proposed various PLMs trained on biomedical datasets, e.g., biomedical text, electronic health records, and protein and DNA sequences, for various biomedical tasks. However, the cross-discipline characteristics of biomedical PLMs hinder their spread across communities, and some existing works are isolated from each other without comprehensive comparison and discussion. It is nontrivial to write a survey that not only systematically reviews recent advances in biomedical PLMs and their applications but also standardizes terminology and benchmarks. This article summarizes recent progress on pre-trained language models in the biomedical domain and their applications in downstream biomedical tasks. In particular, we discuss the motivations for PLMs in the biomedical domain and introduce the key concepts of pre-trained language models. We then propose a taxonomy of existing biomedical PLMs that categorizes them systematically from various perspectives. In addition, their applications in downstream biomedical tasks are discussed exhaustively. Finally, we illustrate various limitations and future trends, aiming to provide inspiration for future research.
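As a concrete illustration of the paradigm the survey covers, the sketch below shows how a biomedical PLM can be loaded and used to produce contextual representations that a downstream biomedical task (e.g., named entity recognition or relation extraction) would build on. This is a minimal example, not the article's own code; it assumes the Hugging Face `transformers` library and the availability of the `dmis-lab/biobert-base-cased-v1.1` checkpoint on the Hugging Face Hub, and any other checkpoint of a biomedical PLM could be substituted.

```python
# Minimal sketch: load a biomedical PLM and extract contextual token embeddings.
# Assumption: the BioBERT checkpoint "dmis-lab/biobert-base-cased-v1.1" is
# available on the Hugging Face Hub; swap in any other biomedical PLM as needed.
import torch
from transformers import AutoTokenizer, AutoModel

model_name = "dmis-lab/biobert-base-cased-v1.1"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

text = "Aspirin inhibits platelet aggregation."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# Contextual embeddings for each token; a task-specific head (classifier,
# tagger, etc.) is typically fine-tuned on top of these representations.
token_embeddings = outputs.last_hidden_state  # shape: (1, seq_len, hidden_size)
print(token_embeddings.shape)
```

In the fine-tuning setting discussed throughout the survey, such a backbone would be wrapped with a task-specific output layer and trained end-to-end on labeled biomedical data rather than used only as a frozen feature extractor.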

Funder

Chinese Key-Area Research and Development Program of Guangdong Province

Shenzhen Science and Technology Program

Guangdong Provincial Key Laboratory of Big Data Computing, The Chinese University of Hong Kong, Shenzhen, Shenzhen Key Research Project

Shenzhen Doctoral Startup Funding

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science, Theoretical Computer Science

References: 346 articles.

