RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition-Reference-Cited by-同舟云学术

RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition

Published:2024-05-06 Issue:9 Volume:31 Page:1929-1938
ISSN:1067-5027
Container-title:Journal of the American Medical Informatics Association
language:en
Short-container-title:

Author:

Li Mingchen¹,Zhou Huixue¹²^ORCID,Yang Han¹²^ORCID,Zhang Rui¹^ORCID

Affiliation:

1. Division of Computational Health Sciences, Department of Surgery, University of Minnesota , Minneapolis, MN 55455, United States

2. Institute for Health Informatics, University of Minnesota , Minneapolis, MN 55455, United States

Abstract

Abstract Objectives This article aims to enhance the performance of larger language models (LLMs) on the few-shot biomedical named entity recognition (NER) task by developing a simple and effective method called Retrieving and Chain-of-Thought (RT) framework and to evaluate the improvement after applying RT framework. Materials and Methods Given the remarkable advancements in retrieval-based language model and Chain-of-Thought across various natural language processing tasks, we propose a pioneering RT framework designed to amalgamate both approaches. The RT approach encompasses dedicated modules for information retrieval and Chain-of-Thought processes. In the retrieval module, RT discerns pertinent examples from demonstrations during instructional tuning for each input sentence. Subsequently, the Chain-of-Thought module employs a systematic reasoning process to identify entities. We conducted a comprehensive comparative analysis of our RT framework against 16 other models for few-shot NER tasks on BC5CDR and NCBI corpora. Additionally, we explored the impacts of negative samples, output formats, and missing data on performance. Results Our proposed RT framework outperforms other LMs for few-shot NER tasks with micro-F1 scores of 93.50 and 91.76 on BC5CDR and NCBI corpora, respectively. We found that using both positive and negative samples, Chain-of-Thought (vs Tree-of-Thought) performed better. Additionally, utilization of a partially annotated dataset has a marginal effect of the model performance. Discussion This is the first investigation to combine a retrieval-based LLM and Chain-of-Thought methodology to enhance the performance in biomedical few-shot NER. The retrieval-based LLM aids in retrieving the most relevant examples of the input sentence, offering crucial knowledge to predict the entity in the sentence. We also conducted a meticulous examination of our methodology, incorporating an ablation study. Conclusion The RT framework with LLM has demonstrated state-of-the-art performance on few-shot NER tasks.

Funder

National Institutes of Health

National Center for Complementary and Integrative Health

National Institute on Aging

National Cancer Institute

Publisher

Oxford University Press (OUP)

Link

https://academic.oup.com/jamia/article-pdf/31/9/1929/58868150/ocae095.pdf

Reference32 articles.

1. Hierarchical shared transfer learning for biomedical named entity recognition;Chai;BMC Bioinformatics,2022

2. Medical knowledge graph: data sources, construction, reasoning, and applications;Wu;Big Data Min Anal,2023

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Large language models in biomedicine and health: current research landscape and future directions;Journal of the American Medical Informatics Association;2024-08-22