Multi-task learning for few-shot biomedical relation extraction

Published:2023-04-19 Issue:11 Volume:56 Page:13743-13763
ISSN:0269-2821
Container-title:Artificial Intelligence Review
language:en
Short-container-title:Artif Intell Rev

Author:

Moscato Vincenzo,Napolano Giuseppe,Postiglione Marco,Sperlì Giancarlo

Abstract

Artificial intelligence (AI) has advanced rapidly, but it has limited impact on biomedical text understanding due to a lack of annotated datasets (a.k.a. few-shot learning). Multi-task learning, which uses data from multiple datasets and tasks with related syntax and semantics, has potential to address this issue. However, the effectiveness of this approach heavily relies on the quality of the available data and its transferability between tasks. In this paper, we propose a framework, built upon a state-of-the-art multi-task method (i.e. MT-DNN), that leverages different publicly available biomedical datasets to enhance relation extraction performance. Our model employs a transformer-based architecture with shared encoding layers across multiple tasks, and task-specific classification layers to generate task-specific representations. To further improve performance, we utilize a knowledge distillation technique. In our experiments, we assess the impact of incorporating biomedical datasets in a multi-task learning setting and demonstrate that it consistently outperforms state-of-the-art few-shot learning methods in cases of limited data. This results in significant improvement across most datasets and few-shot scenarios, particularly in terms of recall scores.
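The two mechanisms named in the abstract (shared encoding layers with task-specific classification layers, and knowledge distillation) can be illustrated with a small, dependency-free sketch. This is not the authors' implementation: a single tanh layer stands in for the transformer encoder, the class name `MultiTaskModel`, the task names `task_a` and `task_b`, and all dimensions are hypothetical, and the distillation objective shown (cross-entropy against temperature-softened teacher outputs) is an assumed, commonly used form that the abstract does not specify.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Numerically stable softmax with temperature scaling."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, bias, x):
    """Dense layer; `weights` holds one row per output unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


class MultiTaskModel:
    """Hypothetical toy model: one shared encoder, one head per task."""

    def __init__(self, input_dim, hidden_dim, task_num_labels, seed=0):
        rng = random.Random(seed)

        def init(rows, cols):
            return [[rng.uniform(-0.1, 0.1) for _ in range(cols)]
                    for _ in range(rows)]

        # Shared parameters: every task's examples pass through these.
        self.enc_w = init(hidden_dim, input_dim)
        self.enc_b = [0.0] * hidden_dim
        # Task-specific parameters: one classification layer per task.
        self.heads = {task: (init(n, hidden_dim), [0.0] * n)
                      for task, n in task_num_labels.items()}

    def encode(self, x):
        # Stand-in for the transformer encoder (assumption for brevity).
        return [math.tanh(h) for h in linear(self.enc_w, self.enc_b, x)]

    def logits(self, task, x):
        weights, bias = self.heads[task]
        return linear(weights, bias, self.encode(x))


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Cross-entropy between softened teacher and student distributions."""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    return -sum(t * math.log(s) for t, s in zip(p_teacher, p_student))
```

In this sketch, calling `model.logits("task_a", x)` and `model.logits("task_b", x)` on the same input reuses one encoding and differs only in the final layer, which is the sense in which low-resource tasks can benefit from data seen by the others.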

Funder

Università degli Studi di Napoli Federico II

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Linguistics and Language,Language and Linguistics

Link

https://link.springer.com/content/pdf/10.1007/s10462-023-10484-6.pdf

References: 46 articles.

1. Afradi A, Ebrahimabadi A (2020) Comparison of artificial neural networks (ANN), support vector machine (SVM) and gene expression programming (GEP) approaches for predicting TBM penetration rate. SN Appl Sci 2:1–16

2. Afradi A, Ebrahimabadi A (2021) Prediction of TBM penetration rate using the imperialist competitive algorithm (ICA) and quantum fuzzy logic. Innov Infrastruct Solut 6(2):103

3. Afradi A, Ebrahimabadi A, Hallajian T (2020) Prediction of tunnel boring machine penetration rate using ant colony optimization, bee colony optimization and the particle swarm optimization, case study: Sabzkooh water conveyance tunnel. Mining Miner Depos 14(2):75–84

4. Afradi A, Ebrahimabadi A, Hallajian T (2021) Prediction of TBM penetration rate using fuzzy logic, particle swarm optimization and harmony search algorithm. Geotech Geol Eng 8:1–24

5. Alimova I, Tutubalina E (2020) Multiple features for clinical relation extraction: a machine learning approach. J Biomed Inform 103:103382. https://doi.org/10.1016/j.jbi.2020.103382

Cited by 10 articles.

1. From zero to hero: Harnessing transformers for biomedical named entity recognition in zero- and few-shot contexts;Artificial Intelligence in Medicine;2024-10

2. End-to-end framework for agricultural entity extraction – A hybrid model with transformer;Computers and Electronics in Agriculture;2024-10

3. Few-shot biomedical relation extraction using data augmentation and domain information;Neurocomputing;2024-08

4. An Open-Set Semi-Supervised Multi-Task Learning Framework for Context Classification in Biomedical Texts;2024-07-23

5. Recent Advances in Large Language Models for Healthcare;BioMedInformatics;2024-04-16