Multilingual bi‐encoder models for biomedical entity linking-Reference-Cited by-同舟云学术

Multilingual bi‐encoder models for biomedical entity linking

Published:2023-06-21 Issue:9 Volume:40 Page:
ISSN:0266-4720
Container-title:Expert Systems
language:en
Short-container-title:Expert Systems

Author:

Guven Zekeriya Anil¹,Lamurias Andre²

Affiliation:

1. Department of Computer Engineering Izmir Bakircay University Izmir Turkey

2. Department of Computer Science NOVA School of Science and Technology Lisbon Portugal

Abstract

AbstractNatural language processing (NLP) is a field of study that focuses on data analysis on texts with certain methods. NLP includes tasks such as sentiment analysis, spam detection, entity linking, and question answering, to name a few. Entity linking is an NLP task that is used to map mentions specified in the text to the entities of a Knowledge Base. In this study, we analysed the efficacy of bi‐encoder entity linking models for multilingual biomedical texts. Using surface‐based, approximate nearest neighbour search and embedding approaches during the candidate generation phase, accuracy, and recall values were measured on language representation models such as BERT, SapBERT, BioBERT, and RoBERTa according to language and domain. The proposed entity linking framework was analysed on the BC5CDR and Cantemist datasets for English and Spanish, respectively. The framework achieved 76.75% accuracy for the BC5CDR and 60.19% for the Cantemist. In addition, the proposed framework was compared with previous studies. The results highlight the challenges that come with domain‐specific multilingual datasets.

Publisher

Wiley

Subject

Artificial Intelligence,Computational Theory and Mathematics,Theoretical Computer Science,Control and Systems Engineering

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/exsy.13388

Reference48 articles.

1. Andrade V. D. T. Ruas P. &Couto F. M.(2021).Named entity recognition and linking: A Portuguese and Spanish oncological parallel corpus. bioRxiv.https://doi.org/10.1101/2021.09.16.460605

2. Angell R. Monath N. Mohan S. Yadav N. &McCallum A.(2021).Clustering‐based inference for biomedical entity linking. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2598–2608.

3. Bhargav G. P. S. Khandelwal D. Dana S. Garg D. Kapanipathi P. Roukos S. Gray A. &Subramaniam L. V.(2022).Zero‐shot entity linking with less data. In: Findings of the Association for Computational Linguistics: NAACL 2022 Seattle United States: Association for Computational Linguistics 1681–1697.https://aclanthology.org/2022.findings-naacl.127

4. Bhowmik R. Stratos K. &deMelo G.(2021).Fast and effective biomedical entity linking using a dual encoder. arXiv Preprint arXiv:210305028.

5. Building Transformer‐Based Entity Linking Systemizuna385 | Nerd For Tech.Medium.https://medium.com/nerd-for-tech/building-bi-encoder-based-entity-linking-system-with-transformer-6c111d86500

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evolution of AI in Business Intelligence;Advances in Computational Intelligence and Robotics;2024-08-30