ABEE: automated bio entity extraction from biomedical text documents-Reference-Cited by-同舟云学术

ABEE: automated bio entity extraction from biomedical text documents

Published:2023-01-25 Issue:2 Volume:57 Page:222-244
ISSN:2514-9288
Container-title:Data Technologies and Applications
language:en
Short-container-title:DTA

Author:

Kumar Ashutosh^ORCID,Sharaff Aakanksha

Abstract

PurposeThe purpose of this study was to design a multitask learning model so that biomedical entities can be extracted without having any ambiguity from biomedical texts.Design/methodology/approachIn the proposed automated bio entity extraction (ABEE) model, a multitask learning model has been introduced with the combination of single-task learning models. Our model used Bidirectional Encoder Representations from Transformers to train the single-task learning model. Then combined model's outputs so that we can find the verity of entities from biomedical text.FindingsThe proposed ABEE model targeted unique gene/protein, chemical and disease entities from the biomedical text. The finding is more important in terms of biomedical research like drug finding and clinical trials. This research aids not only to reduce the effort of the researcher but also to reduce the cost of new drug discoveries and new treatments.Research limitations/implicationsAs such, there are no limitations with the model, but the research team plans to test the model with gigabyte of data and establish a knowledge graph so that researchers can easily estimate the entities of similar groups.Practical implicationsAs far as the practical implication concerned, the ABEE model will be helpful in various natural language processing task as in information extraction (IE), it plays an important role in the biomedical named entity recognition and biomedical relation extraction and also in the information retrieval task like literature-based knowledge discovery.Social implicationsDuring the COVID-19 pandemic, the demands for this type of our work increased because of the increase in the clinical trials at that time. If this type of research has been introduced previously, then it would have reduced the time and effort for new drug discoveries in this area.Originality/valueIn this work we proposed a novel multitask learning model that is capable to extract biomedical entities from the biomedical text without any ambiguity. The proposed model achieved state-of-the-art performance in terms of precision, recall and F1 score.

Publisher

Emerald

Subject

Library and Information Sciences,Information Systems

Reference45 articles.

1. Malay named entity recognition based on rule-based approach;International Journal of Machine Learning and Computing,2014

2. A framework for learning predictive structures from multiple tasks and unlabeled data;Journal of Machine Learning Research,2005

3. Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training;Journal of Biomedical Informatics,2019

4. Rule-based information extraction is dead! long live rule-based information extraction systems!,2013

5. A unified architecture for natural language processing: deep neural networks with multitask learning,2008

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Biomedical Named Entity Recognition Based on Residual Network and Global Context Mechanism;2023 International Conference on Intelligent Communication and Networking (ICN);2023-11-10