Self-Attention-Based Models for the Extraction of Molecular Interactions from Biological Texts-Reference-Cited by-同舟云学术

Self-Attention-Based Models for the Extraction of Molecular Interactions from Biological Texts

Published:2021-10-27 Issue:11 Volume:11 Page:1591
ISSN:2218-273X
Container-title:Biomolecules
language:en
Short-container-title:Biomolecules

Author:

Srivastava Prashant,Bej Saptarshi,Yordanova Kristina^ORCID,Wolkenhauer Olaf^ORCID

Abstract

For any molecule, network, or process of interest, keeping up with new publications on these is becoming increasingly difficult. For many cellular processes, the amount molecules and their interactions that need to be considered can be very large. Automated mining of publications can support large-scale molecular interaction maps and database curation. Text mining and Natural-Language-Processing (NLP)-based techniques are finding their applications in mining the biological literature, handling problems such as Named Entity Recognition (NER) and Relationship Extraction (RE). Both rule-based and Machine-Learning (ML)-based NLP approaches have been popular in this context, with multiple research and review articles examining the scope of such models in Biological Literature Mining (BLM). In this review article, we explore self-attention-based models, a special type of Neural-Network (NN)-based architecture that has recently revitalized the field of NLP, applied to biological texts. We cover self-attention models operating either at the sentence level or an abstract level, in the context of molecular interaction extraction, published from 2019 onwards. We conducted a comparative study of the models in terms of their architecture. Moreover, we also discuss some limitations in the field of BLM that identifies opportunities for the extraction of molecular interactions from biological text.

Publisher

MDPI AG

Subject

Molecular Biology,Biochemistry

Link

https://www.mdpi.com/2218-273X/11/11/1591/pdf

Reference46 articles.

1. Text Mining

2. The STRING database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets

3. The Atlas of Inflammation Resolution (AIR)

4. Recent advances in biomedical literature mining

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Large language models assisted multi-effect variants mining on cerebral cavernous malformation familial whole genome sequencing;Computational and Structural Biotechnology Journal;2024-12

2. Fine-tuning a pre-trained Transformers-based model for gene name entity recognition in biomedical text using a customized dataset: case of Desulfovibrio vulgaris Hildenborough;2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM);2023-12-05

3. Traffic flow prediction based on transformer;Journal of Physics: Conference Series;2023-11-01

4. A Service Recommendation Algorithm Based on Self-Attention Mechanism and DeepFM;International Journal of Web Services Research;2023-10-10

5. Multi-omics integration method based on attention deep learning network for biomedical data classification;Computer Methods and Programs in Biomedicine;2023-04