A Hybrid Deep Learning Model for Protein–Protein Interactions Extraction from Biomedical Literature-Reference-Cited by-同舟云学术

A Hybrid Deep Learning Model for Protein–Protein Interactions Extraction from Biomedical Literature

Published:2020-04-13 Issue:8 Volume:10 Page:2690
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Quan Changqin,Luo Zhiwei,Wang Song

Abstract

The exponentially increasing size of biomedical literature and the limited ability of manual curators to discover protein–protein interactions (PPIs) in text has led to delays in keeping PPI databases updated with the current findings. The state-of-the-art text mining methods for PPI extraction are primarily based on deep learning (DL) models, and the performance of a DL-based method is mainly affected by the architecture of DL models and the feature embedding methods. In this study, we compared different architectures of DL models, including convolutional neural networks (CNN), long short-term memory (LSTM), and hybrid models, and proposed a hybrid architecture of a bidirectional LSTM+CNN model for PPI extraction. Pretrained word embedding and shortest dependency path (SDP) embedding are fed into a two-embedding channel model, such that the model is able to model long-distance contextual information and can capture the local features and structure information effectively. The experimental results showed that the proposed model is superior to the non-hybrid DL models, and the hybrid CNN+Bidirectional LSTM model works well for PPI extraction. The visualization and comparison of the hidden features learned by different DL models further confirmed the effectiveness of the proposed model.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/10/8/2690/pdf

Reference44 articles.

1. Small molecules, big targets: drug discovery faces the protein–protein interaction challenge

2. BIND: the Biomolecular Interaction Network Database

3. MINT: a Molecular INTeraction database

4. The IntAct molecular interaction database in 2012

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. R2V-PPI: Enhancing Prediction of Protein-Protein Interactions Using Word2Vec Embeddings and Deep Neural Networks;2024 Fourth International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies (ICAECT);2024-01-11

2. Information Extraction for Biomedical Literature Using Artificial Intelligence: A Comparative Study;Lecture Notes in Networks and Systems;2024

3. Learning entity-oriented representation for biomedical relation extraction;Journal of Biomedical Informatics;2023-11

4. Protein–Protein Interaction Network Extraction Using Text Mining Methods Adds Insight into Autism Spectrum Disorder;Biology;2023-10-18

5. Constructing a disease database and using natural language processing to capture and standardize free text clinical information;Scientific Reports;2023-05-26