DeepPPPred: An Ensemble of BERT, CNN, and RNN for Classifying Co-mentions of Proteins and Phenotypes

Published: 2020-09-20

Author:

Shahri Morteza Pourreza, Lyon Katrina, Schearer Julia, Kahanda Indika

Abstract

The biomedical literature provides an extensive source of information in the form of unstructured text. One of the most important types of information hidden in biomedical literature is the relationships between human proteins and their phenotypes, which, due to the exponential growth of publications, can remain hidden. This provides a range of opportunities for the development of computational methods to extract the biomedical relationships from the unstructured text. In our previous work, we developed a supervised machine learning approach, called PPPred, for classifying the validity of a given sentence-level human protein-phenotype co-mention. In this work, we propose DeepPPPred, an ensemble classifier composed of PPPred and three deep neural network models: RNN, CNN, and BERT. Using an expanded gold-standard co-mention dataset, we demonstrate that the proposed ensemble method significantly outperforms its constituent components and provides a new state-of-the-art performance on classifying the co-mentions of human proteins and phenotype terms.
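
The abstract names the four components of the ensemble but does not say how their outputs are combined. As a rough illustration only, the sketch below assumes soft voting: each component reports a probability that a co-mention is valid, and the ensemble averages them. The function name `soft_vote`, the component keys, the 0.5 threshold, and the example scores are all hypothetical and are not taken from the paper.

```python
from statistics import mean

# Hypothetical keys for the four components named in the abstract.
COMPONENTS = ("pppred", "rnn", "cnn", "bert")


def soft_vote(probabilities, threshold=0.5):
    """Average per-model probabilities that a co-mention is valid.

    `probabilities` maps each component name to a probability in [0, 1].
    Plain averaging is an assumption made for illustration; the abstract
    does not state the combination rule used by DeepPPPred.
    """
    missing = [name for name in COMPONENTS if name not in probabilities]
    if missing:
        raise ValueError(f"missing predictions for: {missing}")
    score = mean(probabilities[name] for name in COMPONENTS)
    return score, score >= threshold


# Made-up scores for one sentence-level protein-phenotype co-mention.
example = {"pppred": 0.40, "rnn": 0.70, "cnn": 0.65, "bert": 0.85}
score, is_valid = soft_vote(example)
```

In this made-up case one component votes against the co-mention while the other three vote for it, so the averaged score still lands above the threshold. Weighted averaging or a stacked meta-classifier would be equally plausible alternatives.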

Publisher

Cold Spring Harbor Laboratory

Cited by 5 articles.

1. A Novel Patient Similarity Network (PSN) Framework Based on Multi-Model Deep Learning for Precision Medicine;Journal of Personalized Medicine;2022-05-10

2. Spatial Impressions Monitoring during COVID-19 Pandemic Using Machine Learning Techniques;Computers;2022-03-29

3. Detecting racism and xenophobia using deep learning models on Twitter data: CNN, LSTM and BERT;PeerJ Computer Science;2022-03-01

4. Deep semi-supervised learning ensemble framework for classifying co-mentions of human proteins and phenotypes;BMC Bioinformatics;2021-10-16

5. A Multidimensional Data Fusion Model Based on Deep Learning for a Patient Similarity Network (Preprint);2021-02-16