SCREENER: Streamlined collaborative learning of NER and RE model for discovering gene-disease relations-Reference-Cited by-同舟云学术

SCREENER: Streamlined collaborative learning of NER and RE model for discovering gene-disease relations

Published:2023-11-27 Issue:11 Volume:18 Page:e0294713
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Park Minjun^ORCID,Jeong Chan Ung,Baik Young Sang,Lee Dong Geon,Park Jeong U.,Koo Hee Jung,Kim Tae Yong^ORCID

Abstract

Finding relations between genes and diseases is essential in developing a clinical diagnosis, treatment, and drug design for diseases. One successful approach for mining the literature is the document-based relation extraction method. Despite recent advances in document-level extraction of entity-entity, there remains a difficulty in understanding the relations between distant words in a document. To overcome the above limitations, we propose an AI-based text-mining model that learns the document-level relations between genes and diseases using an attention mechanism. Furthermore, we show that including a direct edge (DE) and indirect edges between genetic targets and diseases when training improves the model’s performance. Such relation edges can be visualized as graphs, enhancing the interpretability of the model. For the performance, we achieved an F1-score of 0.875, outperforming state-of-the-art document-level extraction models. In summary, the SCREENER identifies biological connections between target genes and diseases with superior performance by leveraging direct and indirect target-disease relations. Furthermore, we developed a web service platform named SCREENER (Streamlined CollaboRativE lEarning of NEr and Re), which extracts the gene-disease relations from the biomedical literature in real-time. We believe this interactive platform will be useful for users to uncover unknown gene-disease relations in the world of fast-paced literature publications, with sufficient interpretation supported by graph visualizations. The interactive website is available at: https://ican.standigm.com.

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference41 articles.

1. Identification and Analysis of Co-Occurrence Networks with NetCutter;H Müller;PLoS ONE,2008

2. DigSee: disease gene search engine with evidence sentences (version cancer);J Kim;Nucleic Acids Research,2013

3. LGscore: A method to identify disease-related genes using biological literature and Google data;J Kim;Journal of Biomedical Informatics,2015

4. DISEASES: Text mining and data integration of disease–gene associations;S Pletscher-Frankild;Methods,2015

5. eDGAR: a database of disease-gene associations with annotated relationships among genes;G Babbi;BMC genomics,2017

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Artificial intelligence-driven drug repositioning uncovers efavirenz as a modulator of α-synuclein propagation: Implications in Parkinson’s disease;Biomedicine & Pharmacotherapy;2024-05

2. Learning the Relationship Between Variants, Metabolic Fluxes and Phenotypes;2024-03-05