XGDAG: explainable gene–disease associations via graph neural networks-Reference-Cited by-同舟云学术

XGDAG: explainable gene–disease associations via graph neural networks

Published:2023-08-01 Issue:8 Volume:39 Page:
ISSN:1367-4811
Container-title:Bioinformatics
language:en
Short-container-title:

Author:

Mastropietro Andrea¹^ORCID,De Carlo Gianluca¹^ORCID,Anagnostopoulos Aris¹

Affiliation:

1. Department of Computer, Control and Management Engineering “Antonio Ruberti”, Sapienza University of Rome , Rome 00185, Italy

Abstract

Abstract Motivation Disease gene prioritization consists in identifying genes that are likely to be involved in the mechanisms of a given disease, providing a ranking of such genes. Recently, the research community has used computational methods to uncover unknown gene–disease associations; these methods range from combinatorial to machine learning-based approaches. In particular, during the last years, approaches based on deep learning have provided superior results compared to more traditional ones. Yet, the problem with these is their inherent black-box structure, which prevents interpretability. Results We propose a new methodology for disease gene discovery, which leverages graph-structured data using graph neural networks (GNNs) along with an explainability phase for determining the ranking of candidate genes and understanding the model’s output. Our approach is based on a positive–unlabeled learning strategy, which outperforms existing gene discovery methods by exploiting GNNs in a non-black-box fashion. Our methodology is effective even in scenarios where a large number of associated genes need to be retrieved, in which gene prioritization methods often tend to lose their reliability. Availability and implementation The source code of XGDAG is available on GitHub at: https://github.com/GiDeCarlo/XGDAG. The data underlying this article are available at: https://www.disgenet.org/, https://thebiogrid.org/, https://doi.org/10.1371/journal.pcbi.1004120.s003, and https://doi.org/10.1371/journal.pcbi.1004120.s004.

Funder

SoBigData++

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Link

https://academic.oup.com/bioinformatics/advance-article-pdf/doi/10.1093/bioinformatics/btad482/51024117/btad482.pdf

Reference68 articles.

1. Gene ontology: tool for the unification of biology;Ashburner;Nat Genet,2000

2. Edgar: a database of disease-gene associations with annotated relationships among genes;Babbi;BMC Genomics,2017

3. Ring structures and mean first passage time in networks;Baronchelli;Phys Rev E Stat Nonlin Soft Matter Phys,2006

4. Learning from positive and unlabeled data: a survey;Bekker;Mach Learn,2020

5. Transcriptional addiction in cancer;Bradner;Cell,2017

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Learning Association Characteristics by Dynamic Hypergraph and Gated Convolution Enhanced Pairwise Attributes for Prediction of Disease-Related lncRNAs;Journal of Chemical Information and Modeling;2024-03-25

2. Predicting disease-gene associations through self-supervised mutual infomax graph convolution network;Computers in Biology and Medicine;2024-03