Predicting candidate genes from phenotypes, functions, and anatomical site of expression

Author:

Chen JunORCID,Althagafi AzzaORCID,Hoehndorf RobertORCID

Abstract

ABSTRACTMotivationOver the past years, many computational methods have been developed to incorporate information about phenotypes for disease gene prioritization task. These methods generally compute the similarity between a patient’s phenotypes and a database of gene-phenotype to find the most phenotypically similar match. The main limitation in these methods is their reliance on knowledge about phenotypes associated with particular genes, which is not complete in humans as well as in many model organisms such as the mouse and fish. Information about functions of gene products and anatomical site of gene expression is available for more genes and can also be related to phenotypes through ontologies and machine learning models.ResultsWe developed a novel graph-based machine learning method for biomedical ontologies which is able to exploit axioms in ontologies and other graph-structured data. Using our machine learning method, we embed genes based on their associated phenotypes, functions of the gene products, and anatomical location of gene expression. We then develop a machine learning model to predict gene–disease associations based on the associations between genes and multiple biomedical ontologies, and this model significantly improves over state of the art methods. Furthermore, we extend phenotype-based gene prioritization methods significantly to all genes which are associated with phenotypes, functions, or site of expression.AvailabilitySoftware and data are available at https://github.com/bio-ontology-research-group/DL2Vec.Contactrobert.hoehndorf@kaust.edu.sa

Publisher

Cold Spring Harbor Laboratory

Reference62 articles.

1. Integrated genomic and network-based analyses of complex diseases and human disease network;Journal of Genetics and Genomics,2016

2. Alanis-Lobato, G. et al. (2016). Hippie v2. 0: enhancing meaningfulness and reliability of protein–protein interaction networks. Nucleic acids research, page gkw985.

3. Neuro-symbolic representation learning on biological knowledge graphs;Bioinformatics,2017

4. A new face and new challenges for Online Mendelian Inheritance in Man (OMIM®)

5. Gene Ontology: tool for the unification of biology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3