Machine learning-based approaches for disease gene prediction-Reference-Cited by-同舟云学术

Machine learning-based approaches for disease gene prediction

Published:2020-06-22 Issue:5-6 Volume:19 Page:350-363
ISSN:2041-2657
Container-title:Briefings in Functional Genomics
language:en
Short-container-title:

Author:

Le Duc-Hau¹^ORCID

Affiliation:

1. Department of Computational Biomedicine, Vingroup Big Data Institute, Hanoi, Vietnam

Abstract

AbstractDisease gene prediction is an essential issue in biomedical research. In the early days, annotation-based approaches were proposed for this problem. With the development of high-throughput technologies, interaction data between genes/proteins have grown quickly and covered almost genome and proteome; thus, network-based methods for the problem become prominent. In parallel, machine learning techniques, which formulate the problem as a classification, have also been proposed. Here, we firstly show a roadmap of the machine learning-based methods for the disease gene prediction. In the beginning, the problem was usually approached using a binary classification, where positive and negative training sample sets are comprised of disease genes and non-disease genes, respectively. The disease genes are ones known to be associated with diseases; meanwhile, non-disease genes were randomly selected from those not yet known to be associated with diseases. However, the later may contain unknown disease genes. To overcome this uncertainty of defining the non-disease genes, more realistic approaches have been proposed for the problem, such as unary and semi-supervised classification. Recently, more advanced methods, including ensemble learning, matrix factorization and deep learning, have been proposed for the problem. Secondly, 12 representative machine learning-based methods for the disease gene prediction were examined and compared in terms of prediction performance and running time. Finally, their advantages, disadvantages, interpretability and trust were also analyzed and discussed.

Funder

Vietnam National Foundation for Science and Technology Development

Publisher

Oxford University Press (OUP)

Link

http://academic.oup.com/bfg/article-pdf/19/5-6/350/34694598/elaa013.pdf

Reference118 articles.

1. Advances in translational bioinformatics: computational approaches for the hunting of disease genes;Kann;Brief Bioinform,2009

2. A guide to web tools to prioritize candidate genes;Tranchevent;Brief Bioinform,2010

3. Network-based methods for human disease gene prediction;Wang;Brief Funct Genomics,2011

4. POCUS: mining genomic sequence annotation to predict disease genes;Turner;Genome Biol,2003

5. SUSPECTS: enabling fast and effective prioritization of positional candidates;Adie;Bioinformatics,2006

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Improving the Accuracy of Oncology Diagnosis: A Machine Learning-Based Approach to Cancer Prediction;International Journal of Online and Biomedical Engineering (iJOE);2024-08-08

2. Identification of molecular subtypes of dementia by using blood-proteins interaction-aware graph propagational network;Briefings in Bioinformatics;2024-07-25

3. Machine learning methods for genomic prediction of cow behavioral traits measured by automatic milking systems in North American Holstein cattle;Journal of Dairy Science;2024-07

4. Drug discovery and development in the era of artificial intelligence: From machine learning to large language models;Artificial Intelligence Chemistry;2024-06

5. DiSMVC: a multi-view graph collaborative learning framework for measuring disease similarity;Bioinformatics;2024-05-01