G2P: Using machine learning to understand and predict genes causing rare neurological disorders-Reference-Cited by-同舟云学术

G2P: Using machine learning to understand and predict genes causing rare neurological disorders

Published:2018-03-27 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Botía Juan A.,Guelfi Sebastian,Zhang David,D’Sa Karishma,Reynolds Regina,Onah Daniel,McDonagh Ellen M.,Martin Antonio Rueda,Tucci Arianna,Rendon Augusto,Houlden Henry,Hardy John,Ryten Mina

Abstract

AbstractTo facilitate precision medicine and neuroscience research, we developed a machine-learning technique that scores the likelihood that a gene, when mutated, will cause a neurological phenotype. We analysed 1126 genes relating to 25 subtypes of Mendelian neurological disease defined by Genomics England (March 2017) together with 154 gene-specific features capturing genetic variation, gene structure and tissue-specific expression and co-expression. We randomly re-sampled genes with no known disease association to develop bootstrapped decision-tree models, which were integrated to generate a decision tree-based ensemble for each disease subtype. Genes generating larger numbers of distinct transcripts and with higher probability of having missense mutations in normal individuals were significantly more likely to cause neurological diseases. Using mouse-mutant phenotypic data we tested the accuracy of gene-phenotype predictions and found that for 88% of all disease subtypes there was a significant enrichment of relevant phenotypic abnormalities when predicted genes were mutated in mice and in many cases mutations produced specific and matching phenotypes. Furthermore, using only newly identified genes included in the Genomics England November 2017 release, we assessed our gene-phenotype predictions and showed an 8.3 fold enrichment relative to chance for correct predictions. Thus, we demonstrate both the explanatory and predictive power of machine-learning-based models in neurological disease.

Publisher

Cold Spring Harbor Laboratory

Reference39 articles.

1. Rare-disease genetics in the era of next-generation sequencing: discovery to translation

2. Warman Chardon, J. , Beaulieu, C. , Hartley, T. , Boycott, K. M. & Dyment, D. A. Axons to Exons: the Molecular Diagnosis of Rare Neurological Diseases by Next-Generation Sequencing. Curr. Neurol. Neurosci. Rep. 15, (2015).

3. Clinical and genetic heterogeneity in hereditary spastic paraplegias: From SPG1 to SPG72 and still counting

4. Inherited Neuropathies;Semin. Neurol,2015

5. Analysis of protein-coding genetic variation in 60,706 humans

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Functional genomics provide key insights to improve the diagnostic yield of hereditary ataxia;Brain;2023-01-10

2. PhenoExam: gene set analyses through integration of different phenotype databases;BMC Bioinformatics;2022-12-31

3. Functional genomics provide key insights to improve the diagnostic yield of hereditary ataxia;2022-06-27

4. PhenoExam: an R package and Web application for the examination of phenotypes linked to genes and gene sets;2021-06-30

5. Moving beyond neurons: the role of cell type-specific gene regulation in Parkinson’s disease heritability;npj Parkinson's Disease;2019-04-17