Data-driven modelling of mutational hotspots and in-silico predictors in hypertrophic cardiomyopathy-Reference-Cited by-同舟云学术

Data-driven modelling of mutational hotspots and in-silico predictors in hypertrophic cardiomyopathy

Published:2019-10-31 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Waring A.J.^ORCID,Harper A.R.^ORCID,Salatino S.^ORCID,Kramer C.M.^ORCID,Neubauer S,Thomson K.L.^ORCID,Watkins H.^ORCID,Farrall M.^ORCID

Abstract

ABSTRACTBackgroundAlthough rare-missense variants in Mendelian disease-genes have been noted to cluster in specific regions of proteins, it is not clear how to consider this information when evaluating the pathogenicity of a gene or variant. Here we introduce methods for gene-association and variant-interpretation that utilise this powerful signal.MethodsWe present a case-control rare-variant association test, ClusterBurden, that combines information on both variant-burden and variant-clustering. We then introduce a data-driven modelling framework to estimate mutational hotspots in genes with missense variant-clustering and integrate further in-silico predictors into the models.ResultsWe show that ClusterBurden can increase statistical power to scan for putative disease-genes, driven by missense variants, in simulated data and a 34-gene panel dataset of 5,338 cases of hypertrophic cardiomyopathy. We demonstrate that data-driven models can allow quantitative application of the ACMG criteria PM1 and PP3, to resolve a wide range of pathogenicity potential amongst variants of uncertain significance. A web application (Pathogenicity_by_Position) is accessible for missense variant risk prediction of six sarcomeric genes and an R package is available for association testing using ClusterBurden.ConclusionThe inclusion of missense residue position enhances the power of disease-gene association and improves rare-variant pathogenicity interpretation.

Publisher

Cold Spring Harbor Laboratory

Reference39 articles.

1. A cluster of mutations within a short triplet repeat in the C1 inhibitor gene.

2. Localized mutations in the gene encoding the cytoskeletal protein filamin A cause diverse malformations in humans

3. Newly identified genetic risk variants for celiac disease related to the immune response

4. Disease-causing missense mutations in actin binding domain 1 of dystrophin induce thermodynamic instability and protein aggregation

5. APOL1Risk Variants Predict Histopathology and Progression to ESRD in HIV-Related Kidney Disease