Abstract
AbstractGenetic variants in genesGRIN1,GRIN2A,GRIN2B, andGRIN2D, which encode subunits of the N-methyl-D-aspartate receptor (NMDAR), have been associated with severe and heterogeneous neurologic diseases. Missense variants in these genes can result in gain or loss of the NMDAR function, requiring opposite therapeutic treatments. Computational methods that predict pathogenicity and molecular functional effects are therefore crucial for accurate diagnosis and therapeutic applications. We assembled missense variants: 201 from patients, 631 from general population, and 159 characterized by electrophysiological readouts showing whether they can enhance or reduce the receptor function. This includes new functional data from 47 variants reported here, for the first time. We found that pathogenic/benign variants and variants that increase/decrease the channel function were distributed unevenly on the protein structure, with spatial proximity to ligands bound to the agonist and antagonist binding sites being key predictive features. Leveraging distances from ligands, we developed two independent machine learning-based predictors for NMDAR missense variants: a pathogenicity predictor which outperforms currently available predictors (AUC=0.945, MCC=0.726), and the first binary predictor of molecular function (increase or decrease) (AUC=0.809, MCC=0.523). Using these, we reclassified variants of uncertain significance in the ClinVar database and refined a previous genome-informed epidemiological model to estimate the birth incidence of molecular mechanism-defined GRIN disorders. Our findings demonstrate that distance from ligands is an important feature in NMDARs that can enhance variant pathogenicity prediction and enable functional prediction. Further studies with larger numbers of phenotypically and functionally characterized variants will enhance the potential clinical utility of this method.
Publisher
Cold Spring Harbor Laboratory