Large-scale discovery of novel neurodevelopmental disorder-related genes through a unified analysis of single-nucleotide and copy number variants
-
Published:2022-04-26
Issue:1
Volume:14
Page:
-
ISSN:1756-994X
-
Container-title:Genome Medicine
-
language:en
-
Short-container-title:Genome Med
Author:
Hamanaka Kohei, Miyake Noriko, Mizuguchi Takeshi, Miyatake Satoko, Uchiyama Yuri, Tsuchida Naomi, Sekiguchi Futoshi, Mitsuhashi Satomi, Tsurusaki Yoshinori, Nakashima Mitsuko, Saitsu Hirotomo, Yamada Kohei, Sakamoto Masamune, Fukuda Hiromi, Ohori Sachiko, Saida Ken, Itai Toshiyuki, Azuma Yoshiteru, Koshimizu Eriko, Fujita Atsushi, Erturk Biray, Hiraki Yoko, Ch’ng Gaik-Siew, Kato Mitsuhiro, Okamoto Nobuhiko, Takata Atsushi, Matsumoto NaomichiORCID
Abstract
Abstract
Background
Previous large-scale studies of de novo variants identified a number of genes associated with neurodevelopmental disorders (NDDs); however, it was also predicted that many NDD-associated genes await discovery. Such genes can be discovered by integrating copy number variants (CNVs), which have not been fully considered in previous studies, and increasing the sample size.
Methods
We first constructed a model estimating the rates of de novo CNVs per gene from several factors such as gene length and number of exons. Second, we compiled a comprehensive list of de novo single-nucleotide variants (SNVs) in 41,165 individuals and de novo CNVs in 3675 individuals with NDDs by aggregating our own and publicly available datasets, including denovo-db and the Deciphering Developmental Disorders study data. Third, summing up the de novo CNV rates that we estimated and SNV rates previously established, gene-based enrichment of de novo deleterious SNVs and CNVs were assessed in the 41,165 cases. Significantly enriched genes were further prioritized according to their similarity to known NDD genes using a deep learning model that considers functional characteristics (e.g., gene ontology and expression patterns).
Results
We identified a total of 380 genes achieving statistical significance (5% false discovery rate), including 31 genes affected by de novo CNVs. Of the 380 genes, 52 have not previously been reported as NDD genes, and the data of de novo CNVs contributed to the significance of three genes (GLTSCR1, MARK2, and UBR3). Among the 52 genes, we reasonably excluded 18 genes [a number almost identical to the theoretically expected false positives (i.e., 380 × 0.05 = 19)] given their constraints against deleterious variants and extracted 34 “plausible” candidate genes. Their validity as NDD genes was consistently supported by their similarity in function and gene expression patterns to known NDD genes. Quantifying the overall similarity using deep learning, we identified 11 high-confidence (> 90% true-positive probabilities) candidate genes: HDAC2, SUPT16H, HECTD4, CHD5, XPO1, GSK3B, NLGN2, ADGRB1, CTR9, BRD3, and MARK2.
Conclusions
We identified dozens of new candidates for NDD genes. Both the methods and the resources developed here will contribute to the further identification of novel NDD-associated genes.
Funder
Japan Agency for Medical Research and Development Japan Society for the Promotion of Science Ichiro Kanehara Foundation for the Promotion of Medical Sciences and Medical Care Intramural grant of Yokohama City University Takeda Science Foundation
Publisher
Springer Science and Business Media LLC
Subject
Genetics (clinical),Genetics,Molecular Biology,Molecular Medicine
Reference56 articles.
1. Kaplanis J, Samocha KE, Wiel L, Zhang Z, Arvai KJ, Eberhardt RY, et al. Evidence for 28 genetic disorders discovered by combining healthcare and research data. Nature. 2020;586(7831):757–62. 2. Coe BP, Stessman HAF, Sulovari A, Geisheker MR, Bakken TE, Lake AM, et al. Neurodevelopmental disease genes implicated by de novo mutation and copy number variation morbidity. Nat Genet. 2019;51(1):106–16. 3. Fitzgerald TW, Gerety SS, Jones WD, van Kogelenberg M, King DA, McRae J, et al. Large-scale discovery of novel genetic causes of developmental disorders. Nature. 2015;519(7542):223–8. 4. McRae JF, Clayton S, Fitzgerald TW, Kaplanis J, Prigmore E, Rajan D, et al. Prevalence and architecture of de novo mutations in developmental disorders. Nature. 2017;542(7642):433–8. 5. Samocha KE, Robinson EB, Sanders SJ, Stevens C, Sabo A, McGrath LM, et al. A framework for the interpretation of de novo mutation in human disease. Nat Genet. 2014;46(9):944–50.
Cited by
18 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|