Genome-wide analysis of Structural Variants in Parkinson’s Disease using Short-Read Sequencing data
Author:
Billingsley Kimberley J.ORCID, Ding JinhuiORCID, Jerez Pilar Alvarez, Illarionova Anastasia, Grenn Francis P., Makarious Mary B.ORCID, Moore Anni, Vitale Daniel, Reed Xylena, Hernandez Dena, Torkamani Ali, Ryten Mina, Hardy John, Chia RuthORCID, Scholz Sonja W., Traynor Bryan J., Dalgard Clifton L.ORCID, Ehrlich Debra J., Tanaka Toshiko, Ferrucci Luigi, Beach Thomas.G., Serrano Geidy E., Quinn John P., Bubb Vivien J., Collins Ryan L, Zhao Xuefang, Walker Mark, Pierce-Hoffman Emma, Brand Harrison, Talkowski MichaelORCID, Casey Bradford, Cookson Mark RORCID, Markham Androo, Nalls Mike, Mahmoud Medhat, Sedlazeck Fritz J, Blauwendraat Cornelis, Gibbs J. RaphaelORCID, Singleton Andrew B.,
Abstract
AbstractParkinson’s disease is a complex neurodegenerative disorder, affecting approximately one million individuals in the USA alone. A significant proportion of risk for Parkinson’s disease is driven by genetics. Despite this, the majority of the common genetic variation that contributes to disease risk is unknown, in-part because previous genetic studies have focussed solely on the contribution of single nucleotide variants. Structural variants represent a significant source of genetic variation in the human genome. However, because assay of this variability is challenging, structural variants have not been cataloged on a genome-wide scale, and their contribution to the risk of Parkinson’s disease remains unknown. In this study, we 1) leveraged the GATK-SV pipeline to detect and genotype structural variants in 7,772 short-read sequencing data and 2) generated a subset of matched whole-genome Oxford Nanopore Technologies long-read sequencing data from the PPMI cohort to allow for comprehensive structural variant confirmation. We detected, genotyped, and tested 3,154 “high-confidence” common structural variant loci, representing over 412 million nucleotides of non-reference genetic variation. Using the long-read sequencing data, we validated three structural variants that may drive the association signals at known Parkinson’s disease risk loci, including a 2kb intronic deletion within the gene LRRN4. Further, we confirm that the majority of structural variants in the human genome cannot be detected using short-read sequencing alone, encompassing on average around 4 million nucleotides of inaccessible sequence per genome. Therefore, although these data provide the most comprehensive survey of the contribution of structural variants to the genetic risk of Parkinson’s disease to date, this study highlights the need for large-scale long-read datasets to fully elucidate the role of structural variants in Parkinson’s disease.
Publisher
Cold Spring Harbor Laboratory
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|