Exome sequencing and characterization of 49,960 individuals in the UK Biobank
Author:
Van Hout Cristopher V., Tachmazidou Ioanna, Backman Joshua D., Hoffman Joshua D., Liu Daren, Pandey Ashutosh K., Gonzaga-Jauregui Claudia, Khalid Shareef, Ye Bin, Banerjee Nilanjana, Li Alexander H., O’Dushlaine Colm, Marcketta Anthony, Staples Jeffrey, Schurmann Claudia, Hawes Alicia, Maxwell Evan, Barnard Leland, Lopez Alexander, Penn John, Habegger Lukas, Blumenfeld Andrew L., Bai Xiaodong, O’Keeffe Sean, Yadav Ashish, Praveen Kavita, Jones Marcus, Salerno William J., Chung Wendy K., Surakka Ida, Willer Cristen J., Hveem Kristian, Leader Joseph B., Carey David J., Ledbetter David H., Cardon Lon, Yancopoulos George D., Economides Aris, Coppola Giovanni, Shuldiner Alan R., Balasubramanian Suganthi, Cantor Michael, Nelson Matthew R., Whittaker John, Reid Jeffrey G., Marchini Jonathan, Overton John D., Scott Robert A., Abecasis Gonçalo R., Yerges-Armstrong Laura, Baras Aris
Abstract
The UK Biobank is a prospective study of 502,543 individuals, combining extensive phenotypic and genotypic data with streamlined access for researchers around the world [1]. Here we describe the release of exome-sequence data for the first 49,960 study participants, revealing approximately 4 million coding variants (of which around 98.6% have a frequency of less than 1%). The data include 198,269 autosomal predicted loss-of-function (LOF) variants, a more than 14-fold increase compared to the imputed sequence. Nearly all genes (more than 97%) had at least one carrier with a LOF variant, and most genes (more than 69%) had at least ten carriers with a LOF variant. We illustrate the power of characterizing LOF variants in this population through association analyses across 1,730 phenotypes. In addition to replicating established associations, we found novel LOF variants with large effects on disease traits, including PIEZO1 on varicose veins, COL6A1 on corneal resistance, MEPE on bone density, and IQGAP2 and GMPR on blood cell traits. We further demonstrate the value of exome sequencing by surveying the prevalence of pathogenic variants of clinical importance, and show that 2% of this population has a medically actionable variant. Furthermore, we characterize the penetrance of cancer in carriers of pathogenic BRCA1 and BRCA2 variants. Exome sequences from the first 49,960 participants highlight the promise of genome sequencing in large population-based studies and are now accessible to the scientific community.
Publisher
Springer Science and Business Media LLC
Subject
Multidisciplinary
Reference
58 articles.
1. Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
2. Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
3. Tyrrell, J. et al. Height, body mass index, and socioeconomic status: Mendelian randomisation study in UK Biobank. Br. Med. J. 352, i582 (2016).
4. Lyall, D. M. et al. Association of body mass index with cardiometabolic disease in the UK Biobank: a Mendelian randomization study. JAMA Cardiol. 2, 882–889 (2017).
5. Abul-Husn, N. S. et al. A protein-truncating HSD17B13 variant and protection from chronic liver disease. N. Engl. J. Med. 378, 1096–1106 (2018).
Cited by
394 articles.