The sequences of 150,119 genomes in the UK Biobank
Author:
Halldorsson Bjarni V.ORCID, Eggertsson Hannes P.ORCID, Moore Kristjan H. S., Hauswedell Hannes, Eiriksson OgmundurORCID, Ulfarsson Magnus O., Palsson GunnarORCID, Hardarson Marteinn T., Oddsson AsmundurORCID, Jensson Brynjar O.ORCID, Kristmundsdottir Snaedis, Sigurpalsdottir Brynja D., Stefansson Olafur A.ORCID, Beyter DorukORCID, Holley Guillaume, Tragante Vinicius, Gylfason Arnaldur, Olason Pall I., Zink Florian, Asgeirsdottir Margret, Sverrisson Sverrir T., Sigurdsson Brynjar, Gudjonsson Sigurjon A., Sigurdsson Gunnar T., Halldorsson Gisli H.ORCID, Sveinbjornsson Gardar, Norland Kristjan, Styrkarsdottir UnnurORCID, Magnusdottir Droplaug N., Snorradottir Steinunn, Kristinsson Kari, Sobech Emilia, Jonsson Helgi, Geirsson Arni J., Olafsson Isleifur, Jonsson Palmi, Pedersen Ole BirgerORCID, Erikstrup ChristianORCID, Brunak SørenORCID, Ostrowski Sisse RyeORCID, Andersen Steffen, Banasik Karina, Burgdorf Kristoffer, Didriksen Maria, Dinh Khoa Manh, Erikstrup Christian, Gudbjartsson Daniel, Hansen Thomas Folkmann, Hjalgrim Henrik, Jemec Gregor, Jennum Poul, Johansson Pär Ingemar, Larsen Margit Anita Hørup, Mikkelsen Susan, Nielsen Kasper Rene, Nyegaard Mette, Ostrowski Sisse Rye, Sækmose Susanne, Sørensen Erik, Thorsteinsdottir Unnur, Brun Mie Topholm, Ullum Henrik, Werge Thomas, Thorleifsson Gudmar, Jonsson Frosti, Melsted Pall, Jonsdottir IngileifORCID, Rafnar ThorunnORCID, Holm HilmaORCID, Stefansson Hreinn, Saemundsdottir Jona, Gudbjartsson Daniel F.ORCID, Magnusson Olafur T., Masson Gisli, Thorsteinsdottir Unnur, Helgason Agnar, Jonsson HakonORCID, Sulem PatrickORCID, Stefansson Kari,
Abstract
AbstractDetailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data1,2. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank3. This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.
Publisher
Springer Science and Business Media LLC
Subject
Multidisciplinary
Reference75 articles.
1. Gudbjartsson, D. F. et al. Large-scale whole-genome sequencing of the Icelandic population. Nat. Genet. 47, 435–444 (2015). 2. Taliun, D. et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 590, 290–299 (2021). 3. Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015). 4. Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am. J. Epidemiol. 186, 1026–1034 (2017). 5. Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Cited by
235 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|