Abstract
Broad yet detailed data collected in biobanks capture variation reflective of human health and behavior, but insights are hard to extract given their complexity and scale. In the largest factor analysis to date, we distill hundreds of medical record codes, physical assays, and survey items from UK Biobank into 35 understandable latent constructs. The identified factors recapitulate known disease classifications, highlight the relevance of psychiatric constructs, improve measurement of health-related behavior, and disentangle elements of socioeconomic status. We demonstrate the power of this principled data reduction approach to clarify genetic signal, enhance discovery, and identify associations between underlying phenotypic structure and health outcomes such as mortality. We emphasize the importance of considering the interwoven nature of the human phenome when evaluating large-scale patterns relevant to public health.
Publisher
Cold Spring Harbor Laboratory