Joint analysis of multiple phenotypes for extremely unbalanced case‐control association studies-Reference-Cited by-同舟云学术

Joint analysis of multiple phenotypes for extremely unbalanced case‐control association studies

Published:2023-01-24 Issue:2 Volume:47 Page:185-197
ISSN:0741-0395
Container-title:Genetic Epidemiology
language:en
Short-container-title:Genetic Epidemiology

Author:

Xie Hongjing¹,Cao Xuewei¹^ORCID,Zhang Shuanglin¹^ORCID,Sha Qiuying¹^ORCID

Affiliation:

1. Department of Mathematical Sciences Michigan Technological University Houghton Michigan USA

Abstract

AbstractIn genome‐wide association studies (GWAS) for thousands of phenotypes in biobanks, most binary phenotypes have substantially fewer cases than controls. Many widely used approaches for joint analysis of multiple phenotypes produce inflated type I error rates for such extremely unbalanced case‐control phenotypes. In this research, we develop a method to jointly analyze multiple unbalanced case‐control phenotypes to circumvent this issue. We first group multiple phenotypes into different clusters based on a hierarchical clustering method, then we merge phenotypes in each cluster into a single phenotype. In each cluster, we use the saddlepoint approximation to estimate the p value of an association test between the merged phenotype and a single nucleotide polymorphism (SNP) which eliminates the issue of inflated type I error rate of the test for extremely unbalanced case‐control phenotypes. Finally, we use the Cauchy combination method to obtain an integrated p value for all clusters to test the association between multiple phenotypes and a SNP. We use extensive simulation studies to evaluate the performance of the proposed approach. The results show that the proposed approach can control type I error rate very well and is more powerful than other available methods. We also apply the proposed approach to phenotypes in category IX (diseases of the circulatory system) in the UK Biobank. We find that the proposed approach can identify more significant SNPs than the other viable methods we compared with.

Publisher

Wiley

Subject

Genetics (clinical),Epidemiology

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/gepi.22513

Reference56 articles.

1. UK Biobank Data: Come and Get It

2. Maximizing the Power of Principal-Component Analysis of Correlated Phenotypes in Genome-wide Association Studies

3. Network medicine: a network-based approach to human disease

4. A Fast and Accurate Method for Genome-Wide Time-to-Event Data Analysis and Its Application to UK Biobank

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Constructing genotype and phenotype network helps reveal disease heritability and phenome-wide association studies;2023-11-20

2. A novel method for multiple phenotype association studies based on genotype and phenotype network;2023-02-23