Practical Issues in Screening and Variable Selection in Genome-Wide Association Analysis

Published:2014-01 Issue: Volume:13s7 Page:CIN.S16350
ISSN:1176-9351
Container-title:Cancer Informatics
language:en
Short-container-title:Cancer Inform

Author:

Hong Sungyeon¹, Kim Yongkang¹, Park Taesung¹,²

Affiliation:

1. Department of Statistics, Seoul National University, Seoul, South Korea.

2. Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea.

Abstract

Variable selection methods play an important role in high-dimensional statistical modeling and analysis. Computational cost and estimation accuracy are the two main concerns for statistical inference from ultrahigh-dimensional data. In particular, genome-wide association studies (GWAS), which focus on identifying single nucleotide polymorphisms (SNPs) associated with a disease of interest, have produced ultrahigh-dimensional data. Numerous methods have been proposed to handle GWAS data. Most statistical methods have adopted a two-stage approach: pre-screening for dimension reduction, followed by variable selection to identify causal SNPs. The pre-screening step ranks SNPs by their P-values or by the absolute values of their regression coefficients in single-SNP analysis. Penalized regressions, such as the ridge, lasso, adaptive lasso, and elastic-net regressions, are commonly used for the variable selection step. In this paper, we investigate which combination of pre-screening method and penalized regression performs best on a quantitative phenotype using two real GWAS datasets.
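
The two-stage approach described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes simulated genotypes coded 0/1/2, a quantitative phenotype, pre-screening by the absolute single-SNP regression coefficient, and a lasso fitted by plain coordinate descent. The function names (`marginal_screen`, `lasso_cd`), the simulated effect sizes, the number of SNPs kept, and the penalty `lam` are all hypothetical choices made for illustration.

```python
import random

def marginal_screen(X, y, keep):
    """Stage 1: rank SNPs by |slope| of a single-SNP least-squares fit
    and return the column indices of the top `keep` SNPs."""
    n, p = len(X), len(X[0])
    y_mean = sum(y) / n
    scores = []
    for j in range(p):
        col = [row[j] for row in X]
        m = sum(col) / n
        sxx = sum((v - m) ** 2 for v in col)
        sxy = sum((v - m) * (t - y_mean) for v, t in zip(col, y))
        slope = sxy / sxx if sxx > 0 else 0.0
        scores.append((abs(slope), j))
    scores.sort(reverse=True)
    return sorted(j for _, j in scores[:keep])

def soft_threshold(z, t):
    """Soft-thresholding operator used in the lasso coordinate update."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0

def lasso_cd(X, y, lam, n_iter=200):
    """Stage 2: lasso by cyclic coordinate descent on centred data,
    minimising (1/2n)*||y - Xb||^2 + lam*||b||_1."""
    n, p = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(p)]
    cols = [[row[j] - means[j] for row in X] for j in range(p)]
    y_mean = sum(y) / n
    resid = [t - y_mean for t in y]
    norms = [sum(v * v for v in c) / n for c in cols]
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            if norms[j] == 0:
                continue
            # Correlation of SNP j with the partial residual (SNP j added back).
            rho = sum(c * r for c, r in zip(cols[j], resid)) / n + norms[j] * beta[j]
            new = soft_threshold(rho, lam) / norms[j]
            delta = new - beta[j]
            if delta != 0.0:
                resid = [r - c * delta for r, c in zip(resid, cols[j])]
                beta[j] = new
    return beta

# Simulated data (assumption): 200 samples, 300 SNPs, two causal SNPs.
rng = random.Random(0)
n, p = 200, 300
X = [[(rng.random() < 0.3) + (rng.random() < 0.3) for _ in range(p)] for _ in range(n)]
y = [1.5 * row[3] - 1.0 * row[17] + rng.gauss(0, 0.5) for row in X]

screened = marginal_screen(X, y, keep=20)          # stage 1: pre-screening
X_sub = [[row[j] for j in screened] for row in X]  # reduced design matrix
beta = lasso_cd(X_sub, y, lam=0.1)                 # stage 2: variable selection
coef = dict(zip(screened, beta))
selected = [j for j in screened if coef[j] != 0.0]
print("SNPs kept by screening:", screened)
print("SNPs selected by lasso:", selected)
```

Ranking by P-value instead of absolute coefficient, or replacing the lasso with a ridge, adaptive lasso, or elastic-net penalty, changes only the scoring rule in stage 1 or the coordinate update in stage 2. The overall two-stage structure stays the same.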

Publisher

SAGE Publications

Subject

Cancer Research,Oncology

Link

http://journals.sagepub.com/doi/pdf/10.4137/CIN.S16350

Cited by 8 articles.

1. Identifying and overcoming COVID-19 vaccination impediments using Bayesian data mining techniques;Scientific Reports;2024-04-13

2. Self-semi-supervised clustering for large scale data with massive null group;Journal of the Korean Statistical Society;2020-01-01

3. SNP variable selection by generalized graph domination;PLOS ONE;2019-01-24

4. Networking in Biology: The Hybrid Rat Diversity Panel;Methods in Molecular Biology;2019

5. Sparse wars: A survey and comparative study of spherical deconvolution algorithms for diffusion MRI;NeuroImage;2019-01