Scalable Nonparametric Prescreening Method for Searching Higher-Order Genetic Interactions Underlying Quantitative Traits

Published: 2019-12-01 Issue: 4 Volume: 213 Page: 1209-1224
ISSN: 1943-2631
Container-title: Genetics
Language: en

Author:

Kontio Juho A J¹, Sillanpää Mikko J¹,²

Affiliation:

1. Research Unit of Mathematical Sciences, Biocenter Oulu, University of Oulu, 90014, Finland

2. Infotech Oulu, University of Oulu, 90014, Finland

Abstract

Gaussian process (GP) regression is theoretically capable of capturing higher-order gene-by-gene interactions important to trait variation non-exhaustively with high accuracy. Unfortunately, the GP approach is scalable only to 100-200 genes and is thus not applicable for high...

Gaussian process (GP)-based automatic relevance determination (ARD) is known to be an efficient technique for identifying determinants of gene-by-gene interactions important to trait variation. However, the estimation of GP models is feasible only for low-dimensional datasets (∼200 variables), which severely limits the application of the GP-based ARD method to high-throughput sequencing data. In this paper, we provide a nonparametric prescreening method that preserves virtually all the major benefits of the GP-based ARD method and extends its scalability to the typical high-dimensional datasets used in practice. In several simulated test scenarios, the proposed method compared favorably with existing nonparametric dimension reduction/prescreening methods suitable for higher-order interaction searches. As a real-data example, the proposed method was applied to a high-throughput dataset downloaded from The Cancer Genome Atlas (TCGA) with measured expression levels of 16,976 genes (after preprocessing) from patients diagnosed with acute myeloid leukemia.

Publisher

Oxford University Press (OUP)

Subject

Genetics

Link

https://academic.oup.com/genetics/article-pdf/213/4/1209/42106504/genetics1209.pdf

References: 64 articles.

1. Inferring transcription factor collaborations in gene regulatory networks; Awad; BMC Syst. Biol., 2014

2. Bayesian kernel machine regression for estimating the health effects of multi-pollutant mixtures; Bobb; Biostatistics, 2015

3. High-dimensional variable screening and bias in subsequent inference, with an empirical comparison; Bühlmann; Comput. Stat., 2014

4. Loss of power in two-stage residual-outcome regression analysis in genetic association studies; Che; Genet. Epidemiol., 2012

Cited by 6 articles.

1. Nonlinear expression patterns and multiple shifts in gene network interactions underlie robust phenotypic change in Drosophila melanogaster selected for night sleep duration; PLOS Computational Biology; 2023-08-10

2. Quantum Chemical Calculations with Machine Learning for Multipolar Electrostatics Prediction in RNA: An Application to Pentose; Journal of Chemical Information and Modeling; 2022-08-29

3. Multiple shifts in gene network interactions shape phenotypes of Drosophila melanogaster selected for long and short night sleep duration; 2021-07-12

4. Model guided trait-specific co-expression network estimation as a new perspective for identifying molecular interactions and pathways; PLOS Computational Biology; 2021-05-03

5. Model guided trait-specific co-expression network estimation as a new perspective for identifying molecular interactions and pathways; 2020-09-22