DeepLINK: Deep learning inference using knockoffs with applications to genomics-Reference-Cited by-同舟云学术

DeepLINK: Deep learning inference using knockoffs with applications to genomics

Published:2021-09-03 Issue:36 Volume:118 Page:e2104683118
ISSN:0027-8424
Container-title:Proceedings of the National Academy of Sciences
language:en
Short-container-title:Proc Natl Acad Sci USA

Author:

Zhu Zifan^ORCID,Fan Yingying^ORCID,Kong Yinfei^ORCID,Lv Jinchi^ORCID,Sun Fengzhu^ORCID

Abstract

We propose a deep learning–based knockoffs inference framework, DeepLINK, that guarantees the false discovery rate (FDR) control in high-dimensional settings. DeepLINK is applicable to a broad class of covariate distributions described by the possibly nonlinear latent factor models. It consists of two major parts: an autoencoder network for the knockoff variable construction and a multilayer perceptron network for feature selection with the FDR control. The empirical performance of DeepLINK is investigated through extensive simulation studies, where it is shown to achieve FDR control in feature selection with both high selection power and high prediction accuracy. We also apply DeepLINK to three real data applications to demonstrate its practical utility.

Funder

HHS | NIH | National Institute of General Medical Sciences

NSF | MPS | Division of Mathematical Sciences

Publisher

Proceedings of the National Academy of Sciences

Subject

Multidisciplinary

Reference70 articles.

1. A selective overview of variable selection in high dimensional feature space;Fan;Stat. Sin.,2010

2. High-dimensional classification using features annealed independence rules

3. Adjusting batch effects in microarray expression data using empirical Bayes methods

4. Discovering the false discovery rate

5. The control of the false discovery rate in multiple testing under dependency

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Data Science Methods for Real-World Evidence Generation in Real-World Data;Annual Review of Biomedical Data Science;2024-08-23

2. Neural interval‐censored survival regression with feature selection;Statistical Analysis and Data Mining: The ASA Data Science Journal;2024-07-16

3. DeepPIG: deep neural network architecture with pairwise connected layers and stochastic gates using knockoff frameworks for feature selection;Scientific Reports;2024-07-06

4. Computational frameworks integrating deep learning and statistical models in mining multimodal omics data;Journal of Biomedical Informatics;2024-04

5. Factor Augmented Sparse Throughput Deep ReLU Neural Networks for High Dimensional Regression;Journal of the American Statistical Association;2023-10-18