Cost-Aware Generalized α-Investing for Multiple Hypothesis Testing-Reference-Cited by-同舟云学术

Cost-Aware Generalized α-Investing for Multiple Hypothesis Testing

Published:2024 Issue: Volume: Page:155-174
ISSN:2693-7166
Container-title:The New England Journal of Statistics in Data Science
language:en
Short-container-title:

Author:

Cook Thomas,Dubey Harsh Vardhan,Lee Ji Ah,Zhu Guangyu,Zhao Tingting,Flaherty Patrick

Abstract

We consider the problem of sequential multiple hypothesis testing with nontrivial data collection costs. This problem appears, for example, when conducting biological experiments to identify differentially expressed genes of a disease process. This work builds on the generalized α-investing framework which enables control of the marginal false discovery rate in a sequential testing setting. We make a theoretical analysis of the long term asymptotic behavior of α-wealth which motivates a consideration of sample size in the α-investing decision rule. Posing the testing process as a game with nature, we construct a decision rule that optimizes the expected α-wealth reward (ERO) and provides an optimal sample size for each test. Empirical results show that a cost-aware ERO decision rule correctly rejects more false null hypotheses than other methods for $n=1$ where n is the sample size. When the sample size is not fixed cost-aware ERO uses a prior on the null hypothesis to adaptively allocate of the sample budget to each test. We extend cost-aware ERO investing to finite-horizon testing which enables the decision rule to allocate samples in a non-myopic manner. Finally, empirical tests on real data sets from biological experiments show that cost-aware ERO balances the allocation of samples to an individual test against the allocation of samples across multiple tests.

Publisher

New England Statistical Society

Reference35 articles.

1. Generalized α-investing: definitions, optimality results and application to public databases;Journal of the Royal Statistical Society: Series B (Statistical Methodology),2014

2. Bayes and Minimax Solutions of Sequential Decision Problems

3. Controlling the false discovery rate: a practical and powerful approach to multiple testing;Journal of the Royal statistical society: series B (Methodological),1995

4. Adaptive linear step-up procedures that control the false discovery rate;Biometrika,2006

5. Statistical Decision Theory and Bayesian Analysis

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Editorial. Game-Theoretic Statistics and Safe Anytime-Valid Inference;The New England Journal of Statistics in Data Science;2024