Large-Scale Simultaneous Inference with Hypothesis Testing: Multiple Testing Procedures in Practice

Published: 2019-05-15 Issue: 2 Volume: 1 Page: 653-683
ISSN: 2504-4990
Container-title: Machine Learning and Knowledge Extraction
Language: en
Short-container-title: MAKE

Author:

Emmert-Streib Frank, Dehmer Matthias

Abstract

A statistical hypothesis test is one of the most eminent methods in statistics. Its pivotal role comes from the wide range of practical problems it can be applied to and the sparsity of data requirements. Being an unsupervised method makes it very flexible in adapting to real-world situations. The availability of high-dimensional data makes it necessary to apply such statistical hypothesis tests simultaneously to the test statistics of the underlying covariates. However, if applied without correction, this leads to an inevitable increase in Type 1 errors. To counteract this effect, multiple testing procedures have been introduced to control various types of errors, most notably the Type 1 error. In this paper, we review modern multiple testing procedures for controlling either the family-wise error rate (FWER) or the false-discovery rate (FDR). We emphasize their principal approaches, which allow them to be categorized as (1) single-step vs. stepwise approaches, (2) adaptive vs. non-adaptive approaches, and (3) marginal vs. joint multiple testing procedures. We place a particular focus on procedures that can deal with data with a (strong) correlation structure because real-world data are rarely uncorrelated. Furthermore, we also provide background information making the often technically intricate methods accessible for interdisciplinary data scientists.
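
To make the distinction in the abstract concrete, the following is a minimal sketch, not the authors' code, of one single-step FWER procedure (Bonferroni) and one stepwise FDR procedure (Benjamini-Hochberg), together with the growth of the uncorrected family-wise error under the assumption of independent tests. The function names and the p-values are hypothetical and chosen only for illustration; Benjamini-Hochberg as written assumes independent or positively dependent test statistics.

```python
from typing import List


def fwer_uncorrected(m: int, alpha: float = 0.05) -> float:
    """Probability of at least one Type 1 error across m independent tests,
    each run at level alpha, when all null hypotheses are true."""
    return 1.0 - (1.0 - alpha) ** m


def bonferroni(p_values: List[float], alpha: float = 0.05) -> List[bool]:
    """Single-step FWER control: reject H_i when p_i <= alpha / m."""
    m = len(p_values)
    return [p <= alpha / m for p in p_values]


def benjamini_hochberg(p_values: List[float], alpha: float = 0.05) -> List[bool]:
    """Step-up FDR control: find the largest rank k with p_(k) <= k * alpha / m
    and reject the k hypotheses with the smallest p-values."""
    m = len(p_values)
    # Indices of the hypotheses, sorted by ascending p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * alpha / m:
            k_max = rank
    reject = [False] * m
    for idx in order[:k_max]:
        reject[idx] = True
    return reject


# Hypothetical p-values, for illustration only.
p_values = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205]

bonf_reject = bonferroni(p_values)
bh_reject = benjamini_hochberg(p_values)
```

On these illustrative inputs, Bonferroni rejects only the smallest p-value while Benjamini-Hochberg rejects the two smallest, reflecting that FDR control is typically less conservative than FWER control.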

Publisher

MDPI AG

Subject

General Economics, Econometrics and Finance

Link

https://www.mdpi.com/2504-4990/1/2/39/pdf

Reference: 88 articles.

1. Challenges of Big Data analysis

2. Data Science and its Relationship to Big Data and Data-Driven Decision Making

3. What is data science? Fundamental concepts and a heuristic example; Hayashi; 1998

4. Data Science: an Action Plan for Expanding the Technical Areas of the Field of Statistics

5. Data Science in Statistics Curricula: Preparing Students to “Think with Data”

Cited by 9 articles.

1. Broken Rotor Bar Detection Based on Steady-State Stray Flux Signals Using Triaxial Sensor with Random Positioning; Sensors; 2024-05-12

2. Toxicity of ciprofloxacin and ofloxacin to Moina macrocopa and investigation of p-value adjustments for (eco)toxicological studies; Drug and Chemical Toxicology; 2023-07-25

3. Analyzing the Scholarly Literature of Digital Twin Research: Trends, Topics and Structure; IEEE Access; 2023

4. Hypothesis Testing; Elements of Data Science, Machine Learning, and Artificial Intelligence Using R; 2023

5. Plasma Protein Levels Analysis in Multiple Sclerosis Sardinian Families Identified C9 and CYP24A1 as Candidate Biomarkers; Life; 2022-01-20