A logical analysis of null hypothesis significance testing using popular terminology

Author:

McNulty Richard

Abstract

Abstract Background Null Hypothesis Significance Testing (NHST) has been well criticised over the years yet remains a pillar of statistical inference. Although NHST is well described in terms of statistical models, most textbooks for non-statisticians present the null and alternative hypotheses (H0 and HA, respectively) in terms of differences between groups such as (μ1 = μ2) and (μ1 ≠ μ2) and HA is often stated to be the research hypothesis. Here we use propositional calculus to analyse the internal logic of NHST when couched in this popular terminology. The testable H0 is determined by analysing the scope and limits of the P-value and the test statistic’s probability distribution curve. Results We propose a minimum axiom set NHST in which it is taken as axiomatic that H0 is rejected if P-value< α. Using the common scenario of the comparison of the means of two sample groups as an example, the testable H0 is {(μ1 = μ2) and [($$\overline{x}$$ x ¯ 1$$\overline{x}$$ x ¯ 2) due to chance alone]}. The H0 and HA pair should be exhaustive to avoid false dichotomies. This entails that HA is ¬{(μ1 = μ2) and [($$\overline{x}$$ x ¯ 1$$\overline{x}$$ x ¯ 2) due to chance alone]}, rather than the research hypothesis (HT). To see the relationship between HA and HT, HA can be rewritten as the disjunction HA: ({(μ1 = μ2) ∧ [($$\overline{x}$$ x ¯ 1$$\overline{x}$$ x ¯ 2) not due to chance alone]} ∨ {(μ1 ≠ μ2) ∧ [$$(\overline{x}$$ ( x ¯ 1$$\overline{x}$$ x ¯ 2) not due to (μ1 ≠ μ2) alone]} ∨ {(μ1  μ2) ∧ [($$\overline{\boldsymbol{x}}$$ x ¯ 1$$\overline{\boldsymbol{x}}$$ x ¯ 2) due to (μ1  μ2) alone]}). This reveals that HT (the last disjunct in bold) is just one possibility within HA. It is only by adding premises to NHST that HT or other conclusions can be reached. Conclusions Using this popular terminology for NHST, analysis shows that the definitions of H0 and HA differ from those found in textbooks. In this framework, achieving a statistically significant result only justifies the broad conclusion that the results are not due to chance alone, not that the research hypothesis is true. More transparency is needed concerning the premises added to NHST to rig particular conclusions such as HT. There are also ramifications for the interpretation of Type I and II errors, as well as power, which do not specifically refer to HT as claimed by texts.

Publisher

Springer Science and Business Media LLC

Subject

Health Informatics,Epidemiology

Reference47 articles.

1. Daniel WW. Biostatistics : a foundation for analysis in the health sciences. 9th ed. Hoboken: Wiley; 2009.

2. Munro BH, Page EB. Statistical methods for health care research, vol. xi. 2nd ed. Philadelphia: Lippincott; 1993. p. 403.

3. Gallin JI, Ognibene FP, Johnson LL. Principles and practice of clinical research, vol. xvii. 4th ed. London: Academic Press; 2018. p. 80.

4. Mann PS, Lacke CJ. Introductory statistics, vol. xx. 7th ed. Hoboken: Wiley; 2010. p. 116.

5. Sullivan LM. Essentials of biostatistics in public health, vol. xii. 3rd ed. Burlington: Jones & Bartlett Learning; 2018. p. 376.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3