A Theory of Statistical Inference for Ensuring the Robustness of Scientific Results

Author:

Coker Beau1ORCID,Rudin Cynthia2ORCID,King Gary3ORCID

Affiliation:

1. Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts 02115;

2. Department of Computer Science and Department of Electrical and Computer Engineering, Duke University, Durham, North Carolina 27708;

3. Institute for Quantitative Social Science, Harvard University, Cambridge, Massachusetts 02138

Abstract

Inference is the process of using facts we know to learn about facts we do not know. A theory of inference gives assumptions necessary to get from the former to the latter, along with a definition for and summary of the resulting uncertainty. Any one theory of inference is neither right nor wrong but merely an axiom that may or may not be useful. Each of the many diverse theories of inference can be valuable for certain applications. However, no existing theory of inference addresses the tendency to choose, from the range of plausible data analysis specifications consistent with prior evidence, those that inadvertently favor one’s own hypotheses. Because the biases from these choices are a growing concern across scientific fields, and in a sense the reason the scientific community was invented in the first place, we introduce a new theory of inference designed to address this critical problem. We introduce hacking intervals, which are the range of a summary statistic one may obtain given a class of possible endogenous manipulations of the data. Hacking intervals require no appeal to hypothetical data sets drawn from imaginary superpopulations. A scientific result with a small hacking interval is more robust to researcher manipulation than one with a larger interval and is often easier to interpret than a classical confidence interval. Some versions of hacking intervals turn out to be equivalent to classical confidence intervals, which means they may also provide a more intuitive and potentially more useful interpretation of classical confidence intervals. This paper was accepted by J. George Shanthikumar, big data analytics.

Publisher

Institute for Operations Research and the Management Sciences (INFORMS)

Subject

Management Science and Operations Research,Strategy and Management

Reference46 articles.

1. Angwin J , Larson J , Mattu S , Kirchner L (2016) Machine bias. ProPublica (May 23), https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing.

2. Back BJ , Rodriguez LR , Boessenecker M , Calvo A , Castro A , Chittick HA , Eskin GC , (2017) Pretrial detention reform—recommendations to the Chief Justice. Technical report, Judicial Branch of California, Sacramento.

3. Raise standards for preclinical cancer research

Cited by 6 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. The Dataset Multiplicity Problem: How Unreliable Data Impacts Predictions;2023 ACM Conference on Fairness, Accountability, and Transparency;2023-06-12

2. A Computational Framework for Solving Nonlinear Binary Optimization Problems in Robust Causal Inference;INFORMS Journal on Computing;2022-11

3. A Robust Approach to Quantifying Uncertainty in Matching Problems of Causal Inference;INFORMS Journal on Data Science;2022-10

4. On the Existence of Simpler Machine Learning Models;2022 ACM Conference on Fairness, Accountability, and Transparency;2022-06-20

5. Synthetic Negative Controls: Using Simulation to Screen Large-scale Propensity Score Analyses;Epidemiology;2022-04-12

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3