Einstein from Noise: Statistical Analysis-Reference-Cited by-同舟云学术

Einstein from Noise: Statistical Analysis

Published:2024-07-07 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Balanov Amnon,Huleihel Wasim,Bendory Tamir

Abstract

Abstract“Einstein from noise” (EfN) is a prominent example of the model bias phenomenon: systematic errors in the statistical model that lead to erroneous but consistent estimates. In the EfN experiment, one falsely believes that a set of observations contains noisy, shifted copies of a template signal (e.g., an Einstein image), whereas in reality, it contains only pure noise observations. To estimate the signal, the observations are first aligned with the template using cross-correlation, and then averaged. Although the observations contain nothing but noise, it was recognized early on that this process produces a signal that resembles the template signal! This pitfall was at the heart of a central scientific controversy about validation techniques in structural biology.This paper provides a comprehensive statistical analysis of the EfN phenomenon above. We show that the Fourier phases of the EfN estimator (namely, the average of the aligned noise observations) converge to the Fourier phases of the template signal, explaining the observed structural similarity. Additionally, we prove that the convergence rate is inversely proportional to the number of noise observations and, in the high-dimensional regime, to the Fourier magnitudes of the template signal. Moreover, in the high-dimensional regime, the Fourier magnitudes converge to a scaled version of the template signal’s Fourier magnitudes. This work not only deepens the theoretical understanding of the EfN phenomenon but also highlights potential pitfalls in template matching techniques and emphasizes the need for careful interpretation of noisy observations across disciplines in engineering, statistics, physics, and biology.

Publisher

Cold Spring Harbor Laboratory

Reference50 articles.

1. Ashley Aberneithy . Automatic detection of calcified nodules of patients with tuberculous. University College, London, 2007.

2. Robert J Adler and Jonathan E Taylor . Random fields and geometry. Springer Science & Business Media, 2009.

3. An industrial visual inspection system that uses inductive learning;Journal of Intelligent Manufacturing,2004

4. Jean-Marc Azaïs and Mario Wschebor . Level sets and extrema of random processes and fields. John Wiley & Sons, 2009.

5. Estimation under group actions: recovering orbits from invariants;Applied and Computational Harmonic Analysis,2023