Affiliation:
1. Centre for Applied Mathematics and Bioinformatics (CAMB), Department of Mathematics and Natural Sciences, Gulf University for Science and Technology, Mubarak Al-Abdullah, Kuwait
2. Department of Chemical Engineering and Biotechnology, Cambridge University, Cambridge, UK
3. Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA
Abstract
Developing new ways to estimate probabilities can be valuable for science, statistics, engineering, and other fields. By considering the information content of different output patterns, recent work invoking algorithmic information theory inspired arguments has shown that a priori probability predictions based on pattern complexities can be made in a broad class of input-output maps. These algorithmic probability predictions do not depend on a detailed knowledge of how output patterns were produced, or historical statistical data. Although quantitatively fairly accurate, a main weakness of these predictions is that they are given as an upper bound on the probability of a pattern, but many low complexity, low probability patterns occur, for which the upper bound has little predictive value. Here, we study this low complexity, low probability phenomenon by looking at example maps, namely a finite state transducer, natural time series data, RNA molecule structures, and polynomial curves. Some mechanisms causing low complexity, low probability behaviour are identified, and we argue this behaviour should be assumed as a default in the real-world algorithmic probability studies. Additionally, we examine some applications of algorithmic probability and discuss some implications of low complexity, low probability patterns for several research areas including simplicity in physics and biology, a priori probability predictions, Solomonoff induction and Occam’s razor, machine learning, and password guessing.
Funder
Gulf University for Science and Technology Seed Grant
Subject
Multidisciplinary,General Computer Science
Reference60 articles.
1. A preliminary report on a general theory of inductive inference (revision of report v-131);R. J. Solomonoff;Contract AF,1960
2. Three approaches to the quantitative definition of information;A. N. Kolmogorov;Problems of Information Transmission,1965
3. A Theory of Program Size Formally Identical to Information Theory
4. An Introduction to Kolmogorov Complexity and Its Applications
5. Laws of information conservation (nongrowth) and aspects of the foundation of probability theory;L. A. Levin;Problemy Peredachi Informatsii,1974