Probabilistic Deterministic Finite Automata and Recurrent Networks, Revisited-Reference-Cited by-同舟云学术

Probabilistic Deterministic Finite Automata and Recurrent Networks, Revisited

Published:2022-01-06 Issue:1 Volume:24 Page:90
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Marzen Sarah E.,Crutchfield James P.^ORCID

Abstract

Reservoir computers (RCs) and recurrent neural networks (RNNs) can mimic any finite-state automaton in theory, and some workers demonstrated that this can hold in practice. We test the capability of generalized linear models, RCs, and Long Short-Term Memory (LSTM) RNN architectures to predict the stochastic processes generated by a large suite of probabilistic deterministic finite-state automata (PDFA) in the small-data limit according to two metrics: predictive accuracy and distance to a predictive rate-distortion curve. The latter provides a sense of whether or not the RNN is a lossy predictive feature extractor in the information-theoretic sense. PDFAs provide an excellent performance benchmark in that they can be systematically enumerated, the randomness and correlation structure of their generated processes are exactly known, and their optimal memory-limited predictors are easily computed. With less data than is needed to make a good prediction, LSTMs surprisingly lose at predictive accuracy, but win at lossy predictive feature extraction. These results highlight the utility of causal states in understanding the capabilities of RNNs to predict.

Funder

United States Air Force Office of Scientific Research

U. S. Army Research Office

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/24/1/90/pdf

Reference44 articles.

1. A Neural Substrate of Prediction and Reward

2. A framework for mesencephalic dopamine systems based on predictive Hebbian learning

3. Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects

4. Rate Distortion Theory;Berger,1971

5. Optimal causal inference: Estimating stored information and approximating causal architecture

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-Class Threat Detection Using Neural Network and Machine Learning Approaches in Kubernetes Environments;2024 6th International Conference on Computing and Informatics (ICCI);2024-03-06