Finite State Automata and Simple Recurrent Networks-Reference-Cited by-同舟云学术

Finite State Automata and Simple Recurrent Networks

Published:1989-09 Issue:3 Volume:1 Page:372-381
ISSN:0899-7667
Container-title:Neural Computation
language:en
Short-container-title:Neural Computation

Author:

Cleeremans Axel¹,Servan-Schreiber David²,McClelland James L.¹

Affiliation:

1. Department of Psychology, Carnegie-Mellon University, Pittsburgh, PA 15213 USA

2. Department of Computer Science, Carnegie-Mellon University, Pittsburgh, PA 15213 USA

Abstract

We explore a network architecture introduced by Elman (1988) for predicting successive elements of a sequence. The network uses the pattern of activation over a set of hidden units from time-step t−1, together with element t, to predict element t + 1. When the network is trained with strings from a particular finite-state grammar, it can learn to be a perfect finite-state recognizer for the grammar. When the network has a minimal number of hidden units, patterns on the hidden units come to correspond to the nodes of the grammar, although this correspondence is not necessary for the network to act as a perfect finite-state recognizer. We explore the conditions under which the network can carry information about distant sequential contingencies across intervening elements. Such information is maintained with relative ease if it is relevant at each intermediate step; it tends to be lost when intervening elements do not depend on it. At first glance this may suggest that such networks are not relevant to natural language, in which dependencies may span indefinite distances. However, embeddings in natural language are not completely independent of earlier information. The final simulation shows that long distance sequential contingencies can be encoded by the network even if only subtle statistical properties of embedded strings depend on the early information.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/neco.1989.1.3.372

Reference2 articles.

1. Implicit learning of artificial grammars

2. Learning representations by back-propagating errors

Cited by 299 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The Use of Attention-Enhanced CNN-LSTM Models for Multi-Indicator and Time-Series Predictions of Surface Water Quality;Water Resources Management;2024-08-09

2. An integrated deep neural network model combining 1D CNN and LSTM for structural health monitoring utilizing multisensor time-series data;Structural Health Monitoring;2024-03-26

3. Forecasting PM2.5 Concentration Using Gradient-Boosted Regression Tree with CNN Learning Model;Optical Memory and Neural Networks;2024-03

4. A provably stable neural network Turing Machine with finite precision and time;Information Sciences;2024-02

5. Random forest and artificial neural network-based tsunami forests classification using data fusion of Sentinel-2 and Airbus Vision-1 satellites: A case study of Garhi Chandan, Pakistan;Open Geosciences;2024-01-01