Improving Generalization for Temporal Difference Learning: The Successor Representation-Reference-Cited by-同舟云学术

Improving Generalization for Temporal Difference Learning: The Successor Representation

Published:1993-07 Issue:4 Volume:5 Page:613-624
ISSN:0899-7667
Container-title:Neural Computation
language:en
Short-container-title:Neural Computation

Author:

Dayan Peter¹

Affiliation:

1. Computational Neurobiology Laboratory, The Salk Institute, P.O. Box 85800, San Diego, CA 92186-5800 USA

Abstract

Estimation of returns over time, the focus of temporal difference (TD) algorithms, imposes particular constraints on good function approximators or representations. Appropriate generalization between states is determined by how similar their successors are, and representations should follow suit. This paper shows how TD machinery can be used to learn such representations, and illustrates, using a navigation task, the appropriately distributed nature of the result.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/neco.1993.5.4.613

Reference5 articles.

1. A New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC)

2. The convergence of TD(?) for general ?

3. Q-learning

4. Consistency of HDP applied to a simple reinforcement learning problem

Cited by 371 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The spread of affective and semantic valence representations across states;Cognition;2024-03

2. Information content of note transitions in the music of J. S. Bach;Physical Review Research;2024-02-02

3. Predictable navigation through spontaneous brain states with cognitive-map-like representations;Progress in Neurobiology;2024-02

4. The Hippocampus in Pigeons Contributes to the Model-Based Valuation and the Relationship between Temporal Context States;Animals;2024-01-29

5. Neural representations of predicted events: Evidence from time-resolved EEG decoding;2024-01-05