Exploration bonuses and dual control-Reference-Cited by-同舟云学术

Exploration bonuses and dual control

Published:1996-10 Issue:1 Volume:25 Page:5-22
ISSN:0885-6125
Container-title:Machine Learning
language:en
Short-container-title:Mach Learn

Author:

Dayan Peter,Sejnowski Terrence J.

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

http://link.springer.com/content/pdf/10.1007/BF00115298.pdf

Reference28 articles.

1. BartoA.G., BradtkeS.J. & SinghS.P. (1995). Learning to act using real-time dynamic programming.Artificial Intelligence,72, 81?138.

2. BartoA.G., SuttonR.S. & WatkinsC.J.C.H. (1989). Learning and sequential decision making. In MGabriel & JMoore, editors,Learning and Computational Neuroscience: Foundations of Adaptive Networks. Cambridge, MA: MIT Press, Bradford Books.

3. BertsekasD. & ShreveS.E. (1978).Stochastic Optimal Control: The Discrete Time Case. New York, NY: Academic Press.

4. CohnD.A. (1994). Neural network exploration using optimal experiment design. In JDCowan, GTesauro & JAllspector, editors,Advances in Neural Information Processing Systems, 6. San Mateo, CA: Morgan Kaufmann, 679?686.

5. Techical Report;J.M. Cozzolino,1965

Cited by 47 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Curiosity: primate neural circuits for novelty and information seeking;Nature Reviews Neuroscience;2024-01-23

2. Population-based exploration in reinforcement learning through repulsive reward shaping using eligibility traces;Annals of Operations Research;2024-01-18

3. Risking your Tail: Modeling Individual Differences in Risk-sensitive Exploration using Bayes Adaptive Markov Decision Processes;2024-01-08

4. Active uncertainty reduction for safe and efficient interaction planning: A shielding-aware dual control approach;The International Journal of Robotics Research;2023-12-20

5. Reinforcement Learning with An Abrupt Model Change;2023 Winter Simulation Conference (WSC);2023-12-10