Approximate Dynamic Programming and Reinforcement Learning

Published: 2010 Pages: 3-44
ISSN: 1860-949X
Container-title: Interactive Collaborative Information Systems

Author:

Buşoniu Lucian, De Schutter Bart, Babuška Robert

Publisher

Springer Berlin Heidelberg

Link

http://link.springer.com/content/pdf/10.1007/978-3-642-11688-9_1

References: 92 articles.

1. Baddeley, B.: Reinforcement learning in continuous time and space: Interference and not ill conditioning is the main problem when using distributed function approximators. IEEE Transactions on Systems, Man, and Cybernetics—Part B: Cybernetics 38(4), 950–956 (2008)

2. Barash, D.: A genetic search in policy space for solving Markov decision processes. In: AAAI Spring Symposium on Search Techniques for Problem Solving under Uncertainty and Incomplete Information. Palo Alto, US (1999)

3. Barto, A.G., Sutton, R.S., Anderson, C.W.: Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics 13(5), 833–846 (1983)

4. Baxter, J., Bartlett, P.L.: Infinite-horizon policy-gradient estimation. Journal of Artificial Intelligence Research 15, 319–350 (2001)

5. Berenji, H.R., Khedkar, P.: Learning and tuning fuzzy logic controllers through reinforcements. IEEE Transactions on Neural Networks 3(5), 724–740 (1992)

Cited by 8 articles.

1. Perspective view of autonomous control in unknown environment: Dual control for exploitation and exploration vs reinforcement learning;Neurocomputing;2022-08

2. Control of Magnetic Manipulator Using Reinforcement Learning Based on Incrementally Adapted Local Linear Models;Complexity;2021-12-20

3. Data-Intensive Workflow Management: For Clouds and Data-Intensive and Scalable Computing Environments;Synthesis Lectures on Data Management;2019-05-13

4. Dynamic heuristic acceleration of linearly approximated SARSA(λ): using ant colony optimization to learn heuristics dynamically;Journal of Heuristics;2019-05-03

5. The Role of Machine Learning and Radio Reconfigurability in the Quest for Wireless Security;Proactive and Dynamic Network Defense;2019