Bellman's principle of optimality and deep reinforcement learning for time-varying tasks

Published:2021-04-16 Page:1-12
ISSN:0020-7179
Container-title:International Journal of Control
language:en
Short-container-title:International Journal of Control

Author:

Giuseppi Alessandro¹, Pietrabissa Antonio¹

Affiliation:

1. Department of Computer, Control, and Management Engineering “Antonio Ruberti”, University of Rome “La Sapienza”, Rome, Italy

Publisher

Informa UK Limited

Subject

Computer Science Applications,Control and Systems Engineering

Link

https://www.tandfonline.com/doi/pdf/10.1080/00207179.2021.1913516

Reference: 42 articles.

1. Neuronlike adaptive elements that can solve difficult learning control problems

2. Dynamic Programming

3. Borsa, D., Graepel, T. & Shawe-Taylor, J. (2016). Learning shared representations in multi-task reinforcement learning. Retrieved from http://arxiv.org/abs/1603.02041.

4. Boyan, J. A. & Littman, M. L. (2001). Exact solutions to time-dependent MDPs. In Proceedings of the 13th International Conference on Neural Information Processing Systems, Denver, CO (pp. 982–988). MIT Press.

Cited by 4 articles.

1. Coordination Across Expert Areas;Green Energy and Technology;2024

2. Reducing the Learning Time of Reinforcement Learning for the Supervisory Control of Discrete Event Systems;IEEE Access;2023

3. Opponent cart-pole dynamics for reinforcement learning of competing agents;Acta Mechanica Sinica;2022-03-10

4. A Lyapunov-based version of the value iteration algorithm formulated as a discrete-time switched affine system;International Journal of Control;2021-11-22