Continuous deep Q-learning with a simulator for stabilization of uncertain discrete-time systems-Reference-Cited by-同舟云学术

Continuous deep Q-learning with a simulator for stabilization of uncertain discrete-time systems

Published:2021 Issue:4 Volume:12 Page:738-757
ISSN:2185-4106
Container-title:Nonlinear Theory and Its Applications, IEICE
language:en
Short-container-title:NOLTA

Author:

Ikemoto Junya¹^ORCID,Ushio Toshimitsu¹^ORCID

Affiliation:

1. Graduate School of Engineering Science, Osaka University

Publisher

Institute of Electronics, Information and Communications Engineers (IEICE)

Subject

Rehabilitation,Physical Therapy, Sports Therapy and Rehabilitation,General Medicine

Link

https://www.jstage.jst.go.jp/article/nolta/12/4/12_738/_pdf

Reference37 articles.

1. [1] H.K. Khalil, Nonlinear Systems, Prentice hall, 2002.

2. [2] R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction (second edition), MIT Press, 2018.

3. [3] C. Szepesvari, Algorithms for Reinforcement Learning (Synthesis Lectures on Artificial Intelligence and Machine Learning), Morgan and Claypool Publishers, 2010.

4. [4] J. Kober, J.A. Bagnell, and J. Peters, “Reinforcement learning in robotics: A survey,” The International Journal of Robotics Research, vol. 32, no. 11, pp. 1238-1274, August 2013.

5. [5] T. Kamio, S. Sugeo, K. Mitsubori, T. Tanaka, C.J. Ahn, H. Fujisaka, and K. Haeiwa, “A reinforcement learning approach to course decision of ships under navigation rules,” Proc. NOLTA'09, pp. 141-144, 2009.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Fast Learning for Multi-Agent with Combination of Imitation Learning and Model-Based Learning for Formation Change of Transport Robots;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

2. Deep Dyna-Q for Rapid Learning and Improved Formation Achievement in Cooperative Transportation;Automation;2023-07-10

3. Forecasting and stabilizing chaotic regimes in two macroeconomic models via artificial intelligence technologies and control methods;Chaos, Solitons & Fractals;2023-05