Forward and inverse reinforcement learning sharing network weights and hyperparameters-Reference-Cited by-同舟云学术

Forward and inverse reinforcement learning sharing network weights and hyperparameters

Published:2021-12 Issue: Volume:144 Page:138-153
ISSN:0893-6080
Container-title:Neural Networks
language:en
Short-container-title:Neural Networks

Author:

Uchibe Eiji,Doya Kenji

Publisher

Elsevier BV

Subject

Artificial Intelligence,Cognitive Neuroscience

Reference79 articles.

1. P. Abbeel, A. Y. Ng, (2004). Apprenticeship learning via inverse reinforcement learning. In Proc. of the 21st International Conference on Machine Learning.

2. Z. Ahmed, M. N. N. Le Roux, D. Schuurmans, (2019). Understanding the impact of entropy on policy optimization. In Proc. of the 36th International Conference on Machine Learning pp.151–160.

3. R. Amit, R. Meir, K. Ciosek, (2020). Discount Factor as a Regularizer in Reinforcement Learning. In Proc. of the 37th International Conference on Machine Learning.

4. Multiple tracking and machine learning reveal dopamine modulation for area-restricted foraging behaviors via velocity change in caenorhabditis elegans;Ashida;Neuroscience Letters,2019

5. Dynamic policy programming;Azar;Journal of Machine Learning Research,2012

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Estimating cost function of expert players in differential games: A model-based method and its data-driven extension;Expert Systems with Applications;2024-12

2. Maze-solving in a plasma system based on functional analogies to reinforcement-learning model;PLOS ONE;2024-04-10

3. Online estimation of objective function for continuous-time deterministic systems;Neural Networks;2024-04

4. Cross-domain policy adaptation with dynamics alignment;Neural Networks;2023-10

5. Robotic arm trajectory tracking method based on improved proximal policy optimization;Proceedings of the Romanian Academy, Series A: Mathematics, Physics, Technical Sciences, Information Science;2023-09-30