Authors
Warwick Masson, Pravesh Ranchod, George Konidaris
Abstract
We introduce a model-free algorithm for learning in Markov decision processes with parameterized actions—discrete actions with continuous parameters. At each step the agent must select both which action to use and which parameters to use with that action. We introduce the Q-PAMDP algorithm for learning in these domains, show that it converges to a local optimum, and compare it to direct policy search in the goal-scoring and Platform domains.
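The sketch below is a minimal illustration (not the paper's implementation) of the action-selection structure the abstract describes: the agent first chooses a discrete action and then supplies continuous parameters for it. All names (ACTION_PARAM_BOUNDS, select_parameterized_action, the "kick"/"move" actions) are hypothetical, chosen only to make the parameterized-action idea concrete.

```python
import numpy as np

# Hypothetical parameterized action space: each discrete action has its own
# continuous parameter bounds (e.g. a kick parameterized by power and direction).
ACTION_PARAM_BOUNDS = {
    "kick": np.array([[0.0, 1.0], [-1.0, 1.0]]),   # rows: [low, high] per parameter
    "move": np.array([[-1.0, 1.0], [-1.0, 1.0]]),
}

def select_parameterized_action(state, q_values, param_policies, epsilon=0.1):
    """Pick a discrete action (here, epsilon-greedily from a Q-function),
    then draw its continuous parameters from that action's parameter policy."""
    actions = list(ACTION_PARAM_BOUNDS)
    if np.random.rand() < epsilon:
        action = np.random.choice(actions)
    else:
        action = max(actions, key=lambda a: q_values(state, a))
    params = param_policies[action](state)          # continuous parameters for this action
    low, high = ACTION_PARAM_BOUNDS[action].T       # unpack per-parameter bounds
    return action, np.clip(params, low, high)       # keep parameters within range
```

In the paper's Q-PAMDP setting, the discrete choice and the parameter policies are improved in alternation; the snippet above only shows how a single step's composite action (discrete action plus continuous parameters) might be assembled.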
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
54 articles.