Planning and Learning with Stochastic Action Sets

Published: 2018-07
Container-title: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence

Author:

Boutilier Craig¹, Cohen Alon¹, Hassidim Avinatan¹, Mansour Yishay¹, Meshi Ofer¹, Mladenov Martin¹, Schuurmans Dale¹

Affiliation:

1. Google Research

Abstract

In many practical uses of reinforcement learning (RL) the set of actions available at a given state is a random variable, with realizations governed by an exogenous stochastic process. Somewhat surprisingly, the foundations for such sequential decision processes have been unaddressed. In this work, we formalize and investigate MDPs with stochastic action sets (SAS-MDPs) to provide these foundations. We show that optimal policies and value functions in this model have a structure that admits a compact representation. From an RL perspective, we show that Q-learning with sampled action sets is sound. In model-based settings, we consider two important special cases: when individual actions are available with independent probabilities, and a sampling-based model for unknown distributions. We develop polynomial-time value and policy iteration methods for both cases, and provide a polynomial-time linear programming solution for the first case.
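
As an illustration of the "Q-learning with sampled action sets" idea mentioned in the abstract, the following is a minimal tabular sketch under stated assumptions, not the authors' implementation. It assumes the bootstrap maximum is taken only over the action set realized at the next state. The names `q_update` and `sample_action_set` are hypothetical, and the availability model (each action independently available, resampling when the set comes out empty) is an assumption made here so that the example runs.

```python
import random
from collections import defaultdict


def q_update(Q, s, a, r, s_next, next_actions, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step.

    The max in the target ranges only over `next_actions`, the action set
    realized (sampled) at `s_next`. Assumes `next_actions` is non-empty.
    """
    best_next = max(Q[(s_next, b)] for b in next_actions)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
    return Q[(s, a)]


def sample_action_set(actions, p_available, rng):
    """Hypothetical availability model (an assumption for this sketch).

    Each action is independently available with probability
    p_available[action]; an empty draw is resampled, so at least one
    probability must be positive.
    """
    while True:
        available = [a for a in actions if rng.random() < p_available[a]]
        if available:
            return available


if __name__ == "__main__":
    rng = random.Random(0)
    Q = defaultdict(float)
    actions = ["left", "right"]
    p_available = {"left": 0.9, "right": 0.5}
    # A single illustrative transition with a made-up reward.
    next_set = sample_action_set(actions, p_available, rng)
    q_update(Q, "s0", "left", 1.0, "s1", next_set)
    print(dict(Q))
```

The only difference from standard Q-learning in this sketch is the argument `next_actions`: actions that were not available in the sampled set do not contribute to the target.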

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 4 articles.

1. Dray-Q: Demand-dependent trailer repositioning using deep reinforcement learning; Transportation Research Part C: Emerging Technologies; 2024-06

2. Efficient Learning of High Level Plans from Play; 2023 IEEE International Conference on Robotics and Automation (ICRA); 2023-05-29

3. A bibliometric analysis and review on reinforcement learning for transportation applications; Transportmetrica B: Transport Dynamics; 2023-03-02

4. Reinforcement Learning Framework for Server Placement and Workload Allocation in Multiaccess Edge Computing; IEEE Internet of Things Journal; 2023-01-15