Reinforcement Learning under Threats

Published:2019-07-17 Issue: Volume:33 Page:9939-9940
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Gallego Victor, Naveiro Roi, Insua David Rios

Abstract

In several reinforcement learning (RL) scenarios, mainly in security settings, there may be adversaries trying to interfere with the reward-generating process. However, in such non-stationary environments, Q-learning leads to suboptimal results (Busoniu, Babuska, and De Schutter 2010). Previous game-theoretical approaches to this problem have focused on modeling the whole multi-agent system as a game. Instead, we address the problem of prescribing decisions to a single agent (the supported decision maker, DM) against a potential threat model (the adversary). We augment the MDP to account for this threat, introducing Threatened Markov Decision Processes (TMDPs). Furthermore, we propose a level-k thinking scheme resulting in a new learning framework to deal with TMDPs. We empirically test our framework, showing the benefits of opponent modeling.
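
The abstract does not spell out the learning rule, so the following is only a minimal sketch of one way opponent modeling can be combined with Q-learning, not the authors' algorithm. It assumes a tabular Q-function indexed by (state, own action, adversary action), an adversary model built from empirical action frequencies, and a decision maker that picks the action with the highest expected Q-value under that model. The class name `OpponentAwareQLearner`, the `demo` matching game, and all parameter values are hypothetical and chosen for illustration.

```python
import random
from collections import defaultdict


class OpponentAwareQLearner:
    """Tabular Q-learner indexed by (state, own action, adversary action).

    Illustrative assumption: the adversary is modeled by smoothed
    empirical frequencies of its observed actions in each state.
    """

    def __init__(self, actions, opp_actions, alpha=0.1, gamma=0.9,
                 epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.opp_actions = list(opp_actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)              # (s, a, b) -> value estimate
        self.counts = defaultdict(lambda: 1.0)   # (s, b) -> pseudo-count, uniform prior
        self.rng = random.Random(seed)

    def opp_probs(self, s):
        """Estimated probability of each adversary action in state s."""
        total = sum(self.counts[(s, b)] for b in self.opp_actions)
        return {b: self.counts[(s, b)] / total for b in self.opp_actions}

    def expected_q(self, s, a):
        """Q-value of action a averaged over the adversary model."""
        p = self.opp_probs(s)
        return sum(p[b] * self.q[(s, a, b)] for b in self.opp_actions)

    def act(self, s):
        """Epsilon-greedy choice on the expected Q-values."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.expected_q(s, a))

    def update(self, s, a, b, r, s_next):
        """Update the adversary model, then apply a Q-learning step."""
        self.counts[(s, b)] += 1.0
        best_next = max(self.expected_q(s_next, a2) for a2 in self.actions)
        target = r + self.gamma * best_next
        self.q[(s, a, b)] += self.alpha * (target - self.q[(s, a, b)])


def demo(steps=2000, seed=1):
    """Hypothetical one-state matching game: reward +1 if the DM matches
    the adversary's action, -1 otherwise; the adversary plays 0 with
    probability 0.8."""
    env_rng = random.Random(seed + 1)
    agent = OpponentAwareQLearner([0, 1], [0, 1], gamma=0.0, seed=seed)
    for _ in range(steps):
        a = agent.act(0)
        b = 0 if env_rng.random() < 0.8 else 1
        r = 1.0 if a == b else -1.0
        agent.update(0, a, b, r, 0)
    return agent
```

Calling `demo()` returns the trained agent; comparing `agent.expected_q(0, 0)` with `agent.expected_q(0, 1)` shows which action the learner prefers once it has estimated the adversary's bias. A level-k scheme would, under the same assumptions, replace the frequency model with a simulated adversary that itself reasons one level lower.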

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 8 articles.

1. Manipulating hidden-Markov-model inferences by corrupting batch data;Computers & Operations Research;2024-02

2. Improving Agent Decision Payoffs via a New Framework of Opponent Modeling;Mathematics;2023-07-11

3. COME: Learning to Coordinate Crowdsourcing and Regular Couriers for Offline Delivery During Online Mega Sale Days;2023 IEEE 39th International Conference on Data Engineering (ICDE);2023-04

4. Defense and security planning under resource uncertainty and multi‐period commitments;Naval Research Logistics (NRL);2022-08-08

5. Configurable Environments in Reinforcement Learning: An Overview;Special Topics in Information Technology;2022