Tactics of Adversarial Attack on Deep Reinforcement Learning Agents

Published: 2017-08
Container-title: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence

Author:

Lin Yen-Chen¹, Hong Zhang-Wei¹, Liao Yuan-Hong¹, Shih Meng-Li¹, Liu Ming-Yu², Sun Min¹

Affiliation:

1. National Tsing Hua University

2. Nvidia

Abstract

We introduce two tactics, namely the strategically-timed attack and the enchanting attack, for using adversarial examples to attack agents trained by deep reinforcement learning algorithms. In the strategically-timed attack, the adversary aims at minimizing the agent's reward by attacking the agent at only a small subset of time steps in an episode. Limiting the attack activity to this subset helps prevent detection of the attack by the agent. We propose a novel method to determine when an adversarial example should be crafted and applied. In the enchanting attack, the adversary aims at luring the agent to a designated target state. This is achieved by combining a generative model and a planning algorithm: while the generative model predicts the future states, the planning algorithm generates a preferred sequence of actions for luring the agent. A sequence of adversarial examples is then crafted to lure the agent to take the preferred sequence of actions. We apply the proposed tactics to agents trained by state-of-the-art deep reinforcement learning algorithms, including DQN and A3C. In 5 Atari games, our strategically-timed attack reduces the reward as much as the uniform attack (i.e., attacking at every time step) does while attacking the agent 4 times less often. Our enchanting attack lures the agent toward designated target states with a more than 70% success rate. Example videos are available at http://yclin.me/adversarial_attack_RL/.
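
The abstract does not spell out how the attack times are chosen. As a rough illustration only, the sketch below assumes a preference-gap heuristic: attack at a time step when the gap between the agent's most and least preferred action probabilities reaches a threshold. The function names, the threshold `beta`, and the episode data are all hypothetical and are not taken from this record.

```python
def preference_gap(action_probs):
    """Gap between the most and least preferred action probabilities."""
    return max(action_probs) - min(action_probs)


def select_attack_steps(policy_outputs, beta):
    """Return the time steps whose preference gap is at least beta.

    policy_outputs: one action-probability list per time step.
    beta: hypothetical threshold; a larger value means fewer attacks.
    """
    return [t for t, probs in enumerate(policy_outputs)
            if preference_gap(probs) >= beta]


# Hypothetical per-step action distributions for a 4-step episode.
episode = [
    [0.25, 0.25, 0.25, 0.25],  # indifferent agent: gap of 0
    [0.90, 0.05, 0.05, 0.00],  # strong preference: gap of about 0.9
    [0.40, 0.30, 0.20, 0.10],  # mild preference: gap of about 0.3
    [0.05, 0.85, 0.05, 0.05],  # strong preference: gap of about 0.8
]

attack_steps = select_attack_steps(episode, beta=0.5)
print(attack_steps)
```

Under these assumptions only the two steps with a strong action preference are selected, which mirrors the idea of attacking a small subset of time steps rather than every step. Crafting the adversarial example itself is outside the scope of this sketch.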

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 91 articles.

1. Adversarial robustness of deep reinforcement learning-based intrusion detection;International Journal of Information Security;2024-08-29

2. Adversarial Attacks in Machine Learning: Key Insights and Defense Approaches;Applied Data Science and Analysis;2024-08-07

3. Similar Locality Based Transfer Evolutionary Optimization for Minimalistic Attacks;2024 IEEE Congress on Evolutionary Computation (CEC);2024-06-30

4. Toward Evaluating Robustness of Reinforcement Learning with Adversarial Policy;2024 54th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN);2024-06-24

5. Mitigating Deep Reinforcement Learning Backdoors in the Neural Activation Space;2024 IEEE Security and Privacy Workshops (SPW);2024-05-23