Abstract
This study examines the factors and conditions that affect reinforcement learning performance and proposes a multi-agent DQN system (N-DQN) model to improve them. The N-DQN model is implemented on maze-solving and ping-pong tasks, both examples of delayed-reward environments in which standard DQN learning is difficult to apply. In the performance evaluation, the implemented N-DQN achieves about 3.5 times higher learning performance than the Q-Learning algorithm in a reward-sparse environment, and reaches the goal about 1.1 times faster than DQN. In addition, through the implementation of prioritized experience replay and a reward-acquisition section segmentation policy, the positive bias seen in existing reinforcement learning models seldom or never occurred. However, because the architecture runs many actors in parallel, additional research on making the system more lightweight is needed for further performance improvement. This paper describes in detail the structure of the proposed multi-agent N-DQN architecture, the algorithms used, and its implementation specification.
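The abstract does not give implementation details for the shared prioritized replay it mentions; as a rough illustration only, the following is a minimal sketch, in Python, of the common proportional-priority replay buffer that parallel actors in an N-DQN-style architecture could push into. The class name, the alpha and eps parameters, and the method names are illustrative assumptions, not the paper's code.

import random
from collections import namedtuple

Transition = namedtuple("Transition", "state action reward next_state done")

class PrioritizedReplayBuffer:
    # Proportional prioritized experience replay (a common variant;
    # the paper's exact scheme is not specified in the abstract).

    def __init__(self, capacity, alpha=0.6, eps=1e-5):
        self.capacity = capacity
        self.alpha = alpha      # how strongly priority skews sampling
        self.eps = eps          # keeps every priority strictly positive
        self.buffer = []
        self.priorities = []
        self.pos = 0

    def push(self, transition):
        # New transitions get the current maximum priority so each
        # experience is sampled at least once before being down-weighted.
        max_prio = max(self.priorities, default=1.0)
        if len(self.buffer) < self.capacity:
            self.buffer.append(transition)
            self.priorities.append(max_prio)
        else:
            self.buffer[self.pos] = transition
            self.priorities[self.pos] = max_prio
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size):
        # Sample indices with probability proportional to priority**alpha.
        weights = [p ** self.alpha for p in self.priorities]
        indices = random.choices(range(len(self.buffer)), weights=weights, k=batch_size)
        return indices, [self.buffer[i] for i in indices]

    def update_priorities(self, indices, td_errors):
        # After a learning step, refresh priorities from the new TD errors.
        for i, err in zip(indices, td_errors):
            self.priorities[i] = abs(err) + self.eps

In a multi-actor setup of the kind the abstract describes, each parallel actor would presumably push transitions into one shared buffer of this sort while a single learner samples batches and refreshes priorities from the resulting TD errors.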
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science
Cited by 10 articles.