A Novel Behavioral Strategy for RoboCode Platform Based on Deep Q-Learning-Reference-Cited by-同舟云学术

A Novel Behavioral Strategy for RoboCode Platform Based on Deep Q-Learning

Published:2021-07-16 Issue: Volume:2021 Page:1-14
ISSN:1099-0526
Container-title:Complexity
language:en
Short-container-title:Complexity

Author:

Kayakoku Hakan¹^ORCID,Guzel Mehmet Serdar²^ORCID,Bostanci Erkan³^ORCID,Medeni Ihsan Tolga⁴^ORCID,Mishra Deepti⁵^ORCID

Affiliation:

1. Aselsan Company, Ankara, Turkey

2. Robotics Laboratory, Computer Engineering Department, Ankara University, Ankara, Turkey

3. SAAT Laboratory, Computer Engineering Department, Ankara University, Ankara, Turkey

4. Ankara Yildirim Beyazit University (AYBU), Ankara, Turkey

5. Department of Computer Science (IDI), NTNU-Norwegian University of Science and Technology, Gjøvik, Norway

Abstract

This paper addresses a new machine learning-based behavioral strategy using the deep Q-learning algorithm for the RoboCode simulation platform. According to this strategy, a new model is proposed for the RoboCode platform, providing an environment for simulated robots that can be programmed to battle against other robots. Compared to Atari Games, RoboCode has a fairly wide set of actions and situations. Due to the challenges of training a CNN model for such a continuous action space problem, the inputs obtained from the simulation environment were generated dynamically, and the proposed model was trained by using these inputs. The trained model battled against the predefined rival robots of the environment (standard robots) by cumulatively benefiting from the experience of these robots. The comparison between the proposed model and standard robots of RoboCode Platform was statistically verified. Finally, the performance of the proposed model was compared with machine learning based-customized robots (community robots). Experimental results reveal that the proposed model is mostly superior to community robots. Therefore, the deep Q-learning-based model has proven to be successful in such a complex simulation environment. It should also be noted that this new model facilitates simulation performance in adaptive and partially cluttered environments.

Publisher

Hindawi Limited

Subject

Multidisciplinary,General Computer Science

Link

http://downloads.hindawi.com/journals/complexity/2021/9963018.pdf

Reference35 articles.

1. Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment

2. Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

3. Goals and Habits in the Brain

4. Approximate Q-Learning: An Introduction

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping;IEEE Transactions on Industrial Electronics;2024-06

2. Enhancing Stability and Performance in Mobile Robot Path Planning with PMR-Dueling DQN Algorithm;Sensors;2024-02-27

3. A Low-Cost Q-Learning-Based Approach to Handle Continuous Space Problems for Decentralized Multi-Agent Robot Navigation in Cluttered Environments;IEEE Access;2022