Advanced Double Layered Multi-Agent Systems Based on A3C in Real-Time Path Planning-Reference-Cited by-同舟云学术

Advanced Double Layered Multi-Agent Systems Based on A3C in Real-Time Path Planning

Published:2021-11-12 Issue:22 Volume:10 Page:2762
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Lee Dajeong,Kim Junoh,Cho Kyungeun^ORCID,Sung Yunsick^ORCID

Abstract

In this paper, we propose an advanced double layered multi-agent system to reduce learning time, expressing a state space using a 2D grid. This system is based on asynchronous advantage actor-critic systems (A3C) and reduces the state space that agents need to consider by hierarchically expressing a 2D grid space and determining actions. Specifically, the state space is expressed in the upper and lower layers. Based on the learning results using A3C in the lower layer, the upper layer makes decisions without additional learning, and accordingly, the total learning time can be reduced. Our method was verified experimentally using a virtual autonomous surface vehicle simulator. It reduced the learning time required to reach a 90% goal achievement rate by 7.1% compared to the conventional double layered A3C. In addition, the goal achievement by the proposed method was 18.86% higher than that of the traditional double layered A3C over 20,000 learning episodes.

Funder

Agency for Defense Development

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/22/2762/pdf

Reference22 articles.

1. An introduction to deep reinforcement learning;François-Lavet;arXiv,2018

2. Development of an Automated Camera-Based Drone Landing System

3. Advanced Camera Image Cropping Approach for CNN-Based End-to-End Controls on Sustainable Computing

4. A comprehensive overview of feature representation for biometric recognition

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimizing Port Multi-AGV Trajectory Planning through Priority Coordination: Enhancing Efficiency and Safety;Axioms;2023-09-21

2. Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning;Computational Intelligence and Neuroscience;2023-02-15

3. Cooperative Following of Multiple Autonomous Robots Based on Consensus Estimation;Electronics;2022-10-14