Genetic-Algorithm-Aided Deep Reinforcement Learning for Multi-Agent Drone Delivery-Reference-Cited by-同舟云学术

Genetic-Algorithm-Aided Deep Reinforcement Learning for Multi-Agent Drone Delivery

Published:2024-02-20 Issue:3 Volume:8 Page:71
ISSN:2504-446X
Container-title:Drones
language:en
Short-container-title:Drones

Author:

Tarhan Farabi Ahmed¹^ORCID,Ure Nazım Kemal²^ORCID

Affiliation:

1. Department of Aeronautics Engineering, Istanbul Technical University, ITU Ayazaga Campus, Istanbul 34469, Turkey

2. Artificial Intelligence and Data Science Application and Research Center, Istanbul Technical University, ITU Ayazaga Campus, Istanbul 34469, Turkey

Abstract

The popularity of commercial unmanned aerial vehicles has drawn great attention from the e-commerce industry due to their suitability for last-mile delivery. However, the organization of multiple aerial vehicles efficiently for delivery within limitations and uncertainties is still a problem. The main challenge of planning is scalability, since the planning space grows exponentially to the number of agents, and it is not efficient to let human-level supervisors structure the problem for large-scale settings. Algorithms based on Deep Q-Networks had unprecedented success in solving decision-making problems. Extension of these algorithms to multi-agent problems is limited due to scalability issues. This work proposes an approach that improves the performance of Deep Q-Networks on multi-agent delivery by drone problems by utilizing state decompositions for lowering the problem complexity, Curriculum Learning for handling the exploration complexity, and Genetic Algorithms for searching efficient packet-drone matching across the combinatorial solution space. The performance of the proposed method is shown in a multi-agent delivery by drone problem that has 10 agents and ≈1077 state–action pairs. Comparative simulation results are provided to demonstrate the merit of the proposed method. The proposed Genetic-Algorithm-aided multi-agent DRL outperformed the rest in terms of scalability and convergent behavior.

Funder

Bilimsel Araştırma Projeleri Birimi, İstanbul Teknik Üniversitesi

Publisher

MDPI AG

Link

https://www.mdpi.com/2504-446X/8/3/71/pdf

Reference62 articles.

1. Jonas, A., Shanker, R., Liwag, K., Sharpe, M., and Kovanis, B. (2023, November 22). eVTOL/Urban Air Mobility TAM Update: A Slow Take-Off, However, Sky’s the Limit. Available online: https://advisor.morganstanley.com/the-busot-group/documents/field/b/bu/busot-group/Electric%20Vehicles.pdf.

2. Srinivasan, D., and Jain, L.C. (2010). Innovations in Multi-Agent Systems and Applications-1, Springer.

3. Hessel, M., Modayil, J., van Hasselt, H., Schaul, T., Ostrovski, G., Dabney, W., Horgan, D., Piot, B., Azar, M., and Silver, D. (2017). Rainbow: Combining Improvements in Deep Reinforcement Learning. arXiv.

4. Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., and Riedmiller, M. (2013). Playing Atari with Deep Reinforcement Learning. arXiv.

5. Zhang, K., Yang, Z., and Başar, T. (2019). Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms. arXiv.