Empirical analysis of PGA-MAP-Elites for Neuroevolution in Uncertain Domains-Reference-Cited by-同舟云学术

Empirical analysis of PGA-MAP-Elites for Neuroevolution in Uncertain Domains

Published:2023-03-29 Issue:1 Volume:3 Page:1-32
ISSN:2688-299X
Container-title:ACM Transactions on Evolutionary Learning and Optimization
language:en
Short-container-title:ACM Trans. Evol. Learn. Optim.

Author:

Flageat Manon¹^ORCID,Chalumeau Félix¹^ORCID,Cully Antoine¹^ORCID

Affiliation:

1. Imperial College London, London, UK

Abstract

Quality-Diversity algorithms, among which are the Multi-dimensional Archive of Phenotypic Elites (MAP-Elites), have emerged as powerful alternatives to performance-only optimisation approaches as they enable generating collections of diverse and high-performing solutions to an optimisation problem. However, they are often limited to low-dimensional search spaces and deterministic environments. The recently introduced Policy Gradient Assisted MAP-Elites (PGA-MAP-Elites) algorithm overcomes this limitation by pairing the traditional Genetic operator of MAP-Elites with a gradient-based operator inspired by deep reinforcement learning. This new operator guides mutations toward high-performing solutions using policy gradients (PG). In this work, we propose an in-depth study of PGA-MAP-Elites. We demonstrate the benefits of PG on the performance of the algorithm and the reproducibility of the generated solutions when considering uncertain domains. We firstly prove that PGA-MAP-Elites is highly performant in both deterministic and uncertain high-dimensional environments, decorrelating the two challenges it tackles. Secondly, we show that in addition to outperforming all the considered baselines, the collections of solutions generated by PGA-MAP-Elites are highly reproducible in uncertain environments, approaching the reproducibility of solutions found by Quality-Diversity approaches built specifically for uncertain applications. Finally, we propose an ablation and in-depth analysis of the dynamic of the PG-based variation. We demonstrate that the PG variation operator is determinant to guarantee the performance of PGA-MAP-Elites but is only essential during the early stage of the process, where it finds high-performing regions of the search space.

Funder

Engineering and Physical Sciences Research Council

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3577203

Reference37 articles.

1. Deep reinforcement learning: A brief survey;Arulkumaran Kai;IEEE Signal Processing Magazine,2017

2. OpenAI gym;Brockman Greg;CoRR,2016

3. Konstantinos Chatzilygeroudis, Antoine Cully, Vassilis Vassiliades, and Jean-Baptiste Mouret. 2021. Quality-diversity optimization: A novel branch of stochastic optimization. In Black Box Optimization, Machine Learning, and No-Free Lunch Theorems. Springer, 109–135.

4. Cédric Colas, Vashisht Madhavan, Joost Huizinga, and Jeff Clune. 2020. Scaling MAP-Elites to deep neuroevolution. In Proceedings of the 2020 Genetic and Evolutionary Computation Conference (GECCO’20). ACM, New York, NY, 67–75. DOI:10.1145/3377930.3390217; Implementation: https://github.com/uber-research/Map-Elites-Evolutionary.

5. PyBullet, a Python Module for Physics Simulation for Games, Robotics and Machine Learning;Coumans Erwin;http://pybullet.org,2019

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing MAP-Elites with Multiple Parallel Evolution Strategies;Proceedings of the Genetic and Evolutionary Computation Conference;2024-07-14

2. Neuroevolutionary diversity policy search for multi-objective reinforcement learning;Information Sciences;2024-02

3. Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing;IEEE Robotics and Automation Letters;2023-10

4. Benchmark Tasks for Quality-Diversity Applied to Uncertain Domains;Proceedings of the Companion Conference on Genetic and Evolutionary Computation;2023-07-15

5. Efficient Quality-Diversity Optimization through Diverse Quality Species;Proceedings of the Companion Conference on Genetic and Evolutionary Computation;2023-07-15