Below is a list of all relevant publications authored by Robotics Forum members.

Citation

BibTex format

@article{Flageat:2023:10.1145/3577203,
author = {Flageat, M and Chalumeau, F and Cully, A},
doi = {10.1145/3577203},
journal = {ACM Transactions on Evolutionary Learning and Optimization},
pages = {1--32},
title = {Empirical analysis of PGA-MAP-Elites for neuroevolution in uncertain domains},
url = {http://dx.doi.org/10.1145/3577203},
volume = {3},
year = {2023}
}

RIS format (EndNote, RefMan)

TY  - JOUR
AB - Quality-Diversity algorithms, among which MAP-Elites, have emerged as powerful alternatives to performance-only optimisation approaches as they enable generating collections of diverse and high-performing solutions to an optimisation problem. However, they are often limited to low-dimensional search spaces and deterministic environments. The recently introduced Policy Gradient Assisted MAP-Elites (PGA-MAP-Elites) algorithm overcomes this limitation by pairing the traditional Genetic operator of MAP-Elites with a gradient-based operator inspired by Deep Reinforcement Learning. This new operator guides mutations toward high-performing solutions using policy-gradients. In this work, we propose an in-depth study of PGA-MAP-Elites. We demonstrate the benefits of policy-gradients on the performance of the algorithm and the reproducibility of the generated solutions when considering uncertain domains. We first prove that PGA-MAP-Elites is highly performant in both deterministic and uncertain high-dimensional environments, decorrelating the two challenges it tackles. Secondly, we show that in addition to outperforming all the considered baselines, the collections of solutions generated by PGA-MAP-Elites are highly reproducible in uncertain environments, approaching the reproducibility of solutions found by Quality-Diversity approaches built specifically for uncertain applications. Finally, we propose an ablation and in-depth analysis of the dynamic of the policy-gradients-based variation. We demonstrate that the policy-gradient variation operator is determinant to guarantee the performance of PGA-MAP-Elites but is only essential during the early stage of the process, where it finds high-performing regions of the search space.
AU - Flageat,M
AU - Chalumeau,F
AU - Cully,A
DO - 10.1145/3577203
EP - 32
PY - 2023///
SN - 2688-299X
SP - 1
TI - Empirical analysis of PGA-MAP-Elites for neuroevolution in uncertain domains
T2 - ACM Transactions on Evolutionary Learning and Optimization
UR - http://dx.doi.org/10.1145/3577203
UR - https://dl.acm.org/doi/10.1145/3577203
UR - http://hdl.handle.net/10044/1/102840
VL - 3
ER -

Robotics Forum Annual Report

Join our mailing list - sharing robotics-related activities at Imperial. 

Contact Us

Robotics Forum

For all enquiries, please contact our Forum Manager,  Dr Ana Cruz Ruiz

robotics-manager@imperial.ac.uk