BibTeX format
@inproceedings{Allard:2022:10.1145/3512290.3528751,
author = {Allard, M and Smith, Bize S and Chatzilygeroudis, K and Cully, A},
doi = {10.1145/3512290.3528751},
publisher = {ACM},
title = {Hierarchical Quality-Diversity For Online Damage Recovery},
url = {http://dx.doi.org/10.1145/3512290.3528751},
year = {2022}
}
RIS format (EndNote, RefMan)
TY - CPAPER
AB - Adaptation capabilities, like damage recovery, are crucial for the deployment of robots in complex environments. Several works have demonstrated that using repertoires of pre-trained skills can enable robots to adapt to unforeseen mechanical damages in a few minutes. These adaptation capabilities are directly linked to the behavioural diversity in the repertoire. The more alternatives the robot has to execute a skill, the better are the chances that it can adapt to a new situation. However, solving complex tasks, like maze navigation, usually requires multiple different skills. Finding a large behavioural diversity for these multiple skills often leads to an intractable exponential growth of the number of required solutions. In this paper, we introduce the Hierarchical Trial and Error algorithm, which uses a hierarchical behavioural repertoire to learn diverse skills and leverages them to make the robot more adaptive to different situations. We show that the hierarchical decomposition of skills enables the robot to learn more complex behaviours while keeping the learning of the repertoire tractable. The experiments with a hexapod robot show that our method solves maze navigation tasks with 20% less actions in the most challenging scenarios than the best baseline while having 57% less complete failures.
AU - Allard,M
AU - Smith,Bize S
AU - Chatzilygeroudis,K
AU - Cully,A
DO - 10.1145/3512290.3528751
PB - ACM
PY - 2022///
TI - Hierarchical Quality-Diversity For Online Damage Recovery
UR - http://dx.doi.org/10.1145/3512290.3528751
UR - http://hdl.handle.net/10044/1/96343
ER -