ECoDe: A Sample-Efficient Method for Co-Design of Robotic Agents-Reference-Cited by-同舟云学术

ECoDe: A Sample-Efficient Method for Co-Design of Robotic Agents

Published:2024-06-10 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Nagiredla Kishan Reddy¹,V Arun Kumar A¹,Karimpanal Thommen George¹,Semage Buddhika Laknath¹,Rana Santu¹

Affiliation:

1. Deakin University

Abstract

Co-design involves simultaneously optimizing thecontroller and the agent’s physical design. Its inherent bi-level optimization formulation necessitates an outer loop designoptimization driven by an inner loop control optimization. Thiscan be challenging when the design space is large and eachdesign evaluation involves a data-intensive reinforcement learningprocess for control optimization. To improve sample efficiencywe propose a multi-fidelity-based design exploration strategy inwhich we tie the controllers learned across the design spacesthrough a universal policy learner for warm-starting subsequentcontroller learning problems. Experiments performed on a widerange of agent design problems demonstrate the superiority ofour method compared to the baselines. Additionally, analysisof the optimized designs shows interesting design alterationsincluding design simplifications and non-intuitive alterations thathave emerged in the biological world.

Publisher

Springer Science and Business Media LLC

Reference54 articles.

1. Elshawi, Radwa and Maher, Mohamed and Sakr, Sherif Automated {M}achine {L}earning: State-of-the-{A}rt and {O}pen {C}hallenges. arXiv preprint arXiv:1906.02287, 2019

2. Audet, Charles and Hare, Warren Derivative-free and blackbox optimization. Springer, 2017, 2

3. Lorenzo, Pablo Ribalta and Nalepa, Jakub and Kawulok, Michal and Ramos, Luciano Sanchez and Pastor, Jos{\'e} Ranilla Particle {S}warm {O}ptimization for {H}yper-parameter {s}election in {D}eep {N}eural {N}etworks. 2017, Proceedings of the genetic and evolutionary computation conference

4. Schulman, John and Levine, Sergey and Abbeel, Pieter and Jordan, Michael and Moritz, Philipp (2015) Trust region policy optimization. PMLR, International conference on machine learning

5. Schulman, John and Wolski, Filip and Dhariwal, Prafulla and Radford, Alec and Klimov, Oleg (2017) Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347