Soft DAgger: Sample-Efficient Imitation Learning for Control of Soft Robots

Author:

Nazeer Muhammad Sunny12ORCID,Laschi Cecilia3ORCID,Falotico Egidio12ORCID

Affiliation:

1. The BioRobotics Institute, Scuola Superiore Sant’Anna, 56025 Pontedera, Italy

2. Department of Excellence in Robotics and AI, Scuola Superiore Sant’Anna, 56125 Pisa, Italy

3. Department of Mechanical Engineering, National University of Singapore, Singapore 117575, Singapore

Abstract

This paper presents Soft DAgger, an efficient imitation learning-based approach for training control solutions for soft robots. To demonstrate the effectiveness of the proposed algorithm, we implement it on a two-module soft robotic arm involved in the task of writing letters in 3D space. Soft DAgger uses a dynamic behavioral map of the soft robot, which maps the robot’s task space to its actuation space. The map acts as a teacher and is responsible for predicting the optimal actions for the soft robot based on its previous state action history, expert demonstrations, and current position. This algorithm achieves generalization ability without depending on costly exploration techniques or reinforcement learning-based synthetic agents. We propose two variants of the control algorithm and demonstrate that good generalization capabilities and improved task reproducibility can be achieved, along with a consistent decrease in the optimization time and samples. Overall, Soft DAgger provides a practical control solution to perform complex tasks in fewer samples with soft robots. To the best of our knowledge, our study is an initial exploration of imitation learning with online optimization for soft robot control.

Funder

European Union’s Horizon 2020 Research and Innovation Programme

PROBOSCIS

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Cited by 7 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Augmenting Compliance With Motion Generation Through Imitation Learning Using Drop-Stitch Reinforced Inflatable Robot Arm With Rigid Joints;IEEE Robotics and Automation Letters;2024-10

2. Distributed Connectivity-Maintenance Control of a Team of Unmanned Aerial Vehicles Using Supervised Deep Learning;2024 10th International Conference on Control, Automation and Robotics (ICCAR);2024-04-27

3. An Imitation Learning-Based Approach for Enabling Plant-Like Tropic Abilities in a Redundant Soft Continuum Arm;2024 10th International Conference on Control, Automation and Robotics (ICCAR);2024-04-27

4. Continual Policy Distillation of Reinforcement Learning-based Controllers for Soft Robotic In-Hand Manipulation;2024 IEEE 7th International Conference on Soft Robotics (RoboSoft);2024-04-14

5. Mitigating Stochasticity in Visual Servoing of Soft Robots Using Data-Driven Generative Models;2024 IEEE 7th International Conference on Soft Robotics (RoboSoft);2024-04-14

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3