Learning a simulation-based visual policy for real-world peg in unseen holes

Author:

Xie Liang1ORCID,Yu Hongxiang1ORCID,Xu Kechun1,Yang Tong1,Wang Minhang2ORCID,Lu Haojian1,Xiong Rong1ORCID,Wang Yue1ORCID

Affiliation:

1. College of Control Science and Engineering, Zhejiang University 1 , Zhejiang, China

2. The Application Innovate Lab, Huawei Incorporated Company 2 , Shenzhen, China

Abstract

This paper proposes a learning-based visual peg-in-hole that enables training with several shapes in simulation and adapting to arbitrary unseen shapes in the real world with minimal sim-to-real cost. The core idea is to decouple the generalization of the sensory-motor policy from the design of a fast-adaptable perception module and a simulated generic policy module. The framework consists of a segmentation network (SN), a virtual sensor network (VSN), and a controller network (CN). Concretely, the VSN is trained to measure the pose of the unseen shape from a segmented image. After that, given the shape-agnostic pose measurement, the CN is trained to achieve a generic peg-in-hole. Finally, when applying to real unseen holes, we only have to fine-tune the SN required by the simulated VSN + CN. To further minimize the transfer cost, we propose to automatically collect and annotate the data for the SN after one-minute human teaching. Simulated and real-world results are presented under the configuration of eye-to/in-hand. An electric vehicle charging system with the proposed policy inside achieves a 10/10 success rate in 2–3 s, using only hundreds of auto-labeled samples for the SN transfer.

Funder

National Natural Science Foundation of China

Zhejiang Provincial Natural Science Foundation of China

Fundamental Research Funds for Science and Technology and Space Intelligent Control Laboratory

Publisher

AIP Publishing

Subject

Instrumentation

Reference55 articles.

1. Compare contact model-based control and contact model-free learning: A survey of robotic peg-in-hole assembly strategies,2019

2. Offline meta-reinforcement learning for industrial insertion,2021

3. Robust multi-modal policies for industrial assembly via reinforcement learning and demonstrations: A large-scale study;Shell;Robotics: Science and Systems 2021, held virtually, held virtually 12–16, July 2021,2020

4. Fast robust peg-in-hole insertion with continuous visual servoing,;Haugaard;4th Conference on Robot Learning, CoRL 2020, 16–18 November 2020, Cambridge, MA, Proceedings of Machine Learning Research

5. Deep reinforcement learning for high precision assembly tasks,2017

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3