Abstract
Robotic systems are usually controlled to repetitively perform specific actions in manufacturing tasks. Traditional control methods are domain-dependent and model-dependent, and they demand considerable human effort. They cannot meet the new requirements for generality and flexibility in areas such as intelligent manufacturing and customized production. This paper develops a general model-free approach that enables robots to perform multi-step object sorting tasks through deep reinforcement learning. Taking projected heightmap images from different time steps as input, without extra high-level image analysis or understanding, critic models are designed to produce a pixel-wise Q value map for each type of action. This is a new attempt to apply pixel-wise Q value-based critic networks to multi-step sorting tasks that involve many types of actions and complex action constraints. Experimental validation on simulated and real-world object sorting tasks demonstrates the effectiveness of the proposed approach. Qualitative results (videos), code for simulated and real-world experiments, and pre-trained models are available at https://github.com/JiatongBao/DRLSorting
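As a rough illustration of how a pixel-wise Q value map per action type can drive action selection, the sketch below picks the action type and pixel with the highest Q value among the action types currently allowed. It is a minimal sketch rather than the paper's implementation: the action names (`push`, `grasp`, `place`), the `allowed` mask standing in for action constraints, and the function name `select_action` are all hypothetical, and the tiny 2x2 maps stand in for network outputs over a heightmap.

```python
from typing import Dict, List, Tuple

QMap = List[List[float]]  # one Q value per heightmap pixel


def select_action(
    q_maps: Dict[str, QMap], allowed: Dict[str, bool]
) -> Tuple[str, int, int, float]:
    """Return (action_type, row, col, q_value) with the highest Q value
    over all pixels of all allowed action types."""
    best = None
    for action, q_map in q_maps.items():
        if not allowed.get(action, False):
            continue  # hypothetical action constraint: skip disallowed types
        for r, row in enumerate(q_map):
            for c, q in enumerate(row):
                if best is None or q > best[3]:
                    best = (action, r, c, q)
    if best is None:
        raise ValueError("no action type is allowed in this state")
    return best


# Toy Q maps (assumed values); a critic network would produce these
# from the projected heightmap images.
q_maps = {
    "push": [[0.1, 0.4], [0.2, 0.3]],
    "grasp": [[0.0, 0.9], [0.5, 0.1]],
    "place": [[0.7, 0.2], [0.1, 0.6]],
}
allowed = {"push": True, "grasp": False, "place": True}
choice = select_action(q_maps, allowed)
```

With `grasp` masked out, the selection falls to the best remaining pixel across the `push` and `place` maps; the chosen pixel would then be mapped back to a workspace position for execution.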
Publisher
Cambridge University Press (CUP)
Subject
Computer Science Applications, General Mathematics, Software, Control and Systems Engineering, Control and Optimization, Mechanical Engineering, Modeling and Simulation
Cited by
7 articles.