Learn multi-step object sorting tasks through deep reinforcement learning-Reference-Cited by-同舟云学术

Learn multi-step object sorting tasks through deep reinforcement learning

Published:2022-05-06 Issue:11 Volume:40 Page:3878-3894
ISSN:0263-5747
Container-title:Robotica
language:en
Short-container-title:Robotica

Author:

Bao Jiatong^ORCID,Zhang Guoqing,Peng Yi,Shao Zhiyu,Song Aiguo

Abstract

AbstractRobotic systems are usually controlled to repetitively perform specific actions for manufacturing tasks. The traditional control methods are domain-dependent and model-dependent with cost of much human efforts. They cannot meet the new requirements of generality and flexibility in many areas such as intelligent manufacturing and customized production. This paper develops a general model-free approach to enable robots to perform multi-step object sorting tasks through deep reinforcement learning. Taking projected heightmap images from different time steps as input without extra high-level image analysis and understanding, critic models are designed to produce a pixel-wise Q value map for each type of action. It is a new trial to apply pixel-wise Q value-based critic networks to solve multi-step sorting tasks that involve many types of actions and complex action constraints. The experimental validations on simulated and realistic object sorting tasks demonstrate the effectiveness of the proposed approach. Qualitative results (videos), code for simulated and realistic experiments, and pre-trained models are available at https://github.com/JiatongBao/DRLSorting

Publisher

Cambridge University Press (CUP)

Subject

Computer Science Applications,General Mathematics,Software,Control and Systems Engineering,Control and Optimization,Mechanical Engineering,Modeling and Simulation

Reference32 articles.

1. [15] Chen, Y. , Ju, Z. and Yang, C. , “Combining Reinforcement Learning and Rule-Based Method to Manipulate Objects in Clutter,” In: International Joint Conference on Neural Networks, Glassgow, UK (2020) pp. 1–6.

2. Modelling reversible execution of robotic assembly

3. A visual grasping strategy for improving assembly efficiency based on deep reinforcement learning;Wang;J. Sens,2021

4. Deep reinforcement learning based moving object grasping

5. [28] Deng, J. , Dong, W. , Socher, R. , Li, L.-J. , Li, K. and Fei-Fei, L. , “Imagenet: A Large-Scale Hierarchical Image Database,” In: IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA (2009) pp. 248–255.

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Self-supervised Learning for Joint Pushing and Grasping Policies in Highly Cluttered Environments;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

2. Learning vision-based robotic manipulation tasks sequentially in offline reinforcement learning settings;Robotica;2024-05-02

3. Advancements in Deep Reinforcement Learning and Inverse Reinforcement Learning for Robotic Manipulation: Toward Trustworthy, Interpretable, and Explainable Artificial Intelligence;IEEE Access;2024

4. Deep reinforcement learning with light-weight vision model for sequential robotic object sorting;Journal of King Saud University - Computer and Information Sciences;2024-01

5. Application of Reinforcement Learning to UR10 Positioning for Prioritized Multi-Step Inspection in NVIDIA Omniverse;2023 IEEE Symposium on Industrial Electronics & Applications (ISIEA);2023-07-15