Prehensile and Non-Prehensile Robotic Pick-and-Place of Objects in Clutter Using Deep Reinforcement Learning

Published: 2023-01-29 Issue: 3 Volume: 23 Page: 1513
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Imtiaz Muhammad Babar¹, Qiao Yuansong¹, Lee Brian¹

Affiliation:

1. Software Research Institute, Technological University of the Shannon: Midland Midwest, N37 HD68 Athlone, Ireland

Abstract

In this study, we develop a framework for intelligent, self-supervised industrial pick-and-place operation in cluttered environments. Our goal is for the agent to learn both prehensile and non-prehensile robotic manipulations that improve the efficiency and throughput of the pick-and-place task. To this end, we formulate the problem as a Markov decision process (MDP) and deploy a model-free, temporal-difference deep reinforcement learning (RL) algorithm known as the deep Q-network (DQN). Our MDP has three actions: ‘grasping’ from the prehensile manipulation category, and ‘left-slide’ and ‘right-slide’ from the non-prehensile manipulation category. Our DQN is composed of three fully convolutional networks (FCNs) based on the memory-efficient DenseNet-121 architecture, which are trained together without causing any bottlenecks. Each FCN corresponds to one discrete action and outputs a pixel-wise map of affordances for that action. Rewards are allocated after every forward pass, and backpropagation is carried out to tune the weights of the corresponding FCN. In this manner, the agent learns non-prehensile manipulations that can lead to successful prehensile manipulations in the near future and vice versa, thus increasing the efficiency and throughput of the pick-and-place task. The Results section compares the performance of our approach with a baseline deep learning approach and a ResNet architecture-based approach, and reports very promising test results at varying clutter densities across a range of complex scenario test cases.
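
The action-selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each of the three FCNs has already produced a pixel-wise affordance (Q-value) map, that the agent acts greedily on the largest value across all maps, and that training uses the standard one-step Q-learning target. The function names, the example map values, and the discount factor `gamma` are hypothetical; the abstract does not state them.

```python
from typing import Dict, List, Tuple

# The three discrete actions named in the abstract.
ACTIONS = ("grasp", "left-slide", "right-slide")

# A pixel-wise affordance (Q-value) map, H rows by W columns.
QMap = List[List[float]]


def select_action(q_maps: Dict[str, QMap]) -> Tuple[str, Tuple[int, int], float]:
    """Greedy choice: the (action, pixel) pair with the largest Q-value over all maps."""
    best = None
    for action in ACTIONS:
        for row_idx, row in enumerate(q_maps[action]):
            for col_idx, q in enumerate(row):
                if best is None or q > best[2]:
                    best = (action, (row_idx, col_idx), q)
    if best is None:
        raise ValueError("all affordance maps are empty")
    return best


def td_target(reward: float, next_q_maps: Dict[str, QMap], gamma: float = 0.5) -> float:
    """One-step Q-learning target: r + gamma * max over actions and pixels of Q(s', .).

    gamma=0.5 is an assumed placeholder, not a value taken from the paper.
    """
    _, _, max_next_q = select_action(next_q_maps)
    return reward + gamma * max_next_q


# Toy 2x2 maps standing in for the outputs of the three FCNs (made-up numbers).
example_maps = {
    "grasp": [[0.1, 0.2], [0.3, 0.4]],
    "left-slide": [[0.0, 0.9], [0.1, 0.2]],
    "right-slide": [[0.5, 0.5], [0.5, 0.5]],
}
chosen_action, chosen_pixel, chosen_q = select_action(example_maps)
```

In this toy case the highest value sits in the left-slide map, so a slide is chosen over a grasp. Under the scheme the abstract describes, only the FCN for the executed action would then receive the gradient from the resulting reward.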

Funder

Science Foundation Ireland

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/3/1513/pdf

References (63 articles)

1. Prehensile Manipulation Planning: Modeling, Algorithms and Implementation; Lamiraux; IEEE Trans. Robot., 2021

2. A Planning Framework for Non-Prehensile Manipulation under Clutter and Uncertainty; Dogar; Auton. Robot., 2012

3. Serra, D. (2022, December 02). Robot Control for Nonprehensile Dynamic Manipulation Tasks. Available online: https://www.researchgate.net/publication/310751102_Robot_Control_for_Nonprehensile_Dynamic_Manipulation_Tasks.

4. Weisz, J., and Allen, P.K. (2012, January 14–18). Pose error robust grasping from contact wrench space metrics. Proceedings of the 2012 IEEE International Conference on Robotics and Automation, St Paul, MN, USA.

5. Pinto, L., and Gupta, A. (2016). Learning to Push by Grasping: Using multiple tasks for effective learning. arXiv.

Cited by 8 articles.

1. Technological development and optimization of pushing and grasping functions in robot arms: A review; Measurement; 2024-09

2. PolyDexFrame: Deep Reinforcement Learning-Based Pick-and-Place of Objects in Clutter; Machines; 2024-08-11

3. Reinforcement Learning Algorithms and Applications in Healthcare and Robotics: A Comprehensive and Systematic Review; Sensors; 2024-04-11

4. Nonprehensile Manipulation for Rapid Object Spinning via Multisensory Learning from Demonstration; Sensors; 2024-01-08

5. Advancements in Deep Reinforcement Learning and Inverse Reinforcement Learning for Robotic Manipulation: Toward Trustworthy, Interpretable, and Explainable Artificial Intelligence; IEEE Access; 2024