Author:
Wu Liangdong,Chen Yurou,Li Zhengwei,Liu Zhiyong
Abstract
Intelligent manipulation of robots in an unstructured environment is an important application field of artificial intelligence, which means that robots must have the ability of autonomous cognition and decision-making. A typical example of this type of environment is a cluttered scene where objects are stacked and close together. In clutter, the target(s) may be one or more, and efficiently completing the target(s) grasping task is challenging. In this study, an efficient push-grasping method based on reinforcement learning is proposed for multiple target objects in clutter. The key point of this method is to consider the states of all the targets so that the pushing action can expand the grasping space of all targets as much as possible to achieve the minimum total number of pushing and grasping actions and then improve the efficiency of the whole system. At this point, we adopted the mask fusion of multiple targets, clearly defined the concept of graspable probability, and provided the reward mechanism of multi-target push-grasping. Experiments were conducted in both the simulation and real systems. The experimental results indicated that, compared with other methods, the proposed method performed better for multiple target objects and a single target in clutter. It is worth noting that our policy was only trained under simulation, which was then transferred to the real system without retraining or fine-tuning.
Funder
National Key Research and Development Program of China
Subject
Artificial Intelligence,Biomedical Engineering
Reference36 articles.
1. “Hindsight experience replay,”;Andrychowicz;Advances in neural information processing systems,2017
2. A probabilistic data-driven model for planar pushing
3. Data-driven grasp synthesis—a survey;Bohg;IEEE Transact. Robot.,2013
4. “Efficient optimization for autonomous robotic manipulation of natural objects,”;Boularias;Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 28,2014
5. “Learning to manipulate unknown objects in clutter by reinforcement Twenty-Ninth,”;Boularias;AAAI Conference on Artificial Intelligence,2015
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献