Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning-Reference-Cited by-同舟云学术

Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning

Published:2022-02-09 Issue:3 Volume:46 Page:483-498
ISSN:0929-5593
Container-title:Autonomous Robots
language:en
Short-container-title:Auton Robot

Author:

Shahid Asad Ali,Piga Dario,Braghin Francesco,Roveda Loris^ORCID

Abstract

AbstractThis paper presents a learning-based method that uses simulation data to learn an object manipulation task using two model-free reinforcement learning (RL) algorithms. The learning performance is compared across on-policy and off-policy algorithms: Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC). In order to accelerate the learning process, the fine-tuning procedure is proposed that demonstrates the continuous adaptation of on-policy RL to new environments, allowing the learned policy to adapt and execute the (partially) modified task. A dense reward function is designed for the task to enable an efficient learning of the agent. A grasping task involving a Franka Emika Panda manipulator is considered as the reference task to be learned. The learned control policy is demonstrated to be generalizable across multiple object geometries and initial robot/parts configurations. The approach is finally tested on a real Franka Emika Panda robot, showing the possibility to transfer the learned behavior from simulation. Experimental results show 100% of successful grasping tasks, making the proposed approach applicable to real applications.

Funder

Horizon 2020 Framework Programme

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s10514-022-10034-z.pdf

Reference64 articles.

1. Abbeel, P., Coates, A., Quigley, M., & Ng, A. Y. (2007). An application of reinforcement learning to aerobatic helicopter flight. In Advances in neural information processing systems (pp. 1–8).

2. Achiam, J. (2018). Spinning up in deep reinforcement learning. https://spinningup.openai.com/en/latest/algorithms/sac.html

3. Boularias, A., Kober, J., & Peters, J. (2011). Relative entropy inverse reinforcement learning. In Proceedings of the fourteenth international conference on artificial intelligence and statistics (pp. 182–189).

4. Chebotar, Y., Kalakrishnan, M., Yahya, A., Li, A., Schaal, S., & Levine, S. (2017). Path integral guided policy search. In 2017 IEEE international conference on robotics and automation (ICRA) (pp. 3381–3388). IEEE.

5. Cui, F., Cui, Q., & Song, Y. (2020). A survey on learning-based approaches for modeling and classification of human-machine dialog systems. IEEE Transactions on Neural Networks and Learning Systems

Cited by 34 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Gradient Descent-Based Task-Orientation Robot Control Enhanced With Gaussian Process Predictions;IEEE Robotics and Automation Letters;2024-09

2. A model-free method to learn multiple skills in parallel on modular robots;Nature Communications;2024-07-25

3. Adaptive Optimization of Hyper-Parameters for Robotic Manipulation through Evolutionary Reinforcement Learning;Journal of Intelligent & Robotic Systems;2024-07-24

4. Self-learning of Robot Arm using a Modified Soft-Actor-Critic (SAC) Approach: Complex 6D Pose Brick Grasping Test;Journal of Construction Automation and Robotics;2024-06-28

5. Transformable Quadruped Wheelchairs Capable of Autonomous Stair Ascent and Descent;Sensors;2024-06-06