A Deep Reinforcement Learning Algorithm Based on Tetanic Stimulation and Amnesic Mechanisms for Continuous Control of Multi-DOF Manipulator-Reference-Cited by-同舟云学术

A Deep Reinforcement Learning Algorithm Based on Tetanic Stimulation and Amnesic Mechanisms for Continuous Control of Multi-DOF Manipulator

Published:2021-09-29 Issue:10 Volume:10 Page:254
ISSN:2076-0825
Container-title:Actuators
language:en
Short-container-title:Actuators

Author:

Hou Yangyang,Hong Huajie^ORCID,Xu Dasheng,Zeng Zhe,Chen Yaping,Liu Zhaoyang

Abstract

Deep Reinforcement Learning (DRL) has been an active research area in view of its capability in solving large-scale control problems. Until presently, many algorithms have been developed, such as Deep Deterministic Policy Gradient (DDPG), Twin-Delayed Deep Deterministic Policy Gradient (TD3), and so on. However, the converging achievement of DRL often requires extensive collected data sets and training episodes, which is data inefficient and computing resource consuming. Motivated by the above problem, in this paper, we propose a Twin-Delayed Deep Deterministic Policy Gradient algorithm with a Rebirth Mechanism, Tetanic Stimulation and Amnesic Mechanisms (ATRTD3), for continuous control of a multi-DOF manipulator. In the training process of the proposed algorithm, the weighting parameters of the neural network are learned using Tetanic stimulation and Amnesia mechanism. The main contribution of this paper is that we show a biomimetic view to speed up the converging process by biochemical reactions generated by neurons in the biological brain during memory and forgetting. The effectiveness of the proposed algorithm is validated by a simulation example including the comparisons with previously developed DRL algorithms. The results indicate that our approach shows performance improvement in terms of convergence speed and precision.

Publisher

MDPI AG

Subject

Control and Optimization,Control and Systems Engineering

Link

https://www.mdpi.com/2076-0825/10/10/254/pdf

Reference35 articles.

1. Learning Hand-Eye Coordination for Robotic Grasping with Large-Scale Data Collection;Levine,2016

2. End-to-end training of deep visuomotor policies;Levine;J. Mach. Learn. Res.,2016

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Trajectory Planning Algorithm of Manipulator in Small Space Based on Reinforcement Learning;2023 China Automation Congress (CAC);2023-11-17