Abstract
This paper presents a performance analysis of three control system structures and approaches that combine Reinforcement Learning (RL) and Metaheuristic Algorithms (MAs) as representative optimization algorithms. In the first approach, the Gravitational Search Algorithm (GSA) is employed to initialize the parameters (weights and biases) of the Neural Networks (NNs) involved in Deep Q-Learning, replacing the traditional initialization of the NNs based on randomly generated values. In the second approach, the Grey Wolf Optimizer (GWO) algorithm is employed to train the policy NN in Policy Iteration RL-based control. In the third approach, the GWO algorithm is employed as a critic in an Actor-Critic framework, where it is used to evaluate the performance of the actor NN. The goal of this paper is to analyze all three RL-based control approaches and determine which one is the best fit for solving the proposed control optimization problem. The performance analysis is based on non-parametric statistical tests conducted on data obtained from real-time experiments on nonlinear servo system position control.
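To make the first approach concrete, the following is a minimal sketch (not the authors' implementation) of how a population-based search can select the initial weights and biases of a small Q-network in place of random initialization. The network sizes, the Bellman-error fitness function, and the simplified attraction-toward-best update standing in for GSA are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

STATE_DIM, N_ACTIONS, HIDDEN = 4, 3, 16
N_PARAMS = STATE_DIM * HIDDEN + HIDDEN + HIDDEN * N_ACTIONS + N_ACTIONS

def unpack(theta):
    """Split a flat parameter vector into (W1, b1, W2, b2)."""
    i = 0
    W1 = theta[i:i + STATE_DIM * HIDDEN].reshape(STATE_DIM, HIDDEN); i += STATE_DIM * HIDDEN
    b1 = theta[i:i + HIDDEN]; i += HIDDEN
    W2 = theta[i:i + HIDDEN * N_ACTIONS].reshape(HIDDEN, N_ACTIONS); i += HIDDEN * N_ACTIONS
    b2 = theta[i:i + N_ACTIONS]
    return W1, b1, W2, b2

def q_values(theta, states):
    """Forward pass of a tiny two-layer Q-network."""
    W1, b1, W2, b2 = unpack(theta)
    h = np.tanh(states @ W1 + b1)
    return h @ W2 + b2

def fitness(theta, batch, gamma=0.99):
    """Lower is better: mean squared Bellman error on a fixed batch."""
    s, a, r, s_next = batch
    q = q_values(theta, s)[np.arange(len(a)), a]
    target = r + gamma * q_values(theta, s_next).max(axis=1)
    return float(np.mean((q - target) ** 2))

# Toy batch of transitions (placeholder data for the sketch).
batch = (rng.normal(size=(32, STATE_DIM)),
         rng.integers(0, N_ACTIONS, size=32),
         rng.normal(size=32),
         rng.normal(size=(32, STATE_DIM)))

# Simplified population-based search over the initial parameter vector;
# the update below is a crude surrogate for GSA's gravitational attraction.
pop = rng.normal(scale=0.5, size=(20, N_PARAMS))
for _ in range(50):
    scores = np.array([fitness(p, batch) for p in pop])
    best = pop[scores.argmin()]
    pop = pop + 0.3 * (best - pop) + rng.normal(scale=0.05, size=pop.shape)

theta0 = pop[np.array([fitness(p, batch) for p in pop]).argmin()]
print("initial Bellman error after search:", fitness(theta0, batch))
# theta0 would then seed the Deep Q-Learning training loop in place of
# randomly generated initial weights and biases.
```

Under these assumptions, the searched `theta0` simply replaces the random draw that would otherwise start Deep Q-Learning; the subsequent training loop is unchanged.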