Author:
Liu Xiaojuan,Diao Jietao,Li Nan
Abstract
Abstract
As one of the most classic reinforcement learning, Q_learning has a very wide range of applications in the control filed of robot. Path planning which is the center of robot control has high computation speed requirements for algorithms, so we need to accelerate by hardware while planning path using Q_learning. In this paper, a hardware accelerator architecture based on FPGA is implemented to accelerate Q_learning while used in path planning. And we evaluate our design on Artix-7 DDR3 board, leading to less FPGA resource consumption and swifter computation speed. Comparing to Q_learning which implemented based on CPU, the speed has improved approximately 519 times, achieving the purpose of accelerating the Q_learning algorithm.
Subject
General Physics and Astronomy
Reference5 articles.
1. The Experience-Memory Q-Leaming Algorithm for Robot Path Planning in Unknown Environment;Zhao;IEEE Access,2020
2. Parallel Implementation of Reinforcement Learning Q-Learning Technique for FPGA;Da Sliva;IEEE Access,2019
3. An Efficient Hardware Implementation of Reinforcement Learning: The Q-Learning Algorithm;Spano;IEEE Access,2019
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献