Neural Network Based Reinforcement Learning Acceleration on FPGA Platforms

Published: 2017-01-11 Issue: 4 Volume: 44 Page: 68-73
ISSN: 0163-5964
Container-title: ACM SIGARCH Computer Architecture News
Language: en
Short-container-title: SIGARCH Comput. Archit. News

Author:

Su Jiang¹, Liu Jianxiong¹, Thomas David B.¹, Cheung Peter Y.K.¹

Affiliation:

1. Imperial College London

Abstract

Deep Q-learning (DQN) is a recently proposed reinforcement learning algorithm in which a neural network (NN) is applied as a non-linear approximator of the value function. The exploitation-exploration mechanism allows training and prediction of the NN to execute simultaneously in an agent during its interaction with the environment. Agents often act independently on battery power, so training and prediction must occur within the agent and on a limited power budget. In this work, we propose an FPGA acceleration system design for Neural Network Q-learning (NNQL). Our proposed system is highly flexible because it supports run-time network parameterization, which allows neuroevolution algorithms to dynamically restructure the network to achieve better learning results. Additionally, the power consumption of our proposed system adapts to the network size because of a new processing element design. In our test cases on networks with hidden layer sizes ranging from 32 to 16384, our proposed system achieves a 7x to 346x speedup over a GPU implementation and a 22x to 77x speedup over a hand-coded CPU counterpart.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3039902.3039915

References: 8 articles.

1. A. Karpathy et al. Convnetjs deep q learning demo. http://cs.stanford.edu/people/karpathy/convnetjs/.

2. A highly scalable Restricted Boltzmann Machine FPGA implementation

Cited by 26 articles.

1. A FPGA Accelerator of Distributed A3C Algorithm with Optimal Resource Deployment;IET Computers & Digital Techniques;2024-05-27

2. Dielectric Elastomer-Based Actuators: A Modeling and Control Review for Non-Experts;Actuators;2024-04-17

3. FPGA-Accelerated Sim-to-Real Control Policy Learning for Robotic Arms;IEEE Transactions on Circuits and Systems II: Express Briefs;2024-03

4. DQN Algorithm Design for Fast Efficient Shortest Path System;2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC);2023-10-31

5. A Deep Q Network Hardware Accelerator Based on Heterogeneous Computing;2023 IEEE 15th International Conference on ASIC (ASICON);2023-10-24