A Fully Pipelined FPGA Architecture of a Factored Restricted Boltzmann Machine Artificial Neural Network-Reference-Cited by-同舟云学术

A Fully Pipelined FPGA Architecture of a Factored Restricted Boltzmann Machine Artificial Neural Network

Published:2014-02 Issue:1 Volume:7 Page:1-23
ISSN:1936-7406
Container-title:ACM Transactions on Reconfigurable Technology and Systems
language:en
Short-container-title:ACM Trans. Reconfigurable Technol. Syst.

Author:

Kim Lok-Won¹,Asaad Sameh²,Linsker Ralph²

Affiliation:

1. Cisco Systems

2. IBM T. J. Watson Research Center

Abstract

Artificial neural networks (ANNs) are a natural target for hardware acceleration by FPGAs and GPGPUs because commercial-scale applications can require days to weeks to train using CPUs, and the algorithms are highly parallelizable. Previous work on FPGAs has shown how hardware parallelism can be used to accelerate a “Restricted Boltzmann Machine” (RBM) ANN algorithm, and how to distribute computation across multiple FPGAs. Here we describe a fully pipelined parallel architecture that exploits “mini-batch” training (combining many input cases to compute each set of weight updates) to further accelerate ANN training. We implement on an FPGA, for the first time to our knowledge, a more powerful variant of the basic RBM, the “Factored RBM” (fRBM). The fRBM has proved valuable in learning transformations and in discovering features that are present across multiple types of input. We obtain (in simulation) a 100-fold acceleration (vs. CPU software) for an fRBM having N = 256 units in each of its four groups (two input, one output, one intermediate group of units) running on a Virtex-6 LX760 FPGA. Many of the architectural features we implement are applicable not only to fRBMs, but to basic RBMs and other ANN algorithms more broadly.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/2539125

Reference21 articles.

1. Piecewise linear approximation applied to nonlinear function of a neural network

2. An analog neural network processor with programmable topology

3. Artificial neural networks: a review of commercial hardware

4. A Fast Learning Algorithm for Deep Belief Nets

Cited by 26 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Review of Intelligent Detection Technologies Based on 5G Networks;Proceedings of the 2023 4th International Conference on Computer Science and Management Technology;2023-10-13

2. VLSI Implementation of Neural Systems;Advances in Systems Analysis, Software Engineering, and High Performance Computing;2023-06-16

3. FPGA-based implementation of deep neural network using stochastic computing;Applied Soft Computing;2023-04

4. Ising machines as hardware solvers of combinatorial optimization problems;Nature Reviews Physics;2022-05-04

5. Logically synthesized and hardware-accelerated restricted Boltzmann machines for combinatorial optimization and integer factorization;Nature Electronics;2022-02-28