A low-latency LSTM accelerator using balanced sparsity based on FPGA-Reference-Cited by-同舟云学术

A low-latency LSTM accelerator using balanced sparsity based on FPGA

Published:2022-03 Issue: Volume:89 Page:104417
ISSN:0141-9331
Container-title:Microprocessors and Microsystems
language:en
Short-container-title:Microprocessors and Microsystems

Author:

Jiang Jingfei,Xiao Tao,Xu Jinwei,Wen Dong,Gao Lei^ORCID,Dou Yong

Funder

National Natural Science Foundation of China

National University of Defense Technology

Publisher

Elsevier BV

Subject

Artificial Intelligence,Computer Networks and Communications,Hardware and Architecture,Software

Reference21 articles.

1. A.Y. Hannun, C. Case, J. Casper, B. Catanzaro, G. Diamos, E. Elsen, R. Prenger, S. Satheesh, S. Sengupta, A. Coates, A.Y. Ng, Deep speech: Scaling up end-to-end speech recognition, CoRR abs/1412.5567. arXiv:1412.5567. URL http://arxiv.org/abs/1412.5567.

2. Deep speech 2 : End-to-end speech recognition in english and mandarin;Amodei,2016

3. Y. Wu, M. Schuster, Z. Chen, Q.V. Le, M. Norouzi, W. Macherey, M. Krikun, Y. Cao, Q. Gao, K. Macherey, J. Klingner, A. Shah, M. Johnson, X. Liu, L. Kaiser, S. Gouws, Y. Kato, T. Kudo, H. Kazawa, K. Stevens, G. Kurian, N. Patil, W. Wang, C. Young, J. Smith, J. Riesa, A. Rudnick, O. Vinyals, G. Corrado, M. Hughes, J. Dean, Google’s neural machine translation system: Bridging the gap between human and machine translation, CoRR abs/1609.08144. arXiv:1609.08144. URL http://arxiv.org/abs/1609.08144.

4. Long Short-Term Memory, Vol. 9;Hochreiter,1997

5. ESE: efficient speech recognition engine with sparse LSTM on FPGA;Han,2017

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Mixture-of-Rookies: Saving DNN computations by predicting ReLU outputs;Microprocessors and Microsystems;2024-09

2. Algorithm and hardware co-design co-optimization framework for LSTM accelerator using quantized fully decomposed tensor train;Internet of Things;2023-07

3. An Instruction-Driven Batch-Based High-Performance Resource-Efficient LSTM Accelerator on FPGA;Electronics;2023-04-05

4. DSIRBS : A Layer-wise Balanced DNN Weight Pruning Method;Proceedings of the 2023 15th International Conference on Machine Learning and Computing;2023-02-17

5. ROSETTA: A Resource and Energy-Efficient Inference Processor for Recurrent Neural Networks Based on Programmable Data Formats and Fine Activation Pruning;IEEE Transactions on Emerging Topics in Computing;2023