FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition-Reference-Cited by-同舟云学术

FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition

Published:2011 Issue: Volume:2011 Page:1-10
ISSN:1687-7195
Container-title:International Journal of Reconfigurable Computing
language:en
Short-container-title:International Journal of Reconfigurable Computing

Author:

Veitch Richard¹,Aubert Louis-Marie¹,Woods Roger¹,Fischaber Scott¹

Affiliation:

1. Electronics, Communications and Information Technology (ECIT), Queens University Belfast, Northern Ireland Science Park, Belfast BT3 9DT, UK

Abstract

A scalable large vocabulary, speaker independent speech recognition system is being developed using Hidden Markov Models (HMMs) for acoustic modeling and a Weighted Finite State Transducer (WFST) to compile sentence, word, and phoneme models. The system comprises a software backend search and an FPGA-based Gaussian calculation which are covered here. In this paper, we present an efficient pipelined design implemented both as an embedded peripheral and as a scalable, parallel hardware accelerator. Both architectures have been implemented on an Alpha Data XRC-5T1, reconfigurable computer housing a Virtex 5 SX95T FPGA. The core has been tested and is capable of calculating a full set of Gaussian results from 3825 acoustic models in 9.03 ms which coupled with a backend search of 5000 words has provided an accuracy of over 80%. Parallel implementations have been designed with up to 32 cores and have been successfully implemented with a clock frequency of 133 MHz.

Funder

Engineering and Physical Sciences Research Council

Publisher

Hindawi Limited

Subject

Hardware and Architecture

Link

http://downloads.hindawi.com/journals/ijrc/2011/697080.pdf

Reference8 articles.

1. Low-bitrate distributed speech recognition for packet-based and wireless communication

2. Let's hear it for audio mining

3. Mining customer care dialogs for "Daily News"

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An efficient implementation of lattice-ladder multilayer perceptrons in field programmable gate arrays;2016

2. Optimization of Weighted Finite State Transducer for Speech Recognition;IEEE Transactions on Computers;2013-08

3. FPGA-BASED IMPLEMENTATION OF LITHUANIAN ISOLATED WORD RECOGNITION ALGORITHM / LIETUVIŲ KALBOS PAVIENIŲ ŽODŽIŲ ATPAŽINIMO ALGORITMO ĮGYVENDINIMAS LAUKU PROGRAMUOJAMA LOGINE MATRICA;Mokslas - Lietuvos ateitis;2013-05-24

4. Wearable sensor-based human activity recognition from environmental background sounds;Journal of Ambient Intelligence and Humanized Computing;2012-05-26