FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition


Autoria(s): Veitch, Richard; Aubert, Louis-Marie; Woods, Roger; Fischaber, Scott
Data(s)

2011

Resumo

A scalable large vocabulary, speaker independent speech recognition system is being developed using Hidden Markov Models (HMMs) for acoustic modeling and a Weighted Finite State Transducer (WFST) to compile sentence, word, and phoneme models. The system comprises a software backend search and an FPGA-based Gaussian calculation which are covered here. In this paper, we present an efficient pipelined design implemented both as an embedded peripheral and as a scalable, parallel hardware accelerator. Both architectures have been implemented on an Alpha Data XRC-5T1, reconfigurable computer housing a Virtex 5 SX95T FPGA. The core has been tested and is capable of calculating a full set of Gaussian results from 3825 acoustic models in 9.03 ms which coupled with a backend search of 5000 words has provided an accuracy of over 80%. Parallel implementations have been designed with up to 32 cores and have been successfully implemented with a clock frequency of 133?MHz.

Formato

application/pdf

Identificador

http://pure.qub.ac.uk/portal/en/publications/fpga-implementation-of-a-pipelined-gaussian-calculation-for-hmmbased-large-vocabulary-speech-recognition(4e51c3da-18dc-4ec7-a762-384436434bda).html

http://dx.doi.org/10.1155/2011/697080

http://pure.qub.ac.uk/ws/files/821247/697080.pdf

Idioma(s)

eng

Direitos

info:eu-repo/semantics/restrictedAccess

Fonte

Veitch , R , Aubert , L-M , Woods , R & Fischaber , S 2011 , ' FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition ' International Journal of Reconfigurable Computing , vol 2011 , 697080 . DOI: 10.1155/2011/697080

Palavras-Chave #/dk/atira/pure/subjectarea/asjc/1700/1708 #Hardware and Architecture
Tipo

article