Optimization of Weighted Finite State Transducer for Speech Recognition


Autoria(s): Aubert, Louis-Marie; Woods, Roger; Fischaber, Scott; Veitch, Richard
Data(s)

01/08/2013

Resumo

There is considerable interest in creating embedded, speech recognition hardware using the weighted finite state transducer (WFST) technique but there are performance and memory usage challenges. Two system optimization techniques are presented to address this; one approach improves token propagation by removing the WFST epsilon input arcs; another one-pass, adaptive pruning algorithm gives a dramatic reduction in active nodes to be computed. Results for memory and bandwidth are given for a 5,000 word vocabulary giving a better practical performance than conventional WFST; this is then exploited in an adaptive pruning algorithm that reduces the active nodes from 30,000 down to 4,000 with only a 2 percent sacrifice in speech recognition accuracy; these optimizations lead to a more simplified design with deterministic performance.

Formato

application/pdf

Identificador

http://pure.qub.ac.uk/portal/en/publications/optimization-of-weighted-finite-state-transducer-for-speech-recognition(a6b7edc5-9328-4954-9b42-cdcad9c7d7f3).html

http://dx.doi.org/10.1109/TC.2013.51

http://pure.qub.ac.uk/ws/files/2773741/TCSI_2012_08_0557.R1_Woods.pdf

Idioma(s)

eng

Direitos

info:eu-repo/semantics/closedAccess

Fonte

Aubert , L-M , Woods , R , Fischaber , S & Veitch , R 2013 , ' Optimization of Weighted Finite State Transducer for Speech Recognition ' IEEE Transactions on Computers , vol 62 , no. 8 , pp. 1607-1615 . DOI: 10.1109/TC.2013.51

Palavras-Chave #Embedded processors #WFST #Memory organization #speech recognition
Tipo

article