Digital Library

The development of high-performance speech processing systems for low-resource languages is a challenging area. One approach to addressing the lack of resources is to make use of data from multiple languages. A popular direction in recent years is to use bottleneck features, or hybrid systems, trained on multilingual data for speech-to-text (STT) systems. This paper presents an investigation into the application of these multilingual approaches to spoken term detection. Experiments were run using the IARPA Babel limited language pack corpora (∼10 hours/language), with 4 languages for initial multilingual system development and an additional held-out target language. STT gains achieved by using multilingual bottleneck features in a Tandem configuration are shown to also apply to keyword search (KWS). Further improvements in both STT and KWS were observed by incorporating language questions into the Tandem GMM-HMM decision trees for the training set languages. Adapted hybrid systems performed slightly worse on average than the adapted Tandem systems. A language-independent acoustic model test on the target language showed that, at a minimum, retraining or adaptation of the acoustic models to the target language is currently needed to achieve reasonable performance. © 2013 IEEE.



6 results for "Multilingual" in the Cambridge University Engineering Department Publications Database


Minimum risk acoustic clustering for multilingual acoustic model combination

Automatic recognition of spontaneous speech for access to multilingual oral history archives

Multilingual large vocabulary speech recognition: the European SQALE project

Large vocabulary multilingual speech recognition using HTK

Investigation of multilingual deep neural networks for spoken term detection

Overview and results of Morpho Challenge 2009