Digital Library

2 results for Audio-Visual Automatic Speech Recognition

in the Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)

Wavelet-based cepstrum calculation

Relevance: 100.00%

Abstract:

In this paper we present a new wavelet-based algorithm for low-cost computation of the cepstrum. It can be used for precise real-time pitch determination in automatic speech and speaker recognition systems. Many wavelet families are examined to determine the one that works best. The results confirm the efficacy and accuracy of the proposed technique for pitch extraction. (C) 2008 Elsevier B.V. All rights reserved.
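
For orientation, the following is a minimal sketch of the classical DFT-based real cepstrum and a cepstral-peak pitch estimate. It is not the wavelet-based algorithm the abstract refers to, whose details are not given here. The function names, the `1e-9` log floor, and the `fmin`/`fmax` search range are assumptions made for illustration, and no analysis window is applied, which a practical system would normally add.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform (illustrative, not fast)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        # (k * t) % n keeps the phase argument small and accurate.
        s = sum(x[t] * cmath.exp(sign * 2j * math.pi * ((k * t) % n) / n)
                for t in range(n))
        out.append(s / n if inverse else s)
    return out


def real_cepstrum(frame):
    """Real cepstrum: inverse DFT of the log magnitude spectrum."""
    spectrum = dft(frame)
    # The small floor (an assumed value) avoids log(0) on empty bins.
    log_mag = [math.log(abs(c) + 1e-9) for c in spectrum]
    return [c.real for c in dft(log_mag, inverse=True)]


def estimate_pitch(frame, sample_rate, fmin=60.0, fmax=400.0):
    """Pitch in Hz from the largest cepstral peak in [1/fmax, 1/fmin]."""
    ceps = real_cepstrum(frame)
    qmin = int(sample_rate / fmax)                        # shortest period, in samples
    qmax = min(int(sample_rate / fmin), len(frame) // 2)  # longest period, in samples
    peak = max(range(qmin, qmax + 1), key=lambda q: ceps[q])
    return sample_rate / peak
```

The quefrency of the peak is the pitch period in samples, so dividing the sample rate by it gives the fundamental frequency. The two transforms dominate the cost, which is the part a low-cost cepstrum computation would aim to reduce.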

Wavelet-based dynamic time warping

Relevance: 100.00%

Abstract:

Dynamic Time Warping (DTW), a pattern matching technique traditionally used for restricted-vocabulary speech recognition, is based on a temporal alignment of the input signal with the template models. The principal drawback of DTW is its high computational cost as the lengths of the signals increase. This paper extends the results of our previously published conference paper, which introduced an optimized version of DTW that is based on the Discrete Wavelet Transform (DWT). (C) 2008 Elsevier B.V. All rights reserved.
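
To make the cost argument concrete, here is a minimal sketch of textbook DTW, whose table has one cell per pair of samples, together with one plausible way a DWT could shrink that table. The abstract does not describe the paper's actual optimization. Keeping only the Haar approximation coefficients is an assumption made for illustration, and the names `haar_approx`, `dtw_distance`, and `wavelet_dtw` are hypothetical.

```python
import math


def haar_approx(signal, levels=1):
    """Haar DWT approximation coefficients; each level halves the length."""
    out = list(signal)
    for _ in range(levels):
        if len(out) < 2:
            break
        if len(out) % 2:
            out.append(out[-1])  # pad odd lengths by repeating the last sample
        out = [(out[i] + out[i + 1]) / math.sqrt(2)
               for i in range(0, len(out), 2)]
    return out


def dtw_distance(a, b):
    """Classical DTW with absolute-difference local cost, O(len(a) * len(b))."""
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = d + min(cost[i - 1][j],      # a advances alone
                                 cost[i][j - 1],      # b advances alone
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]


def wavelet_dtw(a, b, levels=1):
    """DTW on Haar-compressed signals (assumed scheme, not the paper's)."""
    return dtw_distance(haar_approx(a, levels), haar_approx(b, levels))
```

Each decomposition level halves both signal lengths, so the DTW table shrinks by roughly a factor of four per level, at the price of a coarser alignment.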
